Transition Based Dependency Parsing with Deep Learning

Ömer Kırnap
Koç University
okirnap@ku.edu.tr

September 27, 2018

Ömer Kırnap (Koç University) MSc Thesis September 27, 2018 1 / 123

Overview

1. Introduction
   - Overview of Dependency Parsing
   - Transition Based Dependency Parsing
2. Related Work
   - Linear Models and their Drawbacks
   - Neural Network Models
3. Model
   - Language Model
   - MLP Parser
   - Tree-stack LSTM Parser
4. Results
   - MLP vs Tree-stack LSTM
   - Morphological Feature Embeddings
   - Static vs Dynamic Oracle Training
   - Transfer Learning
5. Conclusion
6. Future Work & Discussions


1 Introduction


Introduction

What is dependency parsing?

Dependency parsing aims to detect word relations by finding the tree structure of a sentence, inspired by dependency grammar.

Figure: Dependency annotations for the sentence "Economic news had little effect on financial markets." [1]

[1] Figure from S. Kübler, R. McDonald, and J. Nivre. 2009. Dependency Parsing. Morgan & Claypool, US.


Introduction

Why do we need dependency parsing?

Dependencies resolve ambiguity.

Useful for some downstream tasks in NLP.

[2] Figure from http://www.phontron.com/slides/nlp-programming-en-11-depend.pdf

Introduction

Dependency Parsing Categorization

Grammar-based: Relying on a formal grammar defining a formal language; asking whether a given input sentence is in the language defined by the grammar or not.

Data-driven: Making essential use of machine learning from linguistic data in order to parse new sentences.

[3] From S. Kübler, R. McDonald, and J. Nivre. 2009. Dependency Parsing. Morgan & Claypool, US.


Introduction

Data-driven Dependency Parsing

Graph Based Algorithms

Using maximum spanning tree algorithms from graph theory

Transition Based Algorithms

Capitalizing on greedy stack-based algorithms to build the dependency tree with incremental steps in linear time.

[4] From S. Kübler, R. McDonald, and J. Nivre. 2009. Dependency Parsing. Morgan & Claypool, US.


Introduction

Transition Based Dependency Parsing

Transition System: an abstract machine with a set of configurations (states) and transitions. We use the Arc-Hybrid transition system [Kuhlmann et al., 2011].

Configurations (σ, β, A):
• σ: a stack of tree fragments, initially empty
• β: a buffer of words, initially containing the whole sentence
• A: a set of dependency arcs (head, relation, modifier), initially empty

Transitions:
• shift(σ, b|β, A) = (σ|b, β, A)
• left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})
• right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

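The Arc-Hybrid transitions above can be sketched in a few lines. This is a minimal illustration assuming integer word indices and `(stack, buffer, arcs)` tuples; the function names and the tiny example are illustrative, not the thesis code:

```python
# Minimal sketch of the Arc-Hybrid transition system described above.
# A configuration is (stack, buffer, arcs); arcs are (head, relation, modifier).

def shift(stack, buffer, arcs):
    # shift(σ, b|β, A) = (σ|b, β, A): move the buffer front onto the stack
    return stack + [buffer[0]], buffer[1:], arcs

def left(stack, buffer, arcs, d):
    # left_d: attach stack top s as a d-dependent of the buffer front b
    s, b = stack[-1], buffer[0]
    return stack[:-1], buffer, arcs | {(b, d, s)}

def right(stack, buffer, arcs, d):
    # right_d: attach stack top t as a d-dependent of the element s below it
    t, s = stack[-1], stack[-2]
    return stack[:-1], buffer, arcs | {(s, d, t)}

# Parse a fragment "Economic news had": 0=Economic, 1=news, 2=had
config = ([], [0, 1, 2], frozenset())
config = shift(*config)        # σ=[0], β=[1,2]
config = left(*config, "ATT")  # news -ATT-> Economic
config = shift(*config)        # σ=[1], β=[2]
config = left(*config, "SBJ")  # had -SBJ-> news
print(sorted(config[2]))       # [(1, 'ATT', 0), (2, 'SBJ', 1)]
```

Each transition returns a fresh configuration, mirroring the set notation on the slide.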

An example parsing of a sentence


Problem Definition

Find a model that learns to decide the correct transition from the current state.


2 Related Work


Related Work

Neural Networks for Feature Conjunctions

Neural networks can handle feature conjunctions and nonlinearity. However, they are impractical for high-dimensional inputs: they scale linearly in input dimensions (in both time and space, assuming a fixed number of hidden units).


Related Work

Solution: Using dense embeddings for input features.
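A hedged sketch of why dense embeddings help: a huge one-hot feature vector becomes a small dense lookup. The vocabulary size and embedding dimension below are illustrative:

```python
# A dense embedding lookup is mathematically a one-hot vector times the
# embedding table, but costs O(DIM) instead of O(VOCAB).
import numpy as np

rng = np.random.default_rng(3)
VOCAB, DIM = 50_000, 50
E = rng.normal(size=(VOCAB, DIM))    # embedding table

word_id = 12345
one_hot = np.zeros(VOCAB)
one_hot[word_id] = 1.0
dense = E[word_id]                   # lookup: same result as one_hot @ E

assert np.allclose(one_hot @ E, dense)
print(one_hot.size, dense.size)      # 50000 50
```

The downstream network then sees a 50-dimensional input instead of a 50,000-dimensional one.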



3 Model


Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to Universal Dependencies:

CoNLL17
• Koç-University team with MLP Parser using Context Embeddings

CoNLL18
• KParse team with Tree-stack LSTM Parser using Context and Morph-feat Embeddings



a Language Model


Language Model (LM)

The LM is used to obtain context and word embeddings with two components:

Character Based LSTM extracts word vectors

Word Based BiLSTM extracts context vectors


Language Model - Word vectors

The character-based LSTM generates word vectors.

Figure: Character LSTM, from Kırnap et al., 2017.


Language Model - Context Vectors

The word-based BiLSTM generates context vectors.

Figure: Word BiLSTM, from Kırnap et al., 2017.
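The two LM components can be illustrated with a toy sketch. A plain tanh RNN stands in for the LSTMs here, and all dimensions, names, and the word list are illustrative, not the thesis implementation:

```python
# Toy sketch: a character-level encoder yields one word vector per word,
# and a word-level bidirectional pass yields one context vector per word.
import numpy as np

rng = np.random.default_rng(0)
H = 8  # hidden size

def rnn_step(h, x, Wh, Wx):
    return np.tanh(Wh @ h + Wx @ x)

def word_vector(word, Wh, Wx, emb):
    # Character-based encoder: run over characters, keep the last hidden state.
    h = np.zeros(H)
    for ch in word:
        h = rnn_step(h, emb[ord(ch) % 64], Wh, Wx)
    return h

def context_vectors(word_vecs, Wh_f, Wx_f, Wh_b, Wx_b):
    # Word-based bidirectional pass: concatenate forward and backward states.
    fwd, h = [], np.zeros(H)
    for v in word_vecs:
        h = rnn_step(h, v, Wh_f, Wx_f)
        fwd.append(h)
    bwd, h = [], np.zeros(H)
    for v in reversed(word_vecs):
        h = rnn_step(h, v, Wh_b, Wx_b)
        bwd.append(h)
    return [np.concatenate([f, b]) for f, b in zip(fwd, reversed(bwd))]

emb = rng.normal(size=(64, H)) * 0.1
Wh, Wx = rng.normal(size=(H, H)) * 0.1, rng.normal(size=(H, H)) * 0.1
words = ["economic", "news", "had"]
wvecs = [word_vector(w, Wh, Wx, emb) for w in words]
cvecs = context_vectors(wvecs, Wh, Wx, Wh, Wx)
print(len(cvecs), cvecs[0].shape)  # one 2H-dim context vector per word
```

Each word thus gets a context-independent word vector and a sentence-dependent context vector, matching the two roles described above.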


b MLP Parser (CoNLL17)


MLP Parser

The MLP Parser consists of 4 components:

Character Based LSTM extracts word vectors

Word Based BiLSTM extracts context vectors

Feature extractor describes current state

Decision module (MLP) decides the next transition


MLP Parser - Feature Extraction

Feature extractor describes current state

Figure: Kırnap et al., 2017.

MLP Parser - Decision Module

Decision module (MLP) decides the next transition


Experiments & Dataset (MLP), CoNLL17

CoNLL17 Dataset

Dependency parsing of 81 treebanks in 49 languages

All treebanks use standardized annotation:
• 17 universal part-of-speech tags
• 37 universal dependency relations


Experiments - Evaluation Metric

Labeled Attachment Score (LAS): the percentage of words correctly assigned both the correct syntactic head and the correct dependency label.

Figure: Example on "Economic news had": the gold tree has arcs SBJ and ATT; Prediction 1 (arcs PRED, OBJ) has LAS 0; Prediction 2 (arcs OBJ, ATT) has LAS (1/2)·100 = 50.
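The LAS metric above is easy to state in code. A hedged sketch where a parse is a dict mapping each word to its `(head, label)` arc; the names and toy parses are illustrative:

```python
# LAS: percentage of words with both the correct head and the correct label.

def las(gold, pred):
    correct = sum(1 for w, arc in gold.items() if pred.get(w) == arc)
    return 100.0 * correct / len(gold)

# Gold arcs for "Economic news had": news -ATT-> Economic, had -SBJ-> news.
gold = {"Economic": ("news", "ATT"), "news": ("had", "SBJ")}
pred1 = {"Economic": ("had", "OBJ"), "news": ("had", "PRED")}
pred2 = {"Economic": ("news", "ATT"), "news": ("had", "OBJ")}
print(las(gold, pred1), las(gold, pred2))  # 0.0 50.0
```

Prediction 2 gets the ATT arc right but mislabels the other arc, giving 50%.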


Experiments (MLP)

CoNLL 2017 Results (all treebanks, LAS)

Ranked 1st among transition-based parsers. [5]

[5] Source: CoNLL17 official results page.

Contributions in CoNLL17


Context and Word Embeddings

Relative contributions of part-of-speech (p), word vector (v), and context vector (c):

Feats   Hungarian  En-ParTUT  Latvian
p       63.6       76.6       55.9
v       73.5       75.9       63.0
c       72.2       76.0       63.5
v-c     76.0       79.0       67.6
p-c     78.0       82.5       70.6
p-v     76.6       80.8       67.7
p-fb    74.7       79.7       66.3
p-v-c   79.3       83.2       74.2



Context vectors provide an independent contribution on top of POS tags.



Our BiLSTM language model word vectors perform better than Facebook (FB) vectors (the p-fb row).



Both POS tags and context vectors have significant contributions on top of word vectors.


Issues with MLP

However

Choosing the right features of the parser state remains critical.

We are unable to represent the whole parsing history with feature extraction.


Solution

Find a recurrent architecture that can summarize the parsing history as well as the word sequences in the buffer and stack.



c Tree-stack LSTM Parser (CoNLL18)


Related Work - Stack LSTM

Figure: Stack LSTM [Dyer et al., 2015]

Represent each component (σ, β, A) with an LSTM, modifying the head word's embedding with the dependent's embedding.


Problems with Stack LSTM

They only modify the stack's word embeddings.

Hidden states of the LSTMs are not updated except on reduce transitions.

Actions are not explicitly represented

They only used word2vec embeddings [Mikolov et al 2013]


Our solution

We propose

Context embeddings should improve parsing accuracy

Dependency relations should be explicitly represented

Morphological Features of a word may enhance parsing accuracy


Tree-stack LSTM Overview

Figure: Tree-stack LSTM overview (β-LSTM, σ-LSTM, Action-LSTM, and t-RNN outputs concatenated and fed to an MLP).

We propose the Tree-stack LSTM model with 4 components:
1. β-LSTM
2. σ-LSTM
3. Action-LSTM
4. Tree-RNN


Tree-stack LSTM


Input Representation

Action and Dependency Relation Embeddings

Every action is represented with a continuous vector.

Every dependency relation is represented with a continuous vector.


Input Representation

We do not include an explicit feature extractor. We initialize the word representation by concatenating:

Character-based LSTM's word vectors

Word-based BiLSTM's context vectors

Part-of-speech (POS) vectors

Morph-feat vectors
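The concatenation above can be sketched directly. This is a hedged illustration: all dimensions and lookup tables are made up for the example, not taken from the thesis configuration:

```python
# Each word's input vector is the concatenation of its word, context,
# POS, and morph-feat embeddings.
import numpy as np

rng = np.random.default_rng(1)
word_vec = rng.normal(size=100)      # from the character LSTM
context_vec = rng.normal(size=200)   # from the word BiLSTM
pos_emb = {"NOUN": rng.normal(size=20)}
feat_emb = {"Number=Sing": rng.normal(size=20)}

x = np.concatenate([word_vec, context_vec,
                    pos_emb["NOUN"], feat_emb["Number=Sing"]])
print(x.shape)  # (340,)
```

The concatenated vector `x` is what the downstream LSTMs consume for each word.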


Input Representation

Morph-feat Vectors

Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs IT It

Figure: Morph-feat Embeddings


Tree-stack LSTM

Model Components:
1. β-LSTM
2. σ-LSTM
3. Action-LSTM
4. Tree-RNN



β-LSTM

Figure: The buffer's β-LSTM running over words w_i, w_i+1, w_i+2.



σ-LSTM

Figure: The stack's σ-LSTM running over stack elements s_i, s_i+1, s_i+2.



Action-LSTM

Figure: The Action-LSTM running over the sequence of past transitions.


How are the components of the tree-stack LSTM connected?


Tree-RNN (t-RNN)

Figure: t-RNN combining the dependent word, dependency relation, and head word embeddings.

w_head_new = tanh(W_rnn · [w_head_old ; d_l ; w_dep] + b_rnn)    (1)
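Equation (1) is a single tanh affine map over the concatenation of the old head embedding, the relation embedding, and the dependent embedding. A hedged sketch with illustrative dimensions:

```python
# t-RNN head update: w_head_new = tanh(W_rnn @ [w_head_old; d_l; w_dep] + b_rnn)
import numpy as np

rng = np.random.default_rng(2)
D, R = 8, 4                          # word / relation embedding sizes (illustrative)
W_rnn = rng.normal(size=(D, 2 * D + R)) * 0.1
b_rnn = np.zeros(D)

def trnn_update(w_head, d_rel, w_dep):
    # Concatenate the three embeddings and apply the affine map + tanh.
    return np.tanh(W_rnn @ np.concatenate([w_head, d_rel, w_dep]) + b_rnn)

w_head, w_dep = rng.normal(size=D), rng.normal(size=D)
d_rel = rng.normal(size=R)
new_head = trnn_update(w_head, d_rel, w_dep)
print(new_head.shape)  # (8,)
```

The output has the same dimensionality as a word embedding, so the updated head can flow back into the σ-LSTM or β-LSTM.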


Tree-RNN with:
1. Left Transition
2. Right Transition


Left Transition


Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Step-by-step (figures):
1. Each embedding is initialized by concatenating POS, language, and morph-feat embeddings.
2. The stack's top LSTM element is reduced.
3. The t-RNN calculates the new head embedding.
4. The β-LSTM recalculates its hidden state based on the new input.
5. The Tree-stack LSTM is ready to predict the next transition.

Right Transition


Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Step-by-step (figures):
1. Each embedding is initialized by concatenating POS, language, and morph-feat embeddings.
2. The stack's top LSTM element is reduced.
3. The t-RNN calculates the new head embedding.
4. The σ-LSTM recalculates its hidden state from the new input.
5. The Tree-stack LSTM is ready to predict the next transition.

Final overview of Tree-stack LSTM

Figure: Full Tree-stack LSTM architecture (β-LSTM, σ-LSTM, Action-LSTM, and t-RNN outputs concatenated and fed to the MLP).


4 Results & Comparisons


Results & Comparisons

Dataset

CoNLL17:
• Dependency parsing of 81 treebanks in 49 languages
• All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations
• Koç-University ranked 7th out of 33 participants (1st among transition-based parsers)

CoNLL18:
• Dependency parsing of 82 treebanks in 57 languages
• All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations
• Koç-University ranked 16th out of 30 participants (2nd among transition-based parsers)

Differences from CoNLL17 to CoNLL18: 1. train/test split changes, 2. annotation changes.

MLP vs Tree-stack LSTM

The CoNLL 2018 committee released comparison results of CoNLL17 and CoNLL18 systems tested on the same test sets.


MLP vs Tree-stack LSTM

Two possible problems with the official comparison:

1. If the annotation of the treebank has been improved, the older parser is handicapped.

2. If the training-test split has changed and the old training data are now in the test data, the old parser is favored undeservedly.


MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code        MLP    Tree-stack
ru_taiga (10k)   58.89  60.55
hu_szeged (20k)  66.21  68.18
tr_imst (50k)    56.78  58.75
ar_padt (120k)   67.83  68.14
en_ewt (205k)    74.87  75.77
cs_cac (473k)    83.39  83.57

Tree-stack LSTM outperforms MLP


Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM


MLP Parser

Figure: Initial model (MLP only).


Only Action LSTM

Figure: Only the Action-LSTM feeding the MLP.


Only β-LSTM

Figure: Only the β-LSTM feeding the MLP.


Only σ-LSTM

Figure: Only the σ-LSTM feeding the MLP.


Ablation Analysis Results

Lang Code   MLP    Only Action  Only-β  Only-σ
hu_szeged   66.21  66.87        66.94   67.03
sv_lines    71.12  72.05        72.17   72.45
tr_imst     57.12  56.87        57.02   57.12
ar_padt     67.83  66.67        66.89   66.92
cs_cac      83.89  82.23        83.13   83.17
en_ewt      75.54  75.43        75.56   75.67

Table: Comparison between MLP and "Only" models.



Ablation of t-RNN

Comparison of tree-stack LSTMs with and without the t-RNN:

Lang Code           without t-RNN  with t-RNN
no_nynorsklia (3k)  51.78          53.33
ru_taiga (11k)      59.13          60.55
gl_treegal (15k)    69.76          70.45
hu_szeged (20k)     66.12          68.18
sv_lines (49k)      74.04          75.46
tr_imst (50k)       58.12          58.75
ar_padt (120k)      68.04          68.14
en_ewt (204k)       74.87          75.77
cs_cac (473k)       82.89          83.57
cs_pdt (1M)         81.17          81.16

The t-RNN provides a comparative advantage for low-resource languages.


Ablation Analysis

Overall results of ablation analysis

Lang        MLP    Only A  Only-β  Only-σ  w/o t-RNN  all
hu_szeged   66.21  66.87   66.94   67.03   66.12      68.18
sv_lines    71.12  72.05   72.17   74.04   72.17      75.46
tr_imst     57.12  56.87   57.02   57.12   58.12      58.75
ar_padt     67.83  66.67   66.89   66.92   68.04      68.14
cs_cac      83.89  82.23   83.13   83.17   82.89      83.57
en_ewt      75.54  75.43   75.56   75.67   74.87      75.77

Tree-stack LSTM beats other model variations


Ablation Analysis

Conclusions of Ablation Experiments

The t-RNN's performance contribution increases as the training size decreases.

The σ-LSTM provides more useful information independent of dataset size.

Interconnecting the model's components with the t-RNN makes the tree-stack LSTM more powerful for low-resource languages (ranked 10th overall and 2nd among transition-based parsers).


What do Morphological Feature Embeddings provide?


Contribution of Morph-feat Embeddings

Experimental Settings: We divide the CoNLL18 UD v2.2 dataset into 4 parts based on the number of training tokens for each language, to better understand our contributions:

Languages having less than 20k tokens

Languages having more than 20k and less than 50k tokens

Languages having more than 50k and less than 100k tokens

Languages having 100k tokens or more


Contribution of Morph-feat embeddings

Morph-feat experiments for languages with fewer than 20k training tokens:

Lang code      Morph-Feats  no Morph-Feats  # of tokens
no_nynorsklia  51.13        53.33             3,583
ru_taiga       58.32        60.55            10,479
sme_giella     52.78        53.39            16,385
la_perseus     49.93        51.60            18,184
ug_udt         52.78        53.39            19,262
sl_sst         46.72        48.77            19,473
hu_szeged      66.23        68.18            20,166

Not useful for languages having less than 20k training tokens


Contribution of Morph-feat embeddings

Morph-feat experiments for languages with between 50k and 100k training tokens:

Lang code    Morph-Feats  no Morph-Feats  # of tokens
sv_lines     72.18        74.81           48,325
fr_sequoia   84.36        82.17           50,543
en_gum       76.44        75.34           53,686
ko_gsd       73.74        72.54           56,687
eu_bdt       74.55        73.32           72,974
nl_lassymal  76.70        75.80           75,134
gl_ctg       79.02        79.018          79,327
lv_lvtb      72.33        72.24           80,666
id_gsd       75.76        73.97           97,531

Beneficial for languages with 50k-100k training tokens


Contribution of Morph-feat embeddings

Morph-feat experiments for languages with more than 100k training tokens:

Lang code  Morph-Feats  no Morph-Feats  # of tokens
fa_seraji  81.18        81.12             121,064
bg_btb     84.53        84.55             124,336
en_ewt     75.77        75.682            204,585
ar_padt    68.02        68.14             223,881
de_gsd     71.59        71.32             263,804
ca_ancora  85.89        85.874            417,587
es_ancora  84.99        84.78             444,617
cs_cac     83.57        83.63             472,608
cs_pdt     81.43        82.12           1,173,282

Neutral for languages having more than 100k training tokens


Static vs Dynamic Oracle Training

Static oracle: training transitions follow the gold moves.
Dynamic oracle: training transitions follow the predicted moves.

In both cases, the log probability of the gold moves is maximized.
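The difference between the two regimes is only in which move the parser follows after each update. A hedged, toy illustration of the control flow, where the "parser" acts on an integer state and all functions are illustrative stand-ins, not the thesis training code:

```python
# Toy static vs. dynamic oracle training loops: same loss on the gold
# move, but the dynamic loop advances with the model's own prediction.
import math
import random

random.seed(0)

def step(state, move):
    return state + move                  # toy transition

def logp(state, move):
    return math.log(0.5)                 # toy uniform model over 2 moves

def predict(state):
    return random.choice([0, 1])         # toy model prediction

def oracle(state):
    return state % 2                     # toy "best move from current state"

def train_static(state, gold_moves):
    loss = 0.0
    for gold in gold_moves:
        loss -= logp(state, gold)        # maximize log p of the gold move
        state = step(state, gold)        # static: always follow gold
    return loss

def train_dynamic(state, n_steps):
    loss = 0.0
    for _ in range(n_steps):
        gold = oracle(state)             # dynamic: gold recomputed per state
        loss -= logp(state, gold)
        state = step(state, predict(state))  # follow the model's move
    return loss

print(round(train_static(0, [1, 0, 1]), 3), round(train_dynamic(0, 3), 3))
```

With a dynamic oracle the model is trained on configurations it actually reaches at test time, including ones produced by its own mistakes.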


Static vs Dynamic Oracle Training

Figure: Results are very close for training sets with fewer than 20k tokens.


Static vs Dynamic Oracle Training

Figure: Results are very close for training sets with between 20k and 50k tokens.


Static vs Dynamic Oracle Training

Figure: Results are very close for training sets with more than 50k tokens.


How about languages with less than 20k training tokens?


Transfer Learning

There are 4 possible types of transfer learning:
1. Using very limited data to train the LM for word and context vectors, and using them to train a parser from scratch
2. Using Facebook's word vectors to train a parser [Bojanowski et al., 2017]
3. Using my own word and context vectors trained on a different language from the same language family
4. Applying transfer learning with a pre-trained parser

Language      (1)           (2)    (3)    (4)
af_afribooms  not provided  75.46  77.43  78.12
kk_ktb        20.19         22.31  21.96  23.86
bxr_bdt        7.64          9.76   9.93   8.98
kmr_mg        20.12         22.57  22.78  23.39

Table: LAS values for strategies (1), (2), (3), and (4).


Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the most beneficial.

Training the LM from scratch does not yield useful word and context vectors.

Our word and context vectors are still more useful than Facebook's [Bojanowski et al., 2017].


Projectivity

Transition-based parsers can only build projective trees. [6]

[6] Figure from https://stp.lingfil.uu.se/~sara/kurser/5LN455-2014/lectures/5LN455-F8.pdf


Projective vs Non-projective

We compared our model with the best model for different projectivity ratios.

Language     Projectivity  Best (LAS)  Our (LAS)
grc_perseus   90.7%        79.39       55.03 (20)
eu_bdt        95.13%       84.22       74.13 (17)
hu_szeged     97.8%        82.66       68.18 (14)
da_ddt        98.26%       86.28       76.40 (17)
en_gum        99.6%        85.05       76.44 (15)
gl_treegal   100%          74.25       70.45 (10)
gl_ctg       100%          82.12       79.45 (14)

Table: Our model's performance gap decreases as the projectivity ratio increases. [7]

[7] From the official results page and our projectivity table.

Conclusions


Conclusion

In conclusion: We introduced "Context, Word, and Morph-feat" embeddings and showed their contribution to transition-based dependency parsing.

Our Tree-stack LSTM outperformed the MLP while removing hand-crafted feature engineering.

The Tree-stack LSTM performed better on low-resource languages.

As the training dataset size increases, the Tree-stack LSTM loses its advantage.


Future Research Direction

End-to-End Training

Systems jointly trained for tokenization, morphological tagging, and dependency parsing performed better. Some also jointly train a language model together with pre-trained embeddings.

Attention Mechanism

Applying attention over the σ-LSTM, β-LSTM, or Action-LSTM states may bring performance improvements.

Morphological Features

Finding a different way to represent morphological features.

Dynamic Oracle vs Beam Training

Although I tried both of them, I could not obtain a performance improvement. There may be convergence problems with our loss function, and other losses (e.g., CRF) may solve this problem.


Publications

Ömer Kırnap, Erenay Dayanık, and Deniz Yuret. 2018. Tree-stack LSTM in Transition Based Dependency Parsing. In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Ömer Kırnap, Berkay Furkan Önder, and Deniz Yuret. 2017. Parsing with Context Embeddings. In Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.


References

Marco Kuhlmann, Carlos Gómez-Rodríguez, and Giorgio Satta. 2011. Dynamic programming algorithms for transition-based dependency parsers. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Volume 1, pages 673-682. Association for Computational Linguistics.

S. Kübler, R. McDonald, and J. Nivre. 2009. Dependency Parsing. Morgan & Claypool, US.

Chris Dyer, Miguel Ballesteros, Wang Ling, Austin Matthews, and Noah A. Smith. 2015. Transition-based dependency parsing with stack long short-term memory. CoRR abs/1505.08075.


Thank you for your attention


Questions


  • Introduction
    • Overview of Dependency Parsing
    • Transition Based Dependency Parsing
      • Related Work
        • Linear Models and their Drawbacks
        • Neural Network Models
          • Model
            • Language Model
            • MLP Parser
            • Tree-stack LSTM Parser
              • Results
                • MLP vs Tree-stack LSTM
                • Morphological Feature Embeddings
                • Static vs Dynamic Oracle Training
                • Transfer Learning
                  • Conclusion
                  • Future Work amp Discussions
Page 2: Transition Based Dependency Parsing with Deep LearningTransition Based Dependency Parsing with Deep Learning Omer K rnap Ko˘c University okirnap@ku.edu.tr September 27, 2018 Omer

Overview

1 IntroductionOverview of Dependency ParsingTransition Based Dependency Parsing

2 Related WorkLinear Models and their DrawbacksNeural Network Models

3 ModelLanguage ModelMLP ParserTree-stack LSTM Parser

4 ResultsMLP vs Tree-stack LSTMMorphological Feature EmbeddingsStatic vs Dynamic Oracle TrainingTransfer Learning

5 Conclusion6 Future Work amp Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 2 123

1 Introduction

Omer Kırnap (Koc University) MSc Thesis September 27 2018 3 123

Introduction

What is dependency parsing

Dependency parsing aims to detect word relations by finding the treestructure of a sentence inspired by dependency grammar

Figure Dependency annotations for a sentence ldquo Economic news had little effecton financial marketsrdquo

1

1Figure from S Kbler R McDonald and J Nivre 2009 Dependency parsingMorgan amp Claypool US

Omer Kırnap (Koc University) MSc Thesis September 27 2018 4 123

Introduction

Why do we need dependency parsing

Dependencies resolve ambiguity

Useful for some down-stream tasks in NLP

2

2Figure from httpwwwphontroncomslidesnlp-programming-en-11-dependpdfOmer Kırnap (Koc University) MSc Thesis September 27 2018 5 123

Introduction

Dependency Parsing Categorization

Grammar BasedRelying on a formal grammardefining a formal languageasking whether a given inputsentence is in the languagedefined by the grammar or not

Data-drivenMaking essential use of machinelearning from linguistic data in orderto parse new sentences

3. From S. Kübler, R. McDonald, and J. Nivre. 2009. Dependency Parsing. Morgan & Claypool, US.

Introduction

Data-driven Dependency Parsing

Graph Based Algorithms

Using maximum spanning tree algorithms from graph theory

Transition Based Algorithms

Capitalizing on greedy stack-based algorithms to build the dependency tree with incremental steps in linear time

4. From S. Kübler, R. McDonald, and J. Nivre. 2009. Dependency Parsing. Morgan & Claypool, US.

Introduction

Transition Based Dependency Parsing

Transition system: an abstract machine with a set of configurations (states) and transitions. We use the arc-hybrid transition system [Kuhlmann et al., 2011].

Configurations (σ, β, A):
• σ: stack of tree fragments, initially empty
• β: buffer of words, initially containing the whole sentence
• A: set of dependency arcs (head, relation, modifier), initially empty

Transitions:
• shift(σ, b|β, A) = (σ|b, β, A)
• left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})
• right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})
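The three transitions above can be sketched in a few lines of Python. This is an illustrative toy using word indices, not the thesis implementation; arcs are (head, relation, modifier) triples.

```python
# Toy arc-hybrid transitions: σ is a list (top = last element), β is a
# list of upcoming word indices, A is a set of (head, rel, modifier) arcs.

def shift(stack, buffer, arcs):
    # shift(σ, b|β, A) = (σ|b, β, A)
    return stack + [buffer[0]], buffer[1:], arcs

def left(stack, buffer, arcs, d):
    # left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)}): the first buffer
    # word becomes the head of the stack top
    return stack[:-1], buffer, arcs | {(buffer[0], d, stack[-1])}

def right(stack, buffer, arcs, d):
    # right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)}): the second stack
    # item becomes the head of the stack top
    return stack[:-1], buffer, arcs | {(stack[-2], d, stack[-1])}

# "Economic news": word 1 modifies word 2
stack, buffer, arcs = [], [1, 2], set()
stack, buffer, arcs = shift(stack, buffer, arcs)         # σ=[1], β=[2]
stack, buffer, arcs = left(stack, buffer, arcs, "amod")  # arc 2 -amod-> 1
stack, buffer, arcs = shift(stack, buffer, arcs)         # σ=[2], β=[]
```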

An example parsing of a sentence

Problem Definition

Find a model that learns to decide the correct transition from the current state.

2 Related Work

Related Work

Neural Networks for Feature Conjunctions

Neural networks can handle feature conjunctions and nonlinearity. However, they are impractical for high-dimensional inputs: they scale linearly in input dimensions (in both time and space, assuming a fixed number of hidden units).

Related Work

Solution: use dense embeddings for input features.

3 Model

Model Overview

2 shared tasks on Multilingual Parsing from Raw Text to Universal Dependencies:

CoNLL17
• Koc-University team with the MLP Parser using context embeddings

CoNLL18
• KParse team with the Tree-stack LSTM Parser using context and morph-feat embeddings

a Language Model

Language Model (LM)

The LM is used to obtain context and word embeddings, with two components:

Character Based LSTM extracts word vectors

Word Based BiLSTM extracts context vectors
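The two components above can be sketched at the interface level. This is a toy stand-in with made-up recurrences and dimensions, not the thesis LM: a character-level function maps each word to a word vector, and a forward/backward pass over the sentence yields one context vector per word.

```python
import numpy as np

def char_lstm(word, dim=4):
    # stand-in for a character LSTM: a toy recurrence over characters
    vec = np.zeros(dim)
    for ch in word:
        vec = np.tanh(vec + ord(ch) / 128.0)
    return vec

def word_bilstm(word_vecs):
    # stand-in for forward and backward LSTM passes over the sentence:
    # cumulative summaries from the left and from the right
    fwd = np.cumsum(word_vecs, axis=0)
    bwd = np.cumsum(word_vecs[::-1], axis=0)[::-1]
    return np.concatenate([fwd, bwd], axis=1)  # one context vector per word

words = ["economic", "news"]
word_vecs = np.stack([char_lstm(w) for w in words])   # one word vector per word
context_vecs = word_bilstm(word_vecs)                 # fwd+bwd context per word
```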

Language Model - Word vectors

A character-based LSTM generates word vectors.

Figure: Character LSTM, from Kırnap et al. 2017

Language Model - Context Vectors

A word-based BiLSTM generates context vectors.

Figure: Word BiLSTM, from Kırnap et al. 2017

b MLP Parser (CoNLL17)

MLP Parser

MLP Parser consists of 4 components

Character Based LSTM extracts word vectors

Word Based BiLSTM extracts context vectors

Feature extractor describes current state

Decision module (MLP) decides the next transition

MLP Parser - Feature Extraction

Feature extractor describes current state

Figure: Kırnap et al. 2017

MLP Parser - Decision Module

Decision module (MLP) decides the next transition

Experiments & Dataset (MLP), CoNLL17

CoNLL17 Dataset

Dependency parsing of 81 treebanks in 49 languages

All treebanks use standardized annotation:
• 17 universal part-of-speech tags
• 37 universal dependency relations

Experiments - Evaluation Metric

Labeled Attachment Score (LAS): the percentage of words assigned both the correct syntactic head and the correct dependency label.

Example with the fragment "Economic news had":
• Gold tree (arcs ATT and SBJ): LAS = 1
• Prediction 1 (arcs PRED and OBJ): LAS = 0
• Prediction 2 (arcs OBJ and ATT): LAS = (1/2) · 100 = 50
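The metric itself is straightforward to compute; a hedged sketch over per-word (head, label) pairs:

```python
def las(gold, pred):
    """gold, pred: one (head, label) pair per word. Returns LAS in percent."""
    correct = sum(1 for g, p in zip(gold, pred) if g == p)
    return 100.0 * correct / len(gold)

gold = [(2, "ATT"), (3, "SBJ")]   # Economic -> news (ATT), news -> had (SBJ)
pred = [(2, "ATT"), (3, "OBJ")]   # heads right, one label wrong
score = las(gold, pred)           # 50.0
```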

Experiments (MLP)

CoNLL 2017 Results (all treebanks LAS)

Ranked 1st among transition-based parsers 5

5. Source: CoNLL17 official results page

Contributions in CoNLL17

Context and Word Embeddings

Relative contributions of part-of-speech (p), word vector (v), context vector (c):

Feats   Hungarian   En-ParTUT   Latvian
p       63.6        76.6        55.9
v       73.5        75.9        63.0
c       72.2        76.0        63.5
v-c     76.0        79.0        67.6
p-c     78.0        82.5        70.6
p-v     76.6        80.8        67.7
p-fb    74.7        79.7        66.3
p-v-c   79.3        83.2        74.2

Context and Word Embeddings


Context vectors provide an independent contribution on top of POS tags

Context and Word embeddings


Our BiLSTM language model word vectors perform better than FB vectors

Context and Word embeddings


Both POS tags and context vectors make significant contributions on top of word vectors

Issues with MLP

However

Choosing the correct state representation of the parser remains critical

We are unable to represent the whole parsing history with feature extraction

Solution

Find a recurrent architecture that can summarize the parsing history as well as the word sequences in the buffer and stack.

c Tree-stack LSTM Parser (CoNLL18)

Related Work - Stack LSTM

Figure: Stack LSTM [Dyer et al., 2015]

Represent each component (σ, β, A) with an LSTM; the head word's embedding is modified with the dependent's embedding.

Problems with Stack LSTM

They only modify the stack's word embeddings

Hidden states of the LSTMs are not updated unless a reduce occurs

Actions are not explicitly represented

They only used word2vec embeddings [Mikolov et al., 2013]

Our solution

We propose

Context embeddings should improve parsing accuracy

Dependency relations should be explicitly represented

Morphological Features of a word may enhance parsing accuracy

Tree-stack LSTM Overview

Figure: Tree-stack LSTM overview: the σ-, β-, and Action-LSTM states are concatenated and fed to an MLP; a t-RNN composes the head word, dependent word, and dependency relation.

We propose the Tree-stack LSTM model with 4 components:
• β-LSTM
• σ-LSTM
• Action-LSTM
• Tree-RNN

Tree-stack LSTM

Input Representation

Input Representation

Action and Dependency Relation Embeddings

Every action is represented with a continuous vector

Every dependency relation is represented with a continuous vector

Input Representation

We do not include an explicit feature extractor. We initialize the word representation by concatenating:

Character-based LSTM's word vectors

Word-based BiLSTM's context vectors

Part-of-speech (POS) vectors

Morph-feat vectors

Input Representation

Morph-feat Vectors

Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs IT It

Figure Morph-feat Embeddings
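One plausible way to embed such a feature string is shown below. This is an illustrative sketch: the dimensions, the on-demand lookup table, and the sum-pooling choice are assumptions, not the thesis design.

```python
import numpy as np

feats = "Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs"
pairs = [f.split("=", 1) for f in feats.split("|")]

rng = np.random.default_rng(0)
dim = 8
table = {}  # one embedding per feature=value pair, created on demand

def embed(pair):
    key = "=".join(pair)
    if key not in table:
        table[key] = rng.normal(size=dim)
    return table[key]

# pool the per-feature embeddings into one morph-feat vector for the word
morph_vec = np.sum([embed(p) for p in pairs], axis=0)
```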

Tree-stack LSTM

Model components:
1. β-LSTM
2. σ-LSTM
3. Action-LSTM
4. Tree-RNN


β-LSTM

Figure: The buffer's β-LSTM runs over the upcoming words w_i, w_i+1, w_i+2


σ-LSTM

Figure: The stack's σ-LSTM runs over the stack elements s_i, s_i+1, s_i+2


Action-LSTM

Figure: The Action-LSTM runs over the sequence of past transitions

How are the components of the tree-stack LSTM connected?

Tree-RNN

Tree-RNN (t-RNN)

Figure: t-RNN combines the head word, dependent word, and dependency relation embeddings

w_head_new = tanh(W_rnn · [w_head_old; d_l; w_dep] + b_rnn)    (1)
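Eq. (1) can be sketched directly. The dimensions below are illustrative, and W_rnn and b_rnn would be learned parameters in practice:

```python
import numpy as np

d_word, d_rel = 4, 2
rng = np.random.default_rng(0)
W_rnn = rng.normal(size=(d_word, 2 * d_word + d_rel))  # learned in practice
b_rnn = np.zeros(d_word)

def trnn_update(w_head_old, d_l, w_dep):
    # Eq. (1): w_head_new = tanh(W_rnn · [w_head_old; d_l; w_dep] + b_rnn)
    x = np.concatenate([w_head_old, d_l, w_dep])
    return np.tanh(W_rnn @ x + b_rnn)

w_head_new = trnn_update(rng.normal(size=d_word),
                         rng.normal(size=d_rel),
                         rng.normal(size=d_word))
```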

Tree-RNN with

1. Left transition
2. Right transition

Left Transition

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: Each embedding is initialized by concatenating POS, language, and morph-feat embeddings

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: The stack's top LSTM is reduced

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: The t-RNN calculates the new head embedding

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: The β-LSTM recalculates its hidden state based on the new input

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: The tree-stack LSTM is ready to predict the next transition

Right Transition

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: Each embedding is initialized by concatenating POS, language, and morph-feat embeddings

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: The stack's top LSTM is reduced

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: The t-RNN calculates the new head embedding

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: The σ-LSTM recalculates its hidden state from the new input

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: The tree-stack LSTM is ready to predict the next transition

Final overview of Tree-stack LSTM

Figure: The complete Tree-stack LSTM: σ-, β-, and Action-LSTM outputs are concatenated and fed to an MLP, with the t-RNN composing head and dependent embeddings

4 Results & Comparisons

Results & Comparisons

Dataset

CoNLL17:
• Dependency parsing of 81 treebanks in 49 languages
• All treebanks use standardized annotation (17 universal part-of-speech tags, 37 universal dependency relations)
• Koc-University ranked 7th out of 33 participants (1st among transition-based parsers)

CoNLL18:
• Dependency parsing of 82 treebanks in 57 languages
• All treebanks use standardized annotation (17 universal part-of-speech tags, 37 universal dependency relations)
• Koc-University ranked 16th out of 30 participants (2nd among transition-based parsers)

Changes from CoNLL17 to CoNLL18: 1. train/test split change, 2. annotation

MLP vs Tree-stack LSTM

The CoNLL 2018 committee released comparison results of CoNLL17 and CoNLL18 systems tested on the same test sets.

MLP vs Tree-stack LSTM

2 possible problems with the official comparison:

1. If the annotation of the treebank is improved, the older parser is handicapped
2. If the training-test split has changed and the old training data are now in the test data, the old parser is favored undeservedly

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang code         MLP     Tree-stack
ru_taiga (10k)    58.89   60.55
hu_szeged (20k)   66.21   68.18
tr_imst (50k)     56.78   58.75
ar_padt (120k)    67.83   68.14
en_ewt (205k)     74.87   75.77
cs_cac (473k)     83.39   83.57

Tree-stack LSTM outperforms MLP

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM

MLP Parser

Figure: Initial model (MLP)

Only Action LSTM

Figure: Only the action LSTM

Only β-LSTM

Figure: Only the β-LSTM

Only σ-LSTM

Figure: Only the σ-LSTM

Ablation Analysis Results

Lang code    MLP     Only Action   Only-β   Only-σ
hu_szeged    66.21   66.87         66.94    67.03
sv_lines     71.12   72.05         72.17    72.45
tr_imst      57.12   56.87         57.02    57.12
ar_padt      67.83   66.67         66.89    66.92
cs_cac       83.89   82.23         83.13    83.17
en_ewt       75.54   75.43         75.56    75.67

Table: Comparison between MLP and "Only" models

Ablation of t-RNN


Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang code            without t-RNN   with t-RNN
no_nynorsklia (3k)   51.78           53.33
ru_taiga (11k)       59.13           60.55
gl_treegal (15k)     69.76           70.45
hu_szeged (20k)      66.12           68.18
sv_lines (49k)       74.04           75.46
tr_imst (50k)        58.12           58.75
ar_padt (120k)       68.04           68.14
en_ewt (204k)        74.87           75.77
cs_cac (473k)        82.89           83.57
cs_pdt (1M)          81.17           81.16

t-RNN provides comparative advantage for low-resourcelanguages

Ablation Analysis

Overall results of ablation analysis

Lang        MLP     Only A   Only-β   Only-σ   w/o t-RNN   all
hu_szeged   66.21   66.87    66.94    67.03    66.12       68.18
sv_lines    71.12   72.05    72.17    74.04    72.17       75.46
tr_imst     57.12   56.87    57.02    57.12    58.12       58.75
ar_padt     67.83   66.67    66.89    66.92    68.04       68.14
cs_cac      83.89   82.23    83.13    83.17    82.89       83.57
en_ewt      75.54   75.43    75.56    75.67    74.87       75.77

Tree-stack LSTM beats other model variations

Ablation Analysis

Conclusions of Ablation Experiments

The t-RNN's performance contribution increases when the training size decreases

The σ-LSTM provides more useful information independent of dataset size

Interconnecting the model's components with the t-RNN makes the tree-stack LSTM more powerful for low-resource languages (ranked 10th overall and 2nd among transition-based parsers)

What does Morphological Feature Embedding provide?

Contribution of Morph-feat Embeddings

Experimental settings: we divide the CoNLL18 UD v2.2 dataset into 4 parts based on the number of training tokens per language, to better understand our contributions:

Languages having less than 20k tokens

Languages having more than 20k, less than 50k tokens

Languages having more than 50k, less than 100k tokens

Languages having 100k tokens or more

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having less than 20k training tokens:

Lang code       Morph-Feats   no Morph-Feats   # of tokens
no_nynorsklia   51.13         53.33            3583
ru_taiga        58.32         60.55            10479
sme_giella      52.78         53.39            16385
la_perseus      49.93         51.60            18184
ug_udt          52.78         53.39            19262
sl_sst          46.72         48.77            19473
hu_szeged       66.23         68.18            20166

Not useful for languages having less than 20k training tokens

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having between 50k and 100k training tokens:

Lang code       Morph-Feats   no Morph-Feats   # of tokens
sv_lines        72.18         74.81            48325
fr_sequoia      84.36         82.17            50543
en_gum          76.44         75.34            53686
ko_gsd          73.74         72.54            56687
eu_bdt          74.55         73.32            72974
nl_lassysmall   76.70         75.80            75134
gl_ctg          79.02         79.018           79327
lv_lvtb         72.33         72.24            80666
id_gsd          75.76         73.97            97531

Beneficial for languages with 50k-100k training tokens

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having more than 100k training tokens:

Lang code   Morph-Feats   no Morph-Feats   # of tokens
fa_seraji   81.18         81.12            121064
bg_btb      84.53         84.55            124336
en_ewt      75.77         75.682           204585
ar_padt     68.02         68.14            223881
de_gsd      71.59         71.32            263804
ca_ancora   85.89         85.874           417587
es_ancora   84.99         84.78            444617
cs_cac      83.57         83.63            472608
cs_pdt      81.43         82.12            1173282

Neutral for languages having more than 100k training tokens

Static vs Dynamic Oracle Training

Static oracle: training transitions follow gold moves.
Dynamic oracle: training transitions follow predicted moves.

In both cases, the log-probability of the gold moves is maximized.
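The distinction can be sketched as follows. This is a toy: the scores dictionary stands in for the MLP outputs, and the 0.9 exploration rate is an assumption, not the thesis setting.

```python
import math
import random

def log_prob(scores, move):
    # log-softmax over the transition scores
    z = sum(math.exp(s) for s in scores.values())
    return math.log(math.exp(scores[move]) / z)

def oracle_step(scores, gold_move, dynamic, rng):
    loss = -log_prob(scores, gold_move)   # both oracles maximize log p(gold)
    predicted = max(scores, key=scores.get)
    # the static oracle always follows the gold move to the next state;
    # the dynamic oracle (usually) follows the model's own prediction
    follow = predicted if dynamic and rng.random() < 0.9 else gold_move
    return loss, follow

scores = {"shift": 2.0, "left": 0.5, "right": -1.0}
loss_s, follow_s = oracle_step(scores, "left", dynamic=False, rng=random.Random(0))
loss_d, follow_d = oracle_step(scores, "left", dynamic=True, rng=random.Random(0))
```

The loss is identical in both modes; only the state actually visited next differs.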


Static vs Dynamic Oracle Training

Figure: Results are very close for training sets of fewer than 20k tokens

Static vs Dynamic Oracle Training

Figure: Results are very close for training sets of between 20k and 50k tokens

Static vs Dynamic Oracle Training

Figure: Results are very close for training sets of more than 50k tokens

How about languages with less than 20k training tokens?

Transfer Learning

There are 4 possible types of transfer learning:
1. Using very limited data to train the LM for word and context vectors, and using them to train a parser from scratch
2. Using Facebook's word vectors to train a parser [Bojanowski et al., 2017]
3. Using my own word and context vectors trained on a different language from the same language family
4. Applying transfer learning with a pre-trained parser

Language       (1)            (2)     (3)     (4)
af_afribooms   not provided   75.46   77.43   78.12
kk_ktb         20.19          22.31   21.96   23.86
bxr_bdt        7.64           9.76    9.93    8.98
kmr_mg         20.12          22.57   22.78   23.39

Table: LAS values for strategies (1), (2), (3), and (4)

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the most beneficial

Training the LM from scratch does not produce useful word and context vectors

Our word and context vectors are still more useful than Facebook's [Bojanowski et al., 2017]

Projectivity

Transition-based parsers can only build projective trees 6

6. Figure from https://stp.lingfil.uu.se/~sara/kurser/5LN455-2014/lectures/5LN455-F8.pdf
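Projectivity is easy to check: a tree is projective iff no two dependency arcs cross. A small sketch, where heads[i] is the head of word i+1 and 0 denotes the root:

```python
def is_projective(heads):
    # build arcs as (left, right) index pairs; dependents are numbered from 1
    arcs = [(min(h, d), max(h, d)) for d, h in enumerate(heads, start=1)]
    for l1, r1 in arcs:
        for l2, r2 in arcs:
            if l1 < l2 < r1 < r2:  # strictly interleaved endpoints: crossing
                return False
    return True

projective = is_projective([2, 0, 2])    # 1 <- 2 -> 3: no crossing arcs
crossing = is_projective([3, 4, 0, 3])   # arcs (1,3) and (2,4) cross
```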

Projective vs Non-projective

We compared our model with the best model for different projectivity ratios.

Language      Projectivity %   Best (LAS)   Our (LAS)
grc_perseus   90.7             79.39        55.03 (20)
eu_bdt        95.13            84.22        74.13 (17)
hu_szeged     97.8             82.66        68.18 (14)
da_ddt        98.26            86.28        76.40 (17)
en_gum        99.6             85.05        76.44 (15)
gl_treegal    100              74.25        70.45 (10)
gl_ctg        100              82.12        79.45 (14)

Table: Our model's performance gap decreases as the projectivity ratio increases 7

7. From the official results page and our projectivity table

Conclusions

Conclusion

In conclusion, we introduced "context, word, and morph-feat" embeddings and showed their contribution to transition-based dependency parsing.

Our Tree-stack LSTM outperformed the MLP while removing hand-crafted feature engineering

The Tree-stack LSTM performed better on low-resource languages

When the training dataset size increases, the tree-stack LSTM loses its advantage

Future Research Direction

End-to-End Training

Systems jointly trained for tokenization, morphological tagging, and dependency parsing performed better. Some also jointly train a language model together with pre-trained embeddings.

Attention Mechanism

Applying attention between σ-LSTM, β-LSTM, or Action-LSTM states may bring a performance improvement.

Morphological Features

Finding a different way to represent morphological features.

Dynamic Oracle vs Beam Training

Although I tried both of them, I could not obtain a performance improvement. There may be convergence problems with our loss function, and other losses (e.g. CRF) may solve this problem.

Publications

Omer Kırnap, Erenay Dayanık, and Deniz Yuret. 2018. Tree-stack LSTM in Transition Based Dependency Parsing. In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap, Berkay Furkan Önder, and Deniz Yuret. 2017. Parsing with Context Embeddings. In Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

References

Marco Kuhlmann, Carlos Gómez-Rodríguez, and Giorgio Satta. 2011. Dynamic programming algorithms for transition-based dependency parsers. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Volume 1, pages 673-682. Association for Computational Linguistics.

S. Kübler, R. McDonald, and J. Nivre. 2009. Dependency Parsing. Morgan & Claypool, US.

Chris Dyer, Miguel Ballesteros, Wang Ling, Austin Matthews, and Noah A. Smith. 2015. Transition-based dependency parsing with stack long short-term memory. CoRR abs/1505.08075.

Thank you for your attention

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123

  • Introduction
    • Overview of Dependency Parsing
    • Transition Based Dependency Parsing
      • Related Work
        • Linear Models and their Drawbacks
        • Neural Network Models
          • Model
            • Language Model
            • MLP Parser
            • Tree-stack LSTM Parser
              • Results
                • MLP vs Tree-stack LSTM
                • Morphological Feature Embeddings
                • Static vs Dynamic Oracle Training
                • Transfer Learning
                  • Conclusion
                  • Future Work amp Discussions
Page 3: Transition Based Dependency Parsing with Deep LearningTransition Based Dependency Parsing with Deep Learning Omer K rnap Ko˘c University okirnap@ku.edu.tr September 27, 2018 Omer

1 Introduction

Omer Kırnap (Koc University) MSc Thesis September 27 2018 3 123

Introduction

What is dependency parsing

Dependency parsing aims to detect word relations by finding the treestructure of a sentence inspired by dependency grammar

Figure Dependency annotations for a sentence ldquo Economic news had little effecton financial marketsrdquo

1

1Figure from S Kbler R McDonald and J Nivre 2009 Dependency parsingMorgan amp Claypool US

Omer Kırnap (Koc University) MSc Thesis September 27 2018 4 123

Introduction

Why do we need dependency parsing

Dependencies resolve ambiguity

Useful for some down-stream tasks in NLP

2

2Figure from httpwwwphontroncomslidesnlp-programming-en-11-dependpdfOmer Kırnap (Koc University) MSc Thesis September 27 2018 5 123

Introduction

Dependency Parsing Categorization

Grammar BasedRelying on a formal grammardefining a formal languageasking whether a given inputsentence is in the languagedefined by the grammar or not

Data-drivenMaking essential use of machinelearning from linguistic data in orderto parse new sentences

3

3From S Kbler R McDonald and J Nivre 2009 Dependency parsing Morgan ampClaypool US

Omer Kırnap (Koc University) MSc Thesis September 27 2018 6 123

Introduction

Data-driven Dependency Parsing

Graph Based Algorithms

Using maximum spanning tree algorithms from graph theory

Transition Based Algorithms

Capitalizing on greedy stack based algorithms to build dependency treewith incremental steps in linear time

4

4From S Kbler R McDonald and J Nivre 2009 Dependency parsing Morgan ampClaypool US

Omer Kırnap (Koc University) MSc Thesis September 27 2018 7 123

Introduction

Transition Based Dependency Parsing

Transition System Abstract machine with a set of configurations(states) and transitions We use the ArcHybrid transition system[Kuhlmann et al 2011]

Configurations (σ β A)bull σ Stack of tree fragments initially emptybull β Buffer of words initially containing the whole sentencebull A Set of dependency arcs (head relation modifier) initially empty

Transitionsbull shift(σ b|βA) = (σ|b βA)bull leftd(σ|s b|βA) = (σ b|βA cup (b d s))bull rightd(σ|s|t βA) = (σ|s βA cup (s d t))

Omer Kırnap (Koc University) MSc Thesis September 27 2018 8 123

An example parsing of a sentence

Omer Kırnap (Koc University) MSc Thesis September 27 2018 9 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 10 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 11 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 12 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 13 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 14 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 15 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 16 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 17 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 18 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 19 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 20 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 21 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 22 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 23 123

Problem Definition

Find a model learning to decide correct transition from current state

Omer Kırnap (Koc University) MSc Thesis September 27 2018 24 123

2 Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 25 123

Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 26 123

Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 27 123

Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 28 123

Related Work

Neural Networks for Feature Conjunctions

Neural networks can handle feature conjunctions and nonlinearity. However, they are impractical for high-dimensional inputs: they scale linearly in the input dimension (in both time and space, assuming a fixed number of hidden units).

Omer Kırnap (Koc University) MSc Thesis September 27 2018 29 123

Related Work

Solution: use dense embeddings for input features.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 30 123

Overview

1 Introduction
• Overview of Dependency Parsing
• Transition Based Dependency Parsing

2 Related Work
• Linear Models and their Drawbacks
• Neural Network Models

3 Model
• Language Model
• MLP Parser
• Tree-stack LSTM Parser

4 Results
• MLP vs Tree-stack LSTM
• Morphological Feature Embeddings
• Static vs Dynamic Oracle Training
• Transfer Learning

5 Conclusion

6 Future Work & Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 31 123

3 Model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 32 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to Universal Dependencies

CoNLL17
• Koc-University team with MLP Parser using Context Embeddings

CoNLL18
• KParse team with Tree-stack LSTM Parser using Context and Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 33 123


a Language Model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 35 123

Language Model (LM)

The LM is used to obtain context and word embeddings with two components:

Character Based LSTM extracts word vectors

Word Based BiLSTM extracts context vectors
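The two LM components can be sketched as follows. Plain (Elman-style) RNN cells stand in for the thesis's LSTMs, and all sizes, weights, and the toy vocabulary are illustrative assumptions, not the actual implementation.

```python
import numpy as np

rng = np.random.default_rng(0)
D = 8  # embedding/hidden size (assumption)

def rnn(inputs, Wx, Wh):
    # Run a simple recurrent cell over a sequence; return the final state.
    h = np.zeros(Wh.shape[0])
    for x in inputs:
        h = np.tanh(Wx @ x + Wh @ h)
    return h

chars = {c: rng.normal(size=D) for c in "abcdefghijklmnopqrstuvwxyz"}
Wx, Wh = rng.normal(size=(D, D)) / D, rng.normal(size=(D, D)) / D

def word_vector(word):
    # Character-based RNN: the final hidden state summarizes the spelling.
    return rnn([chars[c] for c in word], Wx, Wh)

def context_vectors(sentence):
    # Word-based bidirectional RNN over word vectors: each position gets a
    # forward state (left context) and a backward state (right context).
    vs = [word_vector(w) for w in sentence]
    fwd = [rnn(vs[: i + 1], Wx, Wh) for i in range(len(vs))]
    bwd = [rnn(vs[i:][::-1], Wx, Wh) for i in range(len(vs))]
    return [np.concatenate([f, b]) for f, b in zip(fwd, bwd)]

ctx = context_vectors(["economic", "news", "had", "effect"])
```

Each word thus receives a spelling-derived word vector plus a sentence-dependent context vector, which is the split the following slides rely on.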

Omer Kırnap (Koc University) MSc Thesis September 27 2018 36 123

Language Model - Word vectors

Character-based LSTM generates word vectors.

Figure: Character LSTM, from Kırnap et al. 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 37 123

Language Model - Context Vectors

Word-based BiLSTM generates context vectors.

Figure: Word BiLSTM, from Kırnap et al. 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 38 123

b MLP Parser (CoNLL17)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 39 123

MLP Parser

The MLP Parser consists of 4 components:

Character Based LSTM extracts word vectors

Word Based BiLSTM extracts context vectors

Feature extractor describes current state

Decision module (MLP) decides the next transition
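The decision module can be sketched as a small feed-forward network mapping a state-feature vector to a distribution over transitions. Sizes, weights, and the feature vector are illustrative assumptions, not the thesis's trained model.

```python
import numpy as np

rng = np.random.default_rng(1)
TRANSITIONS = ["shift", "left", "right"]
F, H = 12, 16  # feature and hidden layer sizes (assumptions)
W1, b1 = rng.normal(size=(H, F)), np.zeros(H)
W2, b2 = rng.normal(size=(len(TRANSITIONS), H)), np.zeros(len(TRANSITIONS))

def decide(features):
    h = np.tanh(W1 @ features + b1)   # hidden layer
    scores = W2 @ h + b2              # one score per transition
    p = np.exp(scores - scores.max())
    p /= p.sum()                      # softmax over transitions
    return TRANSITIONS[int(np.argmax(p))], p

move, probs = decide(rng.normal(size=F))
```

At parse time the highest-probability transition (subject to validity in the current configuration) is executed greedily.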

Omer Kırnap (Koc University) MSc Thesis September 27 2018 40 123

MLP Parser - Feature Extraction

Feature extractor describes current state

Figure: Kırnap et al. 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 41 123

MLP Parser - Decision Module

Decision module (MLP) decides the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 42 123

Experiments & Dataset (MLP): CoNLL17

CoNLL17 Dataset

Dependency parsing of 81 treebanks in 49 languages

All treebanks use standardized annotation:
• 17 universal part-of-speech tags
• 37 universal dependency relations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 43 123

Experiments - Evaluation Metric

Labeled Attachment Score (LAS): the percentage of words assigned both the correct syntactic head and the correct dependency label.

Example on "Economic news had":
• Gold tree (labels ATT, SBJ): LAS = 1
• Pred 1 (labels OBJ, PRED, both words wrong): LAS = 0
• Pred 2 (labels ATT, OBJ, one of two words correct): LAS = (1/2) × 100 = 50%
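The metric is easy to compute directly; this sketch mirrors the toy trees above (the dict-based encoding is my own, not a standard format).

```python
# LAS: fraction of words whose predicted (head, label) pair both match gold.

def las(gold, pred):
    # gold/pred: dicts mapping word -> (head, label)
    correct = sum(pred[w] == hl for w, hl in gold.items())
    return 100.0 * correct / len(gold)

gold = {"Economic": ("news", "ATT"), "news": ("had", "SBJ")}
pred1 = {"Economic": ("news", "OBJ"), "news": ("had", "PRED")}  # both wrong
pred2 = {"Economic": ("news", "ATT"), "news": ("had", "OBJ")}   # one correct
```

`las(gold, pred1)` gives 0 and `las(gold, pred2)` gives 50, matching the slide.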

Omer Kırnap (Koc University) MSc Thesis September 27 2018 44 123

Experiments (MLP)

CoNLL 2017 Results (all treebanks LAS)

Ranked 1st among transition based parsers 5

5Source: CoNLL17 official results page.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 45 123

Contributions in CoNLL17

Omer Kırnap (Koc University) MSc Thesis September 27 2018 46 123

Context and Word Embeddings

Relative contributions of part-of-speech (p), word vector (v), and context vector (c):

Feats   Hungarian   En-ParTUT   Latvian
p       63.6        76.6        55.9
v       73.5        75.9        63
c       72.2        76          63.5
v-c     76          79          67.6
p-c     78          82.5        70.6
p-v     76.6        80.8        67.7
p-fb    74.7        79.7        66.3
p-v-c   79.3        83.2        74.2

Omer Kırnap (Koc University) MSc Thesis September 27 2018 47 123

Context and Word Embeddings


Context vectors provide an independent contribution on top of POS tags.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 48 123

Context and Word embeddings


Our BiLSTM language model word vectors perform better than the Facebook (fb) vectors.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 49 123

Context and Word embeddings


Both POS tags and context vectors make significant contributions on top of word vectors.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 50 123

Issues with MLP

However

Choosing the correct features of the parser state still remains critical.

We are unable to represent the whole parsing history with feature extraction.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 51 123

Solution

Find a recurrent architecture that can summarize the parsing history, as well as the word sequences in the buffer and stack.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 52 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to Universal Dependencies

CoNLL17
• Koc-University team with MLP Parser using Context Embeddings

CoNLL18
• KParse team with Tree-stack LSTM Parser using Context and Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 53 123

c Tree-stack LSTM Parser (CoNLL18)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 54 123

Related Work - Stack LSTM

Figure: Stack LSTM [Dyer et al., 2015]

Represent each component (σ, β, A) with an LSTM; modify the head word's embedding with the dependent's embedding.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 55 123

Problems with Stack LSTM

They only modify the stack's word embeddings.

Hidden states of the LSTMs are not updated except on reduce transitions.

Actions are not explicitly represented.

They only used word2vec embeddings [Mikolov et al., 2013].

Omer Kırnap (Koc University) MSc Thesis September 27 2018 56 123

Our solution

We propose

Context embeddings should improve parsing accuracy

Dependency relations should be explicitly represented

Morphological Features of a word may enhance parsing accuracy

Omer Kırnap (Koc University) MSc Thesis September 27 2018 57 123

Tree-stack LSTM Overview

Figure: Tree-stack LSTM overview. The σ-LSTM, β-LSTM, and Action-LSTM outputs are concatenated and fed to an MLP; the t-RNN composes the head word, dependent word, and dependency relation.

We propose the Tree-stack LSTM model with 4 components:
1. β-LSTM
2. σ-LSTM
3. Action-LSTM
4. Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 58 123

Tree-stack LSTM

Input Representation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 59 123

Input Representation

Action and Dependency Relation Embeddings

Every action is represented with a continuous vector.

Every dependency relation is represented with a continuous vector.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 60 123

Input Representation

We do not include an explicit feature extractor. We initialize each word representation by concatenating:

Character Based LSTMrsquos word vectors

Word Based BiLSTMrsquos context vectors

Part-of-speech (POS) vectors

Morph-feat vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 61 123

Input Representation

Morph-feat Vectors

Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs (FEATS of the word "It")

Figure: Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 62 123
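One plausible reading of the figure is sketched below: each `Key=Value` pair from the FEATS string gets its own embedding, and the word's morph-feat vector combines them (summation here is my assumption; the exact composition is a design choice, and all sizes and weights are illustrative).

```python
import numpy as np

rng = np.random.default_rng(2)
D = 8        # morph-feat embedding size (assumption)
table = {}   # lazily grown embedding table for Key=Value pairs

def feat_vector(pair):
    if pair not in table:
        table[pair] = rng.normal(size=D)
    return table[pair]

def morph_feat_embedding(feats):
    # Split the CoNLL-U FEATS string into Key=Value pairs and sum their vectors.
    pairs = feats.split("|")
    return np.sum([feat_vector(p) for p in pairs], axis=0)

v = morph_feat_embedding("Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs")
```

Because the table is shared, words with overlapping feature bundles get related morph-feat vectors.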

Tree-stack LSTM

Model Components:
1. β-LSTM
2. σ-LSTM
3. Action-LSTM
4. Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 63 123

β-LSTM

(Architecture figure with the β-LSTM component highlighted.)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 64 123

β-LSTM

Figure: Buffer's β-LSTM, running over the buffer words wi, wi+1, wi+2.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 65 123

σ-LSTM

(Architecture figure with the σ-LSTM component highlighted.)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 66 123

σ-LSTM

Figure: Stack's σ-LSTM, running over the stack items si, si+1, si+2.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 67 123

Action-LSTM

(Architecture figure with the Action-LSTM component highlighted.)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 68 123

Action-LSTM

Figure: Action-LSTM, running over the transition history.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 69 123

How are the components of the tree-stack LSTM connected?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 70 123

Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 71 123

Tree-RNN (t-RNN)

Figure: t-RNN, combining the head word, dependent word, and dependency relation.

whead_new = tanh(Wrnn · [whead_old ; dl ; wdep] + brnn)    (1)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 72 123
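Eq. (1) is a single tanh layer over the concatenation of old head, relation, and dependent vectors; a direct sketch, with all sizes and weights illustrative:

```python
import numpy as np

rng = np.random.default_rng(3)
D, R = 8, 4  # word and relation embedding sizes (assumptions)
W_rnn = rng.normal(size=(D, 2 * D + R)) / np.sqrt(2 * D + R)
b_rnn = np.zeros(D)

def t_rnn(w_head, d_rel, w_dep):
    # New head embedding = tanh(W · [head_old ; relation ; dependent] + b)
    x = np.concatenate([w_head, d_rel, w_dep])
    return np.tanh(W_rnn @ x + b_rnn)

new_head = t_rnn(rng.normal(size=D), rng.normal(size=R), rng.normal(size=D))
```

The output has the same size as a word embedding, so it can replace the head word's vector in the stack or buffer after a reduce.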

Tree-RNN with:
1. Left Transition
2. Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 73 123

Left Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 74 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: Each embedding is initiated by concatenating POS, language, and morph-feat embeddings.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 75 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: Stack's top LSTM is reduced.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 76 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: t-RNN calculates the new head embedding.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 77 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: β-LSTM recalculates its hidden states based on the new input.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 78 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: The tree-stack LSTM is ready to decide the next transition.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 79 123

Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 80 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: Each embedding is initiated by concatenating POS, language, and morph-feat embeddings.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 81 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: Stack's top LSTM is reduced.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 82 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: t-RNN calculates the new head embedding.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 83 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: σ-LSTM recalculates its hidden states from the new input.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 84 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: The tree-stack LSTM is ready to decide the next transition.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 85 123

Final overview of Tree-stack LSTM

(Figure: the full Tree-stack LSTM. The σ-LSTM, β-LSTM, and Action-LSTM outputs are concatenated and fed to the MLP, while the t-RNN composes head word, dependent word, and dependency relation.)

Overview

1 Introduction
• Overview of Dependency Parsing
• Transition Based Dependency Parsing

2 Related Work
• Linear Models and their Drawbacks
• Neural Network Models

3 Model
• Language Model
• MLP Parser
• Tree-stack LSTM Parser

4 Results
• MLP vs Tree-stack LSTM
• Morphological Feature Embeddings
• Static vs Dynamic Oracle Training
• Transfer Learning

5 Conclusion

6 Future Work & Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 87 123

4 Results & Comparisons

Omer Kırnap (Koc University) MSc Thesis September 27 2018 88 123

Results & Comparisons

Dataset

CoNLL17
• Dependency parsing of 81 treebanks in 49 languages
• All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations
• Koc-University ranked 7th out of 33 participants (1st among transition based parsers)

CoNLL18
• Dependency parsing of 82 treebanks in 57 languages
• All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations
• Koc-University ranked 16th out of 30 participants (2nd among transition based parsers)

Changes from CoNLL17 to CoNLL18: 1. Train/test split change 2. Annotation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 89 123

MLP vs Tree-stack LSTM

The CoNLL 2018 committee released comparison results of CoNLL17 and CoNLL18 systems tested on the same test sets.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 90 123

MLP vs Tree-stack LSTM

2 possible problems of official comparison

1 If the annotation of the treebank has been improved, the older parser is handicapped.

2 If the training-test split has changed and old training data are now in the test data, the old parser is favored undeservedly.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 91 123

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang code        MLP     Tree-stack
ru taiga (10k)   58.89   60.55
hu szeged (20k)  66.21   68.18
tr imst (50k)    56.78   58.75
ar padt (120k)   67.83   68.14
en ewt (205k)    74.87   75.77
cs cac (473k)    83.39   83.57

Tree-stack LSTM outperforms MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 92 123

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 93 123

MLP Parser

MLP

Figure Initial model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 94 123

Only Action LSTM

LSTM LSTM

Figure Only action LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 95 123

Only β-LSTM

LSTM LSTM LSTM

LSTM LSTM MLP

Figure Only β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 96 123

Only σ-LSTM

LSTM LSTM

LSTM LSTM MLP

Figure Only σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang code   MLP     Only Action   Only-β   Only-σ
hu szeged   66.21   66.87         66.94    67.03
sv lines    71.12   72.05         72.17    72.45
tr imst     57.12   56.87         57.02    57.12
ar padt     67.83   66.67         66.89    66.92
cs cac      83.89   82.23         83.13    83.17
en ewt      75.54   75.43         75.56    75.67

Table: Comparison between MLP and "Only" models.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN

(Architecture figure with the t-RNN component highlighted.)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 99 123

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang code            without t-RNN   with t-RNN
no nynorsklia (3k)   51.78           53.33
ru taiga (11k)       59.13           60.55
gl treegal (15k)     69.76           70.45
hu szeged (20k)      66.12           68.18
sv lines (49k)       74.04           75.46
tr imst (50k)        58.12           58.75
ar padt (120k)       68.04           68.14
en ewt (204k)        74.87           75.77
cs cac (473k)        82.89           83.57
cs pdt (1M)          81.17           81.164

t-RNN provides a comparative advantage for low-resource languages.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 100 123

Ablation Analysis

Overall results of ablation analysis

Lang        MLP     Only-A   Only-β   Only-σ   w/o t-RNN   all
hu szeged   66.21   66.87    66.94    67.03    66.12       68.18
sv lines    71.12   72.05    72.17    74.04    72.17       75.46
tr imst     57.12   56.87    57.02    57.12    58.12       58.75
ar padt     67.83   66.67    66.89    66.92    68.04       68.14
cs cac      83.89   82.23    83.13    83.17    82.89       83.57
en ewt      75.54   75.43    75.56    75.67    74.87       75.77

Tree-stack LSTM beats other model variations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of Ablation Experiments

t-RNN's performance contribution increases when the training size decreases.

σ-LSTM provides more useful information, independent of dataset size.

Interconnecting the model's components with the t-RNN makes the tree-stack LSTM more powerful for low-resource languages (ranked 10th overall and 2nd among transition based parsers).

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What does Morphological Feature Embedding provide

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental Settings: We divide the CoNLL18 UD v2.2 dataset into 4 parts based on the number of training tokens for each language, to better understand our contributions:

Languages having less than 20k tokens

Languages having more than 20k and less than 50k tokens

Languages having more than 50k and less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having less than 20k training tokens:

Lang code       Morph-Feats   no Morph-Feats   # of tokens
no nynorsklia   51.13         53.33            3583
ru taiga        58.32         60.55            10479
sme giella      52.78         53.39            16385
la perseus      49.93         51.6             18184
ug udt          52.78         53.39            19262
sl sst          46.72         48.77            19473
hu szeged       66.23         68.18            20166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having between 50k and 100k training tokens:

Lang code       Morph-Feats   no Morph-Feats   # of tokens
sv lines        72.18         74.81            48325
fr sequoia      84.36         82.17            50543
en gum          76.44         75.34            53686
ko gsd          73.74         72.54            56687
eu bdt          74.55         73.32            72974
nl lassysmall   76.7          75.8             75134
gl ctg          79.02         79.018           79327
lv lvtb         72.33         72.24            80666
id gsd          75.76         73.97            97531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having more than 100k training tokens:

Lang code   Morph-Feats   no Morph-Feats   # of tokens
fa seraji   81.18         81.12            121064
bg btb      84.53         84.55            124336
en ewt      75.77         75.682           204585
ar padt     68.02         68.14            223881
de gsd      71.59         71.32            263804
ca ancora   85.89         85.874           417587
es ancora   84.99         84.78            444617
cs cac      83.57         83.63            472608
cs pdt      81.43         82.12            1173282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle: training transitions follow the gold moves. Dynamic oracle: training transitions follow the predicted moves.

In both cases, the log-probability of the gold moves is maximized.

(Tree-stack LSTM architecture figure.)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123
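The difference between the two regimes fits in a few lines. Both score the gold move; they differ only in which move the parser follows to the next state. Everything here (the toy model, the loss stand-in, the fixed gold sequence) is illustrative; in a full dynamic oracle the gold move is recomputed for the current, possibly erroneous state.

```python
import random

def train_step(state, gold_move, predict, dynamic, losses):
    pred_move = predict(state)
    losses.append(0.0 if pred_move == gold_move else 1.0)  # stand-in for -log p(gold)
    followed = pred_move if dynamic else gold_move          # the one difference
    return state + [followed]  # toy "state": the history of moves taken

random.seed(0)
predict = lambda state: random.choice(["shift", "left", "right"])
gold_sequence = ["shift", "shift", "left", "right"]

histories = {}
for dynamic in (False, True):
    state, losses = [], []
    for gold_move in gold_sequence:
        state = train_step(state, gold_move, predict, dynamic, losses)
    histories[dynamic] = state
```

With the static oracle the visited states are exactly the gold states; with the dynamic oracle the model is trained on states its own (possibly wrong) predictions reach.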

Static vs Dynamic Oracle Training

Figure: Results are very close for languages with less than 20k training tokens.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure: Results are very close for languages with between 20k and 50k training tokens.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure: Results are very close for languages with more than 50k training tokens.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning:
1. Using very limited data to train an LM for word and context vectors, and using them to train a parser from scratch
2. Using Facebook's word vectors to train a parser [Bojanowski et al., 2017]
3. Using my own word and context vectors trained on a different language from the same language family
4. Applying transfer learning with a pre-trained parser

Language        (1)            (2)     (3)     (4)
af afribooms    not provided   75.46   77.43   78.12
kk ktb          20.19          22.31   21.96   23.86
bxr bdt         7.64           9.76    9.93    8.98
kmr mg          20.12          22.57   22.78   23.39

Table: LAS values for strategies (1), (2), (3), and (4).

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the most beneficial.

Training the LM from scratch on very limited data does not yield useful word and context vectors.

Our word and context vectors are still more useful than Facebook's [Bojanowski et al., 2017].

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition based parsers can only build projective trees.6

6Figure fromhttpstplingfiluuse sarakurser5LN455-2014lectures5LN455-F8pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123
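Projectivity can be checked directly from the head indices: a tree is projective iff no two arcs cross. This is a generic sketch (the encoding with heads as a list and 0 for the root is a common convention, not taken from the thesis):

```python
def is_projective(heads):
    # heads[i] = index of word (i+1)'s head; 0 denotes the root,
    # words are numbered from 1 in sentence order.
    arcs = [(min(h, d), max(h, d)) for d, h in enumerate(heads, start=1)]
    for i, j in arcs:
        for k, l in arcs:
            if i < k < j < l:  # arc (k, l) crosses arc (i, j)
                return False
    return True
```

For "news had effect" with heads [2, 0, 2] the tree is projective; making word 1 depend on word 3 across the root-attached word 2 (heads [3, 0, 2]) introduces a crossing arc.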

Projective vs Non-projective

We compared our model with the best model at different projectivity ratios:

Language      Projectivity   Best (LAS)   Our (LAS)
grc perseus   90.7           79.39        55.03 (20)
eu bdt        95.13          84.22        74.13 (17)
hu szeged     97.8           82.66        68.18 (14)
da ddt        98.26          86.28        76.40 (17)
en gum        99.6           85.05        76.44 (15)
gl treegal    100            74.25        70.45 (10)
gl ctg        100            82.12        79.45 (14)

Table: Our model's performance gap decreases as the projectivity ratio increases.

7From the official results page and our projectivity table.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusion: we introduced "Context, Word, and Morph-feat" embeddings and showed their contribution in transition based dependency parsing.

Our Tree-stack LSTM outperformed the MLP by removing hand-crafted feature engineering.

Tree-stack LSTM performed better on low-resource languages.

When the training dataset size increases, tree-stack LSTM loses its advantage.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization, morphological tagging, and dependency parsing performed better. Some also jointly train a language model together with pre-trained embeddings.

Attention Mechanism

Applying attention between the σ-LSTM, β-LSTM, or Action-LSTM states may bring a performance improvement.

Morphological Features

Finding a different way to represent morphological features.

Dynamic Oracle vs Beam Training

Although I tried both of them, I could not obtain a performance improvement. There may be convergence problems with our loss function, and other losses (e.g., CRF) may solve this problem.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Ömer Kırnap, Erenay Dayanık, and Deniz Yuret. 2018. Tree-stack LSTM in Transition Based Dependency Parsing. In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Ömer Kırnap, Berkay Furkan Önder, and Deniz Yuret. 2017. Parsing with Context Embeddings. In Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann, Carlos Gómez-Rodríguez, and Giorgio Satta. 2011. Dynamic programming algorithms for transition-based dependency parsers. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Volume 1, pages 673-682.

S. Kübler, R. McDonald, and J. Nivre. 2009. Dependency Parsing. Morgan & Claypool, US.

Chris Dyer, Miguel Ballesteros, Wang Ling, Austin Matthews, and Noah A. Smith. 2015. Transition-based dependency parsing with stack long short-term memory. CoRR abs/1505.08075.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123

Introduction

Dependency Parsing Categorization

Grammar based: relying on a formal grammar defining a formal language; asking whether a given input sentence is in the language defined by the grammar or not.

Data-driven: making essential use of machine learning from linguistic data in order to parse new sentences.

3From S. Kübler, R. McDonald, and J. Nivre. 2009. Dependency Parsing. Morgan & Claypool, US.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 6 123

Introduction

Data-driven Dependency Parsing

Graph based algorithms: using maximum spanning tree algorithms from graph theory.

Transition based algorithms: capitalizing on greedy stack based algorithms to build the dependency tree with incremental steps in linear time.

4From S. Kübler, R. McDonald, and J. Nivre. 2009. Dependency Parsing. Morgan & Claypool, US.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 7 123

Introduction

Transition Based Dependency Parsing

Transition System Abstract machine with a set of configurations(states) and transitions We use the ArcHybrid transition system[Kuhlmann et al 2011]

Configurations (σ β A)bull σ Stack of tree fragments initially emptybull β Buffer of words initially containing the whole sentencebull A Set of dependency arcs (head relation modifier) initially empty

Transitionsbull shift(σ b|βA) = (σ|b βA)bull leftd(σ|s b|βA) = (σ b|βA cup (b d s))bull rightd(σ|s|t βA) = (σ|s βA cup (s d t))

Omer Kırnap (Koc University) MSc Thesis September 27 2018 8 123

An example parsing of a sentence

Omer Kırnap (Koc University) MSc Thesis September 27 2018 9 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 10 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 11 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 12 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 13 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 14 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 15 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 16 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 17 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 18 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 19 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 20 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 21 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 22 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 23 123

Problem Definition

Find a model learning to decide correct transition from current state

Omer Kırnap (Koc University) MSc Thesis September 27 2018 24 123

2 Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 25 123

Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 26 123

Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 27 123

Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 28 123

Related Work

Neural Networks for Feature Conjunctions

Neural networks can handle feature conjunctions and nonlinearityHoweverImpractical for high dimensional inputs they scale linearly in inputdimensions (in both time and space assuming fixed number of hiddenunits)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 29 123

Related Work

Solution Using dense embeddings for input features

Omer Kırnap (Koc University) MSc Thesis September 27 2018 30 123

Overview

1 IntroductionOverview of Dependency ParsingTransition Based Dependency Parsing

2 Related WorkLinear Models and their DrawbacksNeural Network Models

3 ModelLanguage ModelMLP ParserTree-stack LSTM Parser

4 ResultsMLP vs Tree-stack LSTMMorphological Feature EmbeddingsStatic vs Dynamic Oracle TrainingTransfer Learning

5 Conclusion6 Future Work amp Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 31 123

3 Model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 32 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to UniversalDependencies

CoNLL17bull Koc-University team with MLP Parser using Context Embeddings

CoNLL18bull KParse team with Tree-stack LSTM Parser using Context and

Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 33 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to UniversalDependencies

CoNLL17

bull Koc-University team with MLP Parser using ContextEmbeddings

CoNLL18

bull KParse team with Tree-stack LSTM Parser using Context and Morph-featEmbeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 34 123

a Language Model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 35 123

Language Model (LM)

LM is used to obtain Context and Word embeddings with twocomponents

Character Based LSTM extracts word vectors

Word Based BiLSTM extracts context vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 36 123

Language Model - Word vectors

Character based LSTM generates word Vectors

Figure Character LSTM from Kırnap et al 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 37 123

Language Model - Context Vectors

Word based BiLSTM generates Context Vectors

Figure Word BiLSTM from Kırnap et al 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 38 123

b MLP Parser (CoNLL17)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 39 123

MLP Parser

MLP Parser consists of 4 components

Character Based LSTM extracts word vectors

Word Based BiLSTM extracts context vectors

Feature extractor describes current state

Decision module (MLP) decides the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 40 123

MLP Parser - Feature Extraction

Feature extractor describes current state

Figure Kırnap et al 2017Omer Kırnap (Koc University) MSc Thesis September 27 2018 41 123

MLP Parser - Decision Module

Decision module (MLP) decides the next transition
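The decision module can be sketched in a few lines of numpy. This is a minimal illustration, not the thesis implementation: the dimensions, the single hidden layer, and the three-way transition inventory are toy assumptions.

```python
import numpy as np

def mlp_decide(features, W1, b1, W2, b2):
    """Score candidate transitions (shift, left_d, right_d) for the
    current parser state, given its extracted feature vector."""
    hidden = np.tanh(W1 @ features + b1)   # single hidden layer
    logits = W2 @ hidden + b2              # one score per transition
    exp = np.exp(logits - logits.max())    # numerically stable softmax
    return exp / exp.sum()

# Toy, hypothetical dimensions; the real model is much larger.
rng = np.random.default_rng(0)
feat_dim, hid_dim, n_transitions = 8, 16, 3
features = rng.normal(size=feat_dim)
W1, b1 = rng.normal(size=(hid_dim, feat_dim)), np.zeros(hid_dim)
W2, b2 = rng.normal(size=(n_transitions, hid_dim)), np.zeros(n_transitions)

probs = mlp_decide(features, W1, b1, W2, b2)
best = int(np.argmax(probs))  # index of the transition the parser takes
```

At parse time, the highest-scoring legal transition is applied and the state is re-featurized for the next decision.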


Experiments & Dataset (MLP): CoNLL17

CoNLL17 Dataset

Dependency parsing of 81 treebanks in 49 languages

All treebanks use standardized annotation:
• 17 universal part-of-speech tags
• 37 universal dependency relations


Experiments - Evaluation Metric

Labeled Attachment Score (LAS): the percentage of words assigned both the correct syntactic head and the correct dependency label.

Example for "Economic news had":
Gold tree (arcs SBJ, ATT): LAS = 100%
Prediction 1 (arcs PRED, OBJ, both wrong): LAS = 0%
Prediction 2 (arcs OBJ, ATT, one of two correct): LAS = (1/2) × 100 = 50%
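The LAS computation in the example can be sketched directly. This is a small self-contained sketch; representing each tree as a dict from word to its (head, label) pair is a simplification of the real CoNLL evaluation script.

```python
def las(gold, pred):
    """Labeled Attachment Score: the percentage of words whose predicted
    (head, label) pair matches the gold annotation exactly."""
    correct = sum(1 for word in gold if pred.get(word) == gold[word])
    return 100.0 * correct / len(gold)

# The example above, for the fragment "Economic news had":
gold  = {"Economic": ("news", "ATT"), "news": ("had", "SBJ")}
pred1 = {"Economic": ("news", "OBJ"), "news": ("had", "PRED")}
pred2 = {"Economic": ("news", "ATT"), "news": ("had", "OBJ")}

print(las(gold, gold))   # 100.0
print(las(gold, pred1))  # 0.0
print(las(gold, pred2))  # 50.0
```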


Experiments (MLP)

CoNLL 2017 Results (all treebanks LAS)

Ranked 1st among transition-based parsers.

Source: CoNLL17 official results page

Contributions in CoNLL17


Context and Word Embeddings

Relative contributions of part-of-speech (p), word vector (v), context vector (c):

Feats   Hungarian  En-ParTUT  Latvian
p       63.6       76.6       55.9
v       73.5       75.9       63
c       72.2       76         63.5
v-c     76         79         67.6
p-c     78         82.5       70.6
p-v     76.6       80.8       67.7
p-fb    74.7       79.7       66.3
p-v-c   79.3       83.2       74.2


Context and Word Embeddings


Context vectors provide an independent contribution on top of POS tags.


Context and Word embeddings


Our BiLSTM language model word vectors perform better than FB vectors.


Context and Word embeddings


Both POS tags and context vectors make significant contributions on top of word vectors.


Issues with MLP

However

Choosing the correct representation of the parser state remains critical.

We are unable to represent the whole parsing history with feature extraction.


Solution

Find a recurrent architecture that can summarize the parsing history as well as the word sequences in the buffer and stack.



c Tree-stack LSTM Parser (CoNLL18)


Related Work - Stack LSTM

Figure Stack LSTM [Dyer et al 2015]

Represent each component (σ, β, A) with an LSTM.
Modify the head word's embedding with the dependent's embedding.


Problems with Stack LSTM

They only modify the stack's word embeddings.

Hidden states of the LSTMs are not updated except on reduce transitions.

Actions are not explicitly represented.

They only used word2vec embeddings [Mikolov et al 2013].


Our solution

We propose

Context embeddings should improve parsing accuracy

Dependency relations should be explicitly represented

Morphological Features of a word may enhance parsing accuracy


Tree-stack LSTM Overview

Figure: Tree-stack LSTM overview. The t-RNN composes the head word, dependent word, and dependency relation; the β-LSTM, σ-LSTM, and Action-LSTM outputs are concatenated and fed into an MLP.

We propose the Tree-stack LSTM model with 4 components:

1. β-LSTM
2. σ-LSTM
3. Action-LSTM
4. Tree-RNN


Tree-stack LSTM

Input Representation


Input Representation

Action and Dependency Relation Embeddings

Every action is represented with a continuous vector.

Every dependency relation is represented with a continuous vector.


Input Representation

We do not include an explicit feature extractor. We initialize the word representation by concatenating:

Character Based LSTMrsquos word vectors

Word Based BiLSTMrsquos context vectors

Part-of-speech (POS) vectors

Morph-feat vectors


Input Representation

Morph-feat Vectors

Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs IT It

Figure Morph-feat Embeddings
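One way to realize morph-feat vectors is to split the UD feature string on `|` and combine one learned vector per `Key=Value` pair. Summing the per-feature vectors is an assumption made for this sketch (the thesis may combine them differently), as are the toy dimensions and the randomly initialized lookup table.

```python
import numpy as np

DIM = 4                 # toy embedding size, a hypothetical choice
rng = np.random.default_rng(1)
feat_table = {}         # one vector per "Key=Value" morphological feature

def feat_vec(feat):
    """Look up (or lazily create, in this toy sketch) a feature vector."""
    if feat not in feat_table:
        feat_table[feat] = rng.normal(size=DIM)
    return feat_table[feat]

def morph_feat_vector(feats_str):
    """Embed a UD feature string such as 'Case=Nom|Gender=Neut|...'
    by summing per-feature vectors (the summing is an assumption)."""
    return np.sum([feat_vec(f) for f in feats_str.split("|")], axis=0)

m = morph_feat_vector("Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs")

# A word's input representation concatenates the four vector types:
word_v    = rng.normal(size=DIM)  # character-LSTM word vector
context_v = rng.normal(size=DIM)  # BiLSTM context vector
pos_v     = rng.normal(size=DIM)  # POS embedding
word_input = np.concatenate([word_v, context_v, pos_v, m])
```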


Tree-stack LSTM

Model Components:
1. β-LSTM
2. σ-LSTM
3. Action-LSTM
4. Tree-RNN



β-LSTM

Figure: Buffer's β-LSTM, an LSTM running over the buffer words w_i, w_i+1, w_i+2.



σ-LSTM

Figure: Stack's σ-LSTM, an LSTM running over the stack items s_i, s_i+1, s_i+2.



Action-LSTM

Figure: Action-LSTM, an LSTM running over the sequence of previous actions.


How are the components of the tree-stack LSTM connected?


Tree-RNN


Tree-RNN (t-RNN)

Figure: t-RNN composing the dependent word, dependency relation, and head word.

w_head_new = tanh(W_rnn · [w_head_old ; d_l ; w_dep] + b_rnn)    (1)
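Equation (1) is straightforward to write down in numpy. The dimensions and weight values below are toy assumptions for illustration.

```python
import numpy as np

def t_rnn(w_head, d_label, w_dep, W_rnn, b_rnn):
    """Equation (1): compose the old head embedding, the dependency-label
    embedding, and the dependent embedding into a new head embedding."""
    x = np.concatenate([w_head, d_label, w_dep])
    return np.tanh(W_rnn @ x + b_rnn)

# Toy, hypothetical dimensions:
rng = np.random.default_rng(2)
word_dim, label_dim = 5, 3
W_rnn = rng.normal(size=(word_dim, 2 * word_dim + label_dim))
b_rnn = np.zeros(word_dim)

new_head = t_rnn(rng.normal(size=word_dim), rng.normal(size=label_dim),
                 rng.normal(size=word_dim), W_rnn, b_rnn)
```

The tanh keeps the composed head embedding in the same range as the original word embeddings, so it can be fed back into the stack.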


Tree-RNN with

1. Left Transition
2. Right Transition


Left Transition


Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

1. Each embedding is initialized by concatenating POS, language, and morph-feat embeddings.
2. The stack's top LSTM is reduced.
3. The t-RNN calculates the new head embedding.
4. The β-LSTM recalculates its hidden state based on the new input.
5. The tree-stack LSTM is ready for the next transition.

Right Transition


Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

1. Each embedding is initialized by concatenating POS, language, and morph-feat embeddings.
2. The stack's top LSTM is reduced.
3. The t-RNN calculates the new head embedding.
4. The σ-LSTM recalculates its hidden state from the new input.
5. The tree-stack LSTM is ready for the next transition.
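The left and right transitions, together with shift, can be sketched as a toy ArcHybrid executor. This is a simplified sketch: words are integers, legality checks are omitted, and a left transition with an empty buffer attaches to the artificial root, an assumption made to keep the example short.

```python
def parse(n_words, transitions):
    """Run an ArcHybrid derivation. Words are 1..n_words (0 is the root);
    arcs are (head, label, dependent) triples as in the slides."""
    stack, buffer, arcs = [], list(range(1, n_words + 1)), set()
    for t in transitions:
        if t == "shift":              # (σ, b|β, A) -> (σ|b, β, A)
            stack.append(buffer.pop(0))
        elif t[0] == "left":          # add (b, d, s): buffer front heads stack top
            s = stack.pop()
            head = buffer[0] if buffer else 0   # root attachment if buffer is empty
            arcs.add((head, t[1], s))
        elif t[0] == "right":         # add (s, d, t): next stack item heads stack top
            dep = stack.pop()
            arcs.add((stack[-1], t[1], dep))
    return arcs

# Parse a 2-word sentence where word 2 heads word 1:
arcs = parse(2, ["shift", ("left", "att"), "shift", ("left", "root")])
print(arcs == {(2, "att", 1), (0, "root", 2)})  # True
```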

Final overview of Tree-stack LSTM

Figure: The complete tree-stack LSTM. The t-RNN composes head word, dependent word, and dependency relation; the LSTM outputs are concatenated and fed into an MLP.



4 Results & Comparisons


Results & Comparisons

CoNLL17 Dataset:
• Dependency parsing of 81 treebanks in 49 languages
• All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations
• Koc-University ranked 7th out of 33 participants (1st among transition-based parsers)

CoNLL18 Dataset:
• Dependency parsing of 82 treebanks in 57 languages
• All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations
• Koc-University ranked 16th out of 30 participants (2nd among transition-based parsers)

Changes from CoNLL17 to CoNLL18: 1. Train/test split  2. Annotation


MLP vs Tree-stack LSTM

The CoNLL 2018 committee released comparison results of CoNLL17 and CoNLL18 systems tested on the same test sets.


MLP vs Tree-stack LSTM

Two possible problems with the official comparison:

1. If the annotation of the treebank has improved, the older parser is handicapped.

2. If the training-test split has changed and old training data are now in the test data, the old parser is favored undeservedly.


MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code        MLP    Tree-stack
ru taiga (10k)   58.89  60.55
hu szeged (20k)  66.21  68.18
tr imst (50k)    56.78  58.75
ar padt (120k)   67.83  68.14
en ewt (205k)    74.87  75.77
cs cac (473k)    83.39  83.57

Tree-stack LSTM outperforms MLP


Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM


MLP Parser

Figure: Initial model (MLP)


Only Action LSTM

Figure: Only action LSTM


Only β-LSTM

Figure: Only β-LSTM


Only σ-LSTM

Figure: Only σ-LSTM


Ablation Analysis Results

Lang Code  MLP    Only Action  Only-β  Only-σ
hu szeged  66.21  66.87        66.94   67.03
sv lines   71.12  72.05        72.17   72.45
tr imst    57.12  56.87        57.02   57.12
ar padt    67.83  66.67        66.89   66.92
cs cac     83.89  82.23        83.13   83.17
en ewt     75.54  75.43        75.56   75.67

Table: Comparison between MLP and "Only" models.



Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang Code           without t-RNN  with t-RNN
no nynorsklia (3k)  51.78          53.33
ru taiga (11k)      59.13          60.55
gl treegal (15k)    69.76          70.45
hu szeged (20k)     66.12          68.18
sv lines (49k)      74.04          75.46
tr imst (50k)       58.12          58.75
ar padt (120k)      68.04          68.14
en ewt (204k)       74.87          75.77
cs cac (473k)       82.89          83.57
cs pdt (1M)         81.17          81.164

t-RNN provides a comparative advantage for low-resource languages.


Ablation Analysis

Overall results of ablation analysis

Lang       MLP    Only A  Only-β  Only-σ  w/o t-RNN  all
hu szeged  66.21  66.87   66.94   67.03   66.12      68.18
sv lines   71.12  72.05   72.17   74.04   72.17      75.46
tr imst    57.12  56.87   57.02   57.12   58.12      58.75
ar padt    67.83  66.67   66.89   66.92   68.04      68.14
cs cac     83.89  82.23   83.13   83.17   82.89      83.57
en ewt     75.54  75.43   75.56   75.67   74.87      75.77

Tree-stack LSTM beats other model variations


Ablation Analysis

Conclusions of Ablation Experiments

The t-RNN's performance contribution increases as the training size decreases.

The σ-LSTM provides more useful information independent of dataset size.

Interconnecting the model's components with the t-RNN makes the tree-stack LSTM more powerful for low-resource languages (ranked 10th overall and 2nd among transition-based parsers).


What does Morphological Feature Embedding provide?


Contribution of Morph-feat Embeddings

Experimental Settings: We divide the CoNLL18 UD 2.2 dataset into 4 parts based on the number of training tokens for each language, to better understand our contributions:

Languages having less than 20k tokens

Languages having more than 20k and less than 50k tokens

Languages having more than 50k and less than 100k tokens

Languages having 100k tokens or more


Contribution of Morph-feat embeddings

Morph-feat experiments for languages having less than 20k training tokens:

Lang code      Morph-Feats  no Morph-Feats  # of tokens
no nynorsklia  51.13        53.33           3583
ru taiga       58.32        60.55           10479
sme giella     52.78        53.39           16385
la perseus     49.93        51.6            18184
ug udt         52.78        53.39           19262
sl sst         46.72        48.77           19473
hu szeged      66.23        68.18           20166

Not useful for languages having less than 20k training tokens


Contribution of Morph-feat embeddings

Morph-feat experiments for languages having between 50k and 100k tokens:

Lang code    Morph-Feats  no Morph-Feats  # of tokens
sv lines     72.18        74.81           48325
fr sequoia   84.36        82.17           50543
en gum       76.44        75.34           53686
ko gsd       73.74        72.54           56687
eu bdt       74.55        73.32           72974
nl lassymal  76.7         75.8            75134
gl ctg       79.02        79.018          79327
lv lvtb      72.33        72.24           80666
id gsd       75.76        73.97           97531

Beneficial for languages with 50k-100k training tokens


Contribution of Morph-feat embeddings

Morph-feat experiments for languages having more than 100k training tokens:

Lang code  Morph-Feats  no Morph-Feats  # of tokens
fa seraji  81.18        81.12           121064
bg btb     84.53        84.55           124336
en ewt     75.77        75.682          204585
ar padt    68.02        68.14           223881
de gsd     71.59        71.32           263804
ca ancora  85.89        85.874          417587
es ancora  84.99        84.78           444617
cs cac     83.57        83.63           472608
cs pdt     81.43        82.12           1173282

Neutral for languages having more than 100k training tokens


Static vs Dynamic Oracle Training

Static oracle: transitions follow the gold moves.
Dynamic oracle: transitions follow the predicted moves.

In both cases, the log probability of the gold moves is maximized.
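The difference between the two training regimes can be sketched with a stubbed scoring step. This is toy numpy code: the scores array stands in for the real model's transition scores, and state handling is omitted.

```python
import numpy as np

def softmax(z):
    e = np.exp(z - z.max())
    return e / e.sum()

def training_step(scores, gold_move, dynamic):
    """One oracle-training step: the loss always maximizes log p of the
    gold move; the oracles differ only in which move the parser follows."""
    loss = -np.log(softmax(scores)[gold_move])   # -log p(gold) in both cases
    follow = int(np.argmax(scores)) if dynamic else gold_move
    return loss, follow

scores = np.array([0.1, 2.0, -1.0])   # stub model scores; move 1 is preferred
loss_s, follow_s = training_step(scores, gold_move=2, dynamic=False)
loss_d, follow_d = training_step(scores, gold_move=2, dynamic=True)
# Same loss either way, but the static oracle follows the gold move (2)
# while the dynamic oracle follows the model's own prediction (1).
```

Following the model's own predictions exposes training to states the static oracle never visits, which is the motivation for dynamic oracles.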


Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k


Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k


Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k


How about languages with less than 20k training tokens?


Transfer Learning

There are 4 possible types of transfer learning:

1. Using very limited data to train an LM for word and context vectors, and using them to train a parser from scratch
2. Using Facebook's word vectors to train a parser [Bojanowski et al. 2017]
3. Using my own word and context vectors, trained on a different language from the same language family
4. Applying transfer learning with a pre-trained parser

Language      (1)           (2)    (3)    (4)
af afribooms  not provided  75.46  77.43  78.12
kk ktb        20.19         22.31  21.96  23.86
bxr bdt       7.64          9.76   9.93   8.98
kmr mg        20.12         22.57  22.78  23.39

Table: LAS values for strategies (1), (2), (3), and (4).


Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the most beneficial.

From-scratch LM training does not yield useful word and context vectors.

Our word and context vectors are still more useful than Facebook's [Bojanowski et al. 2017].


Projectivity

Transition-based parsers can only build projective trees.

Figure from http://stp.lingfil.uu.se/~sara/kurser/5LN455-2014/lectures/5LN455-F8.pdf
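Projectivity can be checked by testing whether any two dependency arcs cross. A small self-contained sketch: heads are given as a dict from dependent to head (1-indexed words, 0 for the artificial root), an illustrative representation.

```python
def is_projective(heads):
    """heads maps each word (1-indexed) to its head (0 = root).
    A dependency tree is projective iff no two arcs cross."""
    arcs = [(min(h, d), max(h, d)) for d, h in heads.items()]
    for l1, r1 in arcs:
        for l2, r2 in arcs:
            if l1 < l2 < r1 < r2:   # the arcs interleave, i.e. cross
                return False
    return True

print(is_projective({1: 2, 2: 0, 3: 2}))        # True: no crossing arcs
print(is_projective({1: 3, 2: 4, 3: 0, 4: 3}))  # False: arc 3->1 crosses arc 4->2
```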


Projective vs Non-projective

We compared our model with the best model for different projectivity ratios.

Language     Projectivity  Best (LAS)  Our (LAS)
grc perseus  90.7          79.39       55.03 (20)
eu bdt       95.13         84.22       74.13 (17)
hu szeged    97.8          82.66       68.18 (14)
da ddt       98.26         86.28       76.40 (17)
en gum       99.6          85.05       76.44 (15)
gl treegal   100           74.25       70.45 (10)
gl ctg       100           82.12       79.45 (14)

Table: Our model's performance gap decreases as the projectivity ratio increases.

From the official results page and our projectivity table.

Conclusions


Conclusion

In conclusion, we introduced "Context, Word, and Morph-feat" embeddings and showed their contribution to transition-based dependency parsing.

Our Tree-stack LSTM outperformed the MLP by removing hand-crafted feature engineering.

The Tree-stack LSTM performed better on low-resource languages.

When the training dataset size increases, the tree-stack LSTM loses its advantage.


Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization, morphological tagging, and dependency parsing performed better. Some also jointly train a language model together with pre-trained embeddings.

Attention Mechanism

Applying attention between the σ-LSTM, β-LSTM, or Action-LSTM states may bring a performance improvement.

Morphological Features

Finding a different way to represent morphological features.

Dynamic Oracle vs Beam Training

Although I tried both of them, I could not obtain a performance improvement. There may be convergence problems with our loss function, and other losses (e.g., CRF) may solve this problem.


Publications

Ömer Kırnap, Erenay Dayanık, and Deniz Yuret. 2018. Tree-stack LSTM in Transition Based Dependency Parsing. In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Ömer Kırnap, Berkay Furkan Önder, and Deniz Yuret. 2017. Parsing with Context Embeddings. In Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.


References

Marco Kuhlmann, Carlos Gómez-Rodríguez, and Giorgio Satta. 2011. Dynamic programming algorithms for transition-based dependency parsers. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Volume 1, pages 673-682.

S. Kübler, R. McDonald, and J. Nivre. 2009. Dependency Parsing. Morgan & Claypool, US.

Chris Dyer, Miguel Ballesteros, Wang Ling, Austin Matthews, and Noah A. Smith. 2015. Transition-based dependency parsing with stack long short-term memory. CoRR abs/1505.08075.

Tomas Mikolov, Kai Chen, Greg Corrado, and Jeffrey Dean. 2013. Efficient estimation of word representations in vector space. arXiv:1301.3781.

Piotr Bojanowski, Edouard Grave, Armand Joulin, and Tomas Mikolov. 2017. Enriching word vectors with subword information. Transactions of the Association for Computational Linguistics, 5:135-146.


Thank you for your attention


Questions


Page 5: Transition Based Dependency Parsing with Deep LearningTransition Based Dependency Parsing with Deep Learning Omer K rnap Ko˘c University okirnap@ku.edu.tr September 27, 2018 Omer

Introduction

Why do we need dependency parsing

Dependencies resolve ambiguity

Useful for some down-stream tasks in NLP

2

2Figure from httpwwwphontroncomslidesnlp-programming-en-11-dependpdfOmer Kırnap (Koc University) MSc Thesis September 27 2018 5 123

Introduction

Dependency Parsing Categorization

Grammar BasedRelying on a formal grammardefining a formal languageasking whether a given inputsentence is in the languagedefined by the grammar or not

Data-drivenMaking essential use of machinelearning from linguistic data in orderto parse new sentences

3

3From S Kbler R McDonald and J Nivre 2009 Dependency parsing Morgan ampClaypool US

Omer Kırnap (Koc University) MSc Thesis September 27 2018 6 123

Introduction

Data-driven Dependency Parsing

Graph Based Algorithms

Using maximum spanning tree algorithms from graph theory

Transition Based Algorithms

Capitalizing on greedy stack based algorithms to build dependency treewith incremental steps in linear time

4

4From S Kbler R McDonald and J Nivre 2009 Dependency parsing Morgan ampClaypool US

Omer Kırnap (Koc University) MSc Thesis September 27 2018 7 123

Introduction

Transition Based Dependency Parsing

Transition System Abstract machine with a set of configurations(states) and transitions We use the ArcHybrid transition system[Kuhlmann et al 2011]

Configurations (σ β A)bull σ Stack of tree fragments initially emptybull β Buffer of words initially containing the whole sentencebull A Set of dependency arcs (head relation modifier) initially empty

Transitionsbull shift(σ b|βA) = (σ|b βA)bull leftd(σ|s b|βA) = (σ b|βA cup (b d s))bull rightd(σ|s|t βA) = (σ|s βA cup (s d t))

Omer Kırnap (Koc University) MSc Thesis September 27 2018 8 123

An example parsing of a sentence

Omer Kırnap (Koc University) MSc Thesis September 27 2018 9 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 10 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 11 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 12 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 13 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 14 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 15 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 16 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 17 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 18 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 19 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 20 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 21 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 22 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 23 123

Problem Definition

Find a model learning to decide correct transition from current state

Omer Kırnap (Koc University) MSc Thesis September 27 2018 24 123

2 Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 25 123

Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 26 123

Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 27 123

Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 28 123

Related Work

Neural Networks for Feature Conjunctions

Neural networks can handle feature conjunctions and nonlinearityHoweverImpractical for high dimensional inputs they scale linearly in inputdimensions (in both time and space assuming fixed number of hiddenunits)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 29 123

Related Work

Solution Using dense embeddings for input features

Omer Kırnap (Koc University) MSc Thesis September 27 2018 30 123

Overview

1 IntroductionOverview of Dependency ParsingTransition Based Dependency Parsing

2 Related WorkLinear Models and their DrawbacksNeural Network Models

3 ModelLanguage ModelMLP ParserTree-stack LSTM Parser

4 ResultsMLP vs Tree-stack LSTMMorphological Feature EmbeddingsStatic vs Dynamic Oracle TrainingTransfer Learning

5 Conclusion6 Future Work amp Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 31 123

3 Model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 32 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to UniversalDependencies

CoNLL17bull Koc-University team with MLP Parser using Context Embeddings

CoNLL18bull KParse team with Tree-stack LSTM Parser using Context and

Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 33 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to UniversalDependencies

CoNLL17

bull Koc-University team with MLP Parser using ContextEmbeddings

CoNLL18

bull KParse team with Tree-stack LSTM Parser using Context and Morph-featEmbeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 34 123

a Language Model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 35 123

Language Model (LM)

LM is used to obtain Context and Word embeddings with twocomponents

Character Based LSTM extracts word vectors

Word Based BiLSTM extracts context vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 36 123

Language Model - Word vectors

Character based LSTM generates word Vectors

Figure Character LSTM from Kırnap et al 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 37 123

Language Model - Context Vectors

Word based BiLSTM generates Context Vectors

Figure Word BiLSTM from Kırnap et al 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 38 123

b MLP Parser (CoNLL17)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 39 123

MLP Parser

MLP Parser consists of 4 components

Character Based LSTM extracts word vectors

Word Based BiLSTM extracts context vectors

Feature extractor describes current state

Decision module (MLP) decides the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 40 123

MLP Parser - Feature Extraction

Feature extractor describes current state

Figure Kırnap et al 2017Omer Kırnap (Koc University) MSc Thesis September 27 2018 41 123

MLP Parser - Decision Module

Decision module (MLP) decides the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 42 123

Experiments amp Dataset (MLP) CoNLL17

CoNLL17 Dataset

Dependency parsing of 81 treebanks in 49 languages

All treebanks use standardized annotationbull 17 universal part-of-speech tagsbull 37 universal dependency relations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 43 123

Experiments - Evaluation Metric

Labeled Attachment Score (LAS)The percentage of words correctly assigned both the correct syntactic headand the correct dependency label

Economic news hadGold Tree LAS 1

SBJATT

Economic news had

PREDOBJPred 1 LAS 0

Economic news had

OBJATTPred 2 LAS (frac12)100

Omer Kırnap (Koc University) MSc Thesis September 27 2018 44 123

Experiments (MLP)

CoNLL 2017 Results (all treebanks LAS)

Ranked 1st among transition based parsers 5

5Source CoNLL17 official results pageOmer Kırnap (Koc University) MSc Thesis September 27 2018 45 123

Contributions in CoNLL17

Omer Kırnap (Koc University) MSc Thesis September 27 2018 46 123

Context and Word Embeddings

Relative contributions of part-of-speech (p) word vector (v)context vector (c)

Feats Hungarian En-ParTUT Latvianp 636 766 559

v 735 759 63

c 722 76 635

v-c 76 79 676

p-c 78 825 706

p-v 766 808 677

p-fb 747 797 663

p-v-c 793 832 742

Omer Kırnap (Koc University) MSc Thesis September 27 2018 47 123

Context and Word Embeddings

Relative contributions of part-of-speech (p) word vector (v)context vector (c)

Feats Hungarian En-ParTUT Latvianp 636 766 559

v 735 759 63

c 722 76 635

v-c 76 79 676

p-c 78 825 706

p-v 766 808 677

p-fb 747 797 663

p-v-c 793 832 742

Context vectors provide independent contribution on top ofPOS tags

Omer Kırnap (Koc University) MSc Thesis September 27 2018 48 123

Context and Word embeddings

Feats Hungarian En-ParTUT Latvianp 636 766 559

v 735 759 63

c 722 76 635

v-c 76 79 676

p-c 78 825 706

p-v 766 808 677p-fb 747 797 663

p-v-c 793 832 742

Our BiLSTM language model word vectors perform betterthan FB vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 49 123

Context and Word embeddings

Feats Hungarian En-ParTUT Latvianp 636 766 559

v 735 759 63

c 722 76 635

v-c 76 79 676

p-c 78 825 706

p-v 766 808 677

p-fb 747 797 663

p-v-c 793 832 742

Both POS tags and context vectors have significantcontributions on top of word vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 50 123

Issues with MLP

However

Choosing correct state of parser still remains critical

We are unable to represent whole parsing history with featureextracting

Omer Kırnap (Koc University) MSc Thesis September 27 2018 51 123

Solution

Find a recurrent architecture such that it can summarize the parsinghistory as well as word sequences in a buffer and stack

Omer Kırnap (Koc University) MSc Thesis September 27 2018 52 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to UniversalDependencies

CoNLL17

bull Koc-University team with MLP Parser using Context Embeddings

CoNLL18

bull KParse team with Tree-stack LSTM Parser usingContext and Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 53 123

c Tree-stack LSTM Parser (CoNLL18)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 54 123

Related Work - Stack LSTM

Figure Stack LSTM [Dyer et al 2015]

Represent each component (σ, β, A) with an LSTM.

Modify the head word's embedding with the dependent's embedding.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 55 123

Problems with Stack LSTM

They only modify the stack's word embeddings.

Hidden states of the LSTMs are not updated unless a reduce occurs.

Actions are not explicitly represented.

They only used word2vec embeddings [Mikolov et al. 2013].

Omer Kırnap (Koc University) MSc Thesis September 27 2018 56 123

Our solution

We propose

Context embeddings should improve parsing accuracy

Dependency relations should be explicitly represented

Morphological Features of a word may enhance parsing accuracy

Omer Kırnap (Koc University) MSc Thesis September 27 2018 57 123

Tree-stack LSTM Overview

[Figure: Tree-stack LSTM — σ-, β-, and Action-LSTMs linked by the t-RNN; their outputs are concatenated and fed to an MLP]

We propose the Tree-stack LSTM model with 4 components:

β-LSTM, σ-LSTM, Action-LSTM, Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 58 123

Tree-stack LSTM

Input Representation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 59 123

Input Representation

Action and Dependency Relation Embeddings

Every action is represented with a continuous vector.

Every dependency relation is represented with a continuous vector.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 60 123

Input Representation

We do not include an explicit feature extractor. We initialize the word representation by concatenating:

Character Based LSTMrsquos word vectors

Word Based BiLSTMrsquos context vectors

Part-of-speech (POS) vectors

Morph-feat vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 61 123
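As a sketch, the concatenation above might look like this (the vector sizes here are illustrative assumptions, not the thesis' actual dimensions):

```python
# Illustrative sketch: vectors as plain Python lists (sizes are assumptions).
WORD, CTX, POS, MORPH = 100, 200, 16, 16

def word_input(word_vec, ctx_vec, pos_vec, feat_vec):
    """Build the parser input for one word by concatenating the
    character-LSTM word vector, BiLSTM context vector, POS vector,
    and morph-feat vector."""
    return word_vec + ctx_vec + pos_vec + feat_vec

x = word_input([0.0] * WORD, [0.0] * CTX, [0.0] * POS, [0.0] * MORPH)
```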

Input Representation

Morph-feat Vectors

Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs IT It

Figure Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 62 123
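A minimal sketch of how such a feature string could be embedded — here each `Key=Value` pair gets its own vector and the pairs are averaged; the thesis may combine the per-feature vectors differently:

```python
import random

random.seed(0)
DIM = 8                 # embedding size (illustrative assumption)
feat_table = {}         # one embedding per "Key=Value" pair, created on demand

def morph_feat_vector(feats):
    """Embed a UD morphological feature string such as
    'Case=Nom|Number=Sing' by averaging per-feature embeddings."""
    pairs = feats.split("|") if feats not in ("", "_") else []
    if not pairs:
        return [0.0] * DIM
    for p in pairs:
        if p not in feat_table:
            feat_table[p] = [random.gauss(0, 1) for _ in range(DIM)]
    return [sum(feat_table[p][i] for p in pairs) / len(pairs)
            for i in range(DIM)]

v = morph_feat_vector("Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs")
```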

Tree-stack LSTM

Model Components:
1. β-LSTM
2. σ-LSTM
3. Action-LSTM
4. Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 63 123

β-LSTM

[Figure: Tree-stack LSTM — σ-, β-, and Action-LSTMs linked by the t-RNN; their outputs are concatenated and fed to an MLP]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 64 123

β-LSTM

[Figure: Buffer's β-LSTM — an LSTM reading the buffer words w_i, w_i+1, w_i+2]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 65 123

σ-LSTM

[Figure: Tree-stack LSTM — σ-, β-, and Action-LSTMs linked by the t-RNN; their outputs are concatenated and fed to an MLP]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 66 123

σ-LSTM

[Figure: Stack's σ-LSTM — an LSTM reading the stack items s_i, s_i+1, s_i+2]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 67 123

Action-LSTM

[Figure: Tree-stack LSTM — σ-, β-, and Action-LSTMs linked by the t-RNN; their outputs are concatenated and fed to an MLP]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 68 123

Action-LSTM

[Figure: Action-LSTM — an LSTM over the sequence of past transitions]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 69 123

How are the components of the tree-stack LSTM connected?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 70 123

Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 71 123

Tree-RNN (t-RNN)

t-RNN

Dependent word

Dependency Relation

Head word

Figure t-RNN

w_head_new = tanh(W_rnn * [w_head_old; d_l; w_dep] + b_rnn)    (1)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 72 123
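Equation (1) can be sketched directly; the dimensions and weight initialization below are illustrative assumptions:

```python
import math
import random

random.seed(1)
D = 4  # embedding size (illustrative)
# W_rnn maps the concatenation [w_head; d_l; w_dep] (length 3*D) back to D.
W_rnn = [[random.gauss(0, 0.5) for _ in range(3 * D)] for _ in range(D)]
b_rnn = [0.0] * D

def trnn_update(w_head, d_rel, w_dep):
    """Eq. (1): fold a dependent and its relation label into the head's
    embedding, producing the new head representation."""
    x = w_head + d_rel + w_dep  # concatenation
    return [math.tanh(sum(W_rnn[i][j] * x[j] for j in range(3 * D)) + b_rnn[i])
            for i in range(D)]

new_head = trnn_update([1.0] * D, [0.0] * D, [1.0] * D)
```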

Tree-RNN with

1 Left Transition2 Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 73 123

Left Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 74 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

LSTM LSTM LSTM LSTM

Left transition

t-RNN

Dependency Relation

Head    Dependent

Figure: Each embedding is initialized by concatenating POS, language, and morph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 75 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

Figure: Stack's top LSTM is reduced
Omer Kırnap (Koc University) MSc Thesis September 27 2018 76 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 77 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure: β-LSTM recalculates its hidden state based on the new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 78 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure: Tree-stack LSTM is ready for the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 79 123

Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 80 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

LSTM LSTM LSTM LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure: Each embedding is initialized by concatenating POS, language, and morph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 81 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure: Stack's top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 82 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 83 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure: σ-LSTM recalculates its hidden state from the new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 84 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure: Tree-stack LSTM is ready for the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 85 123
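The left and right transitions above, together with shift, can be sketched as pure functions on the configuration (σ, β, A); this is a toy illustration with integer word positions, not the thesis implementation:

```python
def shift(stack, buffer, arcs):
    """shift(σ, b|β, A) = (σ|b, β, A)"""
    return stack + [buffer[0]], buffer[1:], arcs

def left(stack, buffer, arcs, d):
    """left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)}):
    the buffer front b becomes the head of the stack top s."""
    s, b = stack[-1], buffer[0]
    return stack[:-1], buffer, arcs | {(b, d, s)}

def right(stack, buffer, arcs, d):
    """right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)}):
    s becomes the head of t, which is popped from the stack."""
    s, t = stack[-2], stack[-1]
    return stack[:-1], buffer, arcs | {(s, d, t)}

# Toy run on words 1..3, where word 2 heads word 1 and word 3 heads word 2.
state = ([], [1, 2, 3], set())
state = shift(*state)
state = left(*state, "amod")
state = shift(*state)
state = left(*state, "nsubj")
```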

Final overview of Tree-stack LSTM

[Figure: Tree-stack LSTM — σ-, β-, and Action-LSTMs linked by the t-RNN; their outputs are concatenated and fed to an MLP]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 86 123

Overview

1. Introduction
   Overview of Dependency Parsing
   Transition Based Dependency Parsing
2. Related Work
   Linear Models and their Drawbacks
   Neural Network Models
3. Model
   Language Model
   MLP Parser
   Tree-stack LSTM Parser
4. Results
   MLP vs Tree-stack LSTM
   Morphological Feature Embeddings
   Static vs Dynamic Oracle Training
   Transfer Learning
5. Conclusion
6. Future Work & Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 87 123

4 Results amp Comparisons

Omer Kırnap (Koc University) MSc Thesis September 27 2018 88 123

Results amp Comparisons

Dataset

CoNLL17:
• Dependency parsing of 81 treebanks in 49 languages
• All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations
• Koc-University ranked 7th out of 33 participants (1st among transition based parsers)

CoNLL18:
• Dependency parsing of 82 treebanks in 57 languages
• All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations
• Koc-University ranked 16th out of 30 participants (2nd among transition based parsers)

Changes from CoNLL17 to CoNLL18: 1. Train/test split change 2. Annotation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 89 123

MLP vs Tree-stack LSTM

The CoNLL 2018 committee released comparison results of CoNLL17 and CoNLL18 systems tested under the same test sets.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 90 123

MLP vs Tree-stack LSTM

Two possible problems with the official comparison:

1. If the annotation of the treebank has improved, the older parser is handicapped.

2. If the training-test split has changed and the old training data are now in the test data, the old parser is favored undeservedly.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 91 123

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code        MLP     Tree-stack
ru taiga (10k)   58.89   60.55
hu szeged (20k)  66.21   68.18
tr imst (50k)    56.78   58.75
ar padt (120k)   67.83   68.14
en ewt (205k)    74.87   75.77
cs cac (473k)    83.39   83.57

Tree-stack LSTM outperforms MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 92 123

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 93 123

MLP Parser

MLP

Figure Initial model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 94 123

Only Action LSTM

LSTM LSTM

Figure Only action LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 95 123

Only β-LSTM

LSTM LSTM LSTM

LSTM LSTM MLP

Figure Only β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 96 123

Only σ-LSTM

LSTM LSTM

LSTM LSTM MLP

Figure Only σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang Code   MLP     Only Action   Only-β   Only-σ
hu szeged   66.21   66.87         66.94    67.03
sv lines    71.12   72.05         72.17    72.45
tr imst     57.12   56.87         57.02    57.12
ar padt     67.83   66.67         66.89    66.92
cs cac      83.89   82.23         83.13    83.17
en ewt      75.54   75.43         75.56    75.67

Table: Comparison between MLP and "Only" models

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN

[Figure: Tree-stack LSTM — σ-, β-, and Action-LSTMs linked by the t-RNN; their outputs are concatenated and fed to an MLP]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 99 123

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang Code            without t-RNN   with t-RNN
no nynorsklia (3k)   51.78           53.33
ru taiga (11k)       59.13           60.55
gl treegal (15k)     69.76           70.45
hu szeged (20k)      66.12           68.18
sv lines (49k)       74.04           75.46
tr imst (50k)        58.12           58.75
ar padt (120k)       68.04           68.14
en ewt (204k)        74.87           75.77
cs cac (473k)        82.89           83.57
cs pdt (1M)          81.17           81.16

t-RNN provides a comparative advantage for low-resource languages.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 100 123

Ablation Analysis

Overall results of ablation analysis

Lang        MLP     Only A   Only-β   Only-σ   w/o t-RNN   all
hu szeged   66.21   66.87    66.94    67.03    66.12       68.18
sv lines    71.12   72.05    72.17    74.04    72.17       75.46
tr imst     57.12   56.87    57.02    57.12    58.12       58.75
ar padt     67.83   66.67    66.89    66.92    68.04       68.14
cs cac      83.89   82.23    83.13    83.17    82.89       83.57
en ewt      75.54   75.43    75.56    75.67    74.87       75.77

Tree-stack LSTM beats the other model variations.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of the Ablation Experiments

t-RNN's performance contribution increases as the training size decreases.

σ-LSTM provides more useful information, independent of dataset size.

Interconnecting the model's components with t-RNN makes the tree-stack LSTM more powerful for low-resource languages (ranked 10th overall and 2nd among transition based parsers).

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What does Morphological Feature Embedding provide

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental Settings
We divide the CoNLL18 UD dataset into 4 parts based on the number of training tokens for each language, to better understand our contributions:

Languages having less than 20k tokens

Languages having more than 20k less than 50k tokens

Languages having more than 50k less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having less than 20k training tokens:

Lang code       Morph-Feats   no Morph-Feats   # of tokens
no nynorsklia   51.13         53.33            3583
ru taiga        58.32         60.55            10479
sme giella      52.78         53.39            16385
la perseus      49.93         51.60            18184
ug udt          52.78         53.39            19262
sl sst          46.72         48.77            19473
hu szeged       66.23         68.18            20166

Not useful for languages having less than 20k training tokens.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having between 50k and 100k training tokens:

Lang code     Morph-Feats   no Morph-Feats   # of tokens
sv lines      72.18         74.81            48325
fr sequoia    84.36         82.17            50543
en gum        76.44         75.34            53686
ko gsd        73.74         72.54            56687
eu bdt        74.55         73.32            72974
nl lassymal   76.70         75.80            75134
gl ctg        79.02         79.018           79327
lv lvtb       72.33         72.24            80666
id gsd        75.76         73.97            97531

Beneficial for languages with 50k-100k training tokens.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having more than 100k training tokens:

Lang code   Morph-Feats   no Morph-Feats   # of tokens
fa seraji   81.18         81.12            121064
bg btb      84.53         84.55            124336
en ewt      75.77         75.682           204585
ar padt     68.02         68.14            223881
de gsd      71.59         71.32            263804
ca ancora   85.89         85.874           417587
es ancora   84.99         84.78            444617
cs cac      83.57         83.63            472608
cs pdt      81.43         82.12            1173282

Neutral for languages having more than 100k training tokens.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle: transitions follow the gold moves.
Dynamic oracle: transitions follow the predicted moves.

In both cases, the log-probability of the gold moves is maximized.

[Figure: Tree-stack LSTM — σ-, β-, and Action-LSTMs linked by the t-RNN; their outputs are concatenated and fed to an MLP]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning:
1. Using very limited data to train an LM for word and context vectors, and using them to train a parser from scratch
2. Using Facebook's word vectors to train a parser [Bojanowski et al. 2017]
3. Using my own word and context vectors, trained on a different language from the same language family
4. Applying transfer learning with a pre-trained parser

Language       (1)            (2)     (3)     (4)
af afribooms   not provided   75.46   77.43   78.12
kk ktb         20.19          22.31   21.96   23.86
bxr bdt        7.64           9.76    9.93    8.98
kmr mg         20.12          22.57   22.78   23.39

Table: LAS values for strategies (1), (2), (3), and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the most beneficial.

Training the LM from scratch does not yield useful word and context vectors.

Our word and context vectors are still more useful than Facebook's [Bojanowski et al. 2017].

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition based parsers can only build projective trees.

Figure from https://stp.lingfil.uu.se/~sara/kurser/5LN455-2014/lectures/5LN455-F8.pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123
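A minimal projectivity check, using one common formulation (a tree is projective iff no two dependency arcs cross); this is an illustrative sketch, not taken from the slides:

```python
def is_projective(heads):
    """heads[i] is the head of word i+1 (words are 1-indexed, 0 = root).
    Returns True iff no two dependency arcs cross."""
    arcs = [(h, i + 1) for i, h in enumerate(heads)]
    spans = [tuple(sorted(a)) for a in arcs]
    for i, (a1, b1) in enumerate(spans):
        for a2, b2 in spans[i + 1:]:
            # Two spans cross when one starts strictly inside the other
            # and ends strictly outside it.
            if a1 < a2 < b1 < b2 or a2 < a1 < b2 < b1:
                return False
    return True
```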

Projective vs Non-projective

We compared our model with the best model at different projectivity ratios:

Language      Projectivity   Best (LAS)   Our (LAS)
grc perseus   90.7           79.39        55.03 (20)
eu bdt        95.13          84.22        74.13 (17)
hu szeged     97.8           82.66        68.18 (14)
da ddt        98.26          86.28        76.40 (17)
en gum        99.6           85.05        76.44 (15)
gl treegal    100            74.25        70.45 (10)
gl ctg        100            82.12        79.45 (14)

Table: Our model's performance gap decreases as the projectivity ratio increases.

From the official results page and our projectivity table.
Omer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusion:
We introduced "Context, Word, and Morph-feat" embeddings and showed their contribution to transition based dependency parsing.

Our Tree-stack LSTM outperformed the MLP by removing hand-crafted feature engineering.

Tree-stack LSTM performed better on low-resource languages.

When the training dataset size increases, the tree-stack LSTM loses its advantage.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization, morphological tagging, and dependency parsing performed better. Some also jointly train a language model together with pre-trained embeddings.

Attention Mechanism

Applying attention between σ-LSTM, β-LSTM, or Action-LSTM states may bring a performance improvement.

Morphological Features

Finding a different way to represent morphological features.

Dynamic Oracle vs Beam Training

Although I tried both of them, I could not obtain a performance improvement. There may be convergence problems with our loss function, and other losses (e.g., CRF) may solve this problem.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap, Erenay Dayanık, and Deniz Yuret. 2018. Tree-stack LSTM in Transition Based Dependency Parsing. In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap, Berkay Furkan Onder, and Deniz Yuret. 2017. Parsing with Context Embeddings. In Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann, Carlos Gómez-Rodríguez, and Giorgio Satta. 2011. Dynamic programming algorithms for transition-based dependency parsers. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Volume 1. Association for Computational Linguistics, pages 673-682.

S. Kübler, R. McDonald, and J. Nivre. 2009. Dependency Parsing. Morgan & Claypool, US.

Chris Dyer, Miguel Ballesteros, Wang Ling, Austin Matthews, and Noah A. Smith. 2015. Transition-based dependency parsing with stack long short-term memory. CoRR abs/1505.08075.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123

  • Introduction
    • Overview of Dependency Parsing
    • Transition Based Dependency Parsing
      • Related Work
        • Linear Models and their Drawbacks
        • Neural Network Models
          • Model
            • Language Model
            • MLP Parser
            • Tree-stack LSTM Parser
              • Results
                • MLP vs Tree-stack LSTM
                • Morphological Feature Embeddings
                • Static vs Dynamic Oracle Training
                • Transfer Learning
                  • Conclusion
                  • Future Work amp Discussions
Page 6: Transition Based Dependency Parsing with Deep LearningTransition Based Dependency Parsing with Deep Learning Omer K rnap Ko˘c University okirnap@ku.edu.tr September 27, 2018 Omer

Introduction

Dependency Parsing Categorization

Grammar BasedRelying on a formal grammardefining a formal languageasking whether a given inputsentence is in the languagedefined by the grammar or not

Data-drivenMaking essential use of machinelearning from linguistic data in orderto parse new sentences

3

3From S Kbler R McDonald and J Nivre 2009 Dependency parsing Morgan ampClaypool US

Omer Kırnap (Koc University) MSc Thesis September 27 2018 6 123

Introduction

Data-driven Dependency Parsing

Graph Based Algorithms

Using maximum spanning tree algorithms from graph theory

Transition Based Algorithms

Capitalizing on greedy stack based algorithms to build dependency treewith incremental steps in linear time

4

4From S Kbler R McDonald and J Nivre 2009 Dependency parsing Morgan ampClaypool US

Omer Kırnap (Koc University) MSc Thesis September 27 2018 7 123

Introduction

Transition Based Dependency Parsing

Transition System Abstract machine with a set of configurations(states) and transitions We use the ArcHybrid transition system[Kuhlmann et al 2011]

Configurations (σ β A)bull σ Stack of tree fragments initially emptybull β Buffer of words initially containing the whole sentencebull A Set of dependency arcs (head relation modifier) initially empty

Transitionsbull shift(σ b|βA) = (σ|b βA)bull leftd(σ|s b|βA) = (σ b|βA cup (b d s))bull rightd(σ|s|t βA) = (σ|s βA cup (s d t))

Omer Kırnap (Koc University) MSc Thesis September 27 2018 8 123

An example parsing of a sentence

Omer Kırnap (Koc University) MSc Thesis September 27 2018 9 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 10 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 11 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 12 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 13 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 14 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 15 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 16 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 17 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 18 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 19 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 20 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 21 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 22 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 23 123

Problem Definition

Find a model learning to decide correct transition from current state

Omer Kırnap (Koc University) MSc Thesis September 27 2018 24 123

2 Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 25 123

Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 26 123

Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 27 123

Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 28 123

Related Work

Neural Networks for Feature Conjunctions

Neural networks can handle feature conjunctions and nonlinearityHoweverImpractical for high dimensional inputs they scale linearly in inputdimensions (in both time and space assuming fixed number of hiddenunits)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 29 123

Related Work

Solution Using dense embeddings for input features

Omer Kırnap (Koc University) MSc Thesis September 27 2018 30 123

Overview

1 IntroductionOverview of Dependency ParsingTransition Based Dependency Parsing

2 Related WorkLinear Models and their DrawbacksNeural Network Models

3 ModelLanguage ModelMLP ParserTree-stack LSTM Parser

4 ResultsMLP vs Tree-stack LSTMMorphological Feature EmbeddingsStatic vs Dynamic Oracle TrainingTransfer Learning

5 Conclusion6 Future Work amp Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 31 123

3 Model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 32 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to UniversalDependencies

CoNLL17bull Koc-University team with MLP Parser using Context Embeddings

CoNLL18bull KParse team with Tree-stack LSTM Parser using Context and

Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 33 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to UniversalDependencies

CoNLL17

bull Koc-University team with MLP Parser using ContextEmbeddings

CoNLL18

bull KParse team with Tree-stack LSTM Parser using Context and Morph-featEmbeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 34 123

a Language Model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 35 123

Language Model (LM)

LM is used to obtain Context and Word embeddings with twocomponents

Character Based LSTM extracts word vectors

Word Based BiLSTM extracts context vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 36 123

Language Model - Word vectors

Character based LSTM generates word Vectors

Figure Character LSTM from Kırnap et al 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 37 123

Language Model - Context Vectors

Word based BiLSTM generates Context Vectors

Figure Word BiLSTM from Kırnap et al 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 38 123

b MLP Parser (CoNLL17)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 39 123

MLP Parser

MLP Parser consists of 4 components

Character Based LSTM extracts word vectors

Word Based BiLSTM extracts context vectors

Feature extractor describes current state

Decision module (MLP) decides the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 40 123

MLP Parser - Feature Extraction

Feature extractor describes current state

Figure Kırnap et al 2017Omer Kırnap (Koc University) MSc Thesis September 27 2018 41 123

MLP Parser - Decision Module

Decision module (MLP) decides the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 42 123

Experiments amp Dataset (MLP) CoNLL17

CoNLL17 Dataset

Dependency parsing of 81 treebanks in 49 languages

All treebanks use standardized annotationbull 17 universal part-of-speech tagsbull 37 universal dependency relations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 43 123

Experiments - Evaluation Metric

Labeled Attachment Score (LAS)The percentage of words correctly assigned both the correct syntactic headand the correct dependency label

Economic news hadGold Tree LAS 1

SBJATT

Economic news had

PREDOBJPred 1 LAS 0

Economic news had

OBJATTPred 2 LAS (frac12)100

Omer Kırnap (Koc University) MSc Thesis September 27 2018 44 123

Experiments (MLP)

CoNLL 2017 Results (all treebanks LAS)

Ranked 1st among transition based parsers 5

5Source CoNLL17 official results pageOmer Kırnap (Koc University) MSc Thesis September 27 2018 45 123

Contributions in CoNLL17

Omer Kırnap (Koc University) MSc Thesis September 27 2018 46 123

Context and Word Embeddings

Relative contributions of part-of-speech (p) word vector (v)context vector (c)

Feats Hungarian En-ParTUT Latvianp 636 766 559

v 735 759 63

c 722 76 635

v-c 76 79 676

p-c 78 825 706

p-v 766 808 677

p-fb 747 797 663

p-v-c 793 832 742

Omer Kırnap (Koc University) MSc Thesis September 27 2018 47 123

Context and Word Embeddings

Relative contributions of part-of-speech (p) word vector (v)context vector (c)

Feats Hungarian En-ParTUT Latvianp 636 766 559

v 735 759 63

c 722 76 635

v-c 76 79 676

p-c 78 825 706

p-v 766 808 677

p-fb 747 797 663

p-v-c 793 832 742

Context vectors provide independent contribution on top ofPOS tags

Omer Kırnap (Koc University) MSc Thesis September 27 2018 48 123

Context and Word embeddings

Feats Hungarian En-ParTUT Latvianp 636 766 559

v 735 759 63

c 722 76 635

v-c 76 79 676

p-c 78 825 706

p-v 766 808 677p-fb 747 797 663

p-v-c 793 832 742

Our BiLSTM language model word vectors perform betterthan FB vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 49 123

Context and Word embeddings

Feats Hungarian En-ParTUT Latvianp 636 766 559

v 735 759 63

c 722 76 635

v-c 76 79 676

p-c 78 825 706

p-v 766 808 677

p-fb 747 797 663

p-v-c 793 832 742

Both POS tags and context vectors have significantcontributions on top of word vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 50 123

Issues with MLP

However

Choosing correct state of parser still remains critical

We are unable to represent whole parsing history with featureextracting

Omer Kırnap (Koc University) MSc Thesis September 27 2018 51 123

Solution

Find a recurrent architecture such that it can summarize the parsinghistory as well as word sequences in a buffer and stack

Omer Kırnap (Koc University) MSc Thesis September 27 2018 52 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to UniversalDependencies

CoNLL17

bull Koc-University team with MLP Parser using Context Embeddings

CoNLL18

bull KParse team with Tree-stack LSTM Parser usingContext and Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 53 123

c Tree-stack LSTM Parser (CoNLL18)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 54 123

Related Work - Stack LSTM

Figure Stack LSTM [Dyer et al 2015]

Represent each component (σ β A) with an LSTMModifying head wordrsquos embedding with dependent embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 55 123

Problems with Stack LSTM

They only modify stackrsquos word embeddings

Hidden states of LSTMS are not updated unless reduce

Actions are not explicitly represented

They only used word2vec embeddings [Mikolov et al 2013]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 56 123

Our solution

We propose

Context embeddings should improve parsing accuracy

Dependency relations should be explicitly represented

Morphological Features of a word may enhance parsing accuracy

Omer Kırnap (Koc University) MSc Thesis September 27 2018 57 123

Tree-stack LSTM Overview

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

We propose Tree-stack LSTM model with 4 components

β-LSTMσ-LSTMAction-LSTMTree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 58 123

Tree-stack LSTM

Input Representation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 59 123

Input Representation

Action and Dependency Relation Embeddings

Every action is represented with continuous vector

Every dependency relation is represented with continuous vector

Omer Kırnap (Koc University) MSc Thesis September 27 2018 60 123

Input Representation

We do not include explicit feature extractor We initiated wordrepresentation by concatenating

Character Based LSTMrsquos word vectors

Word Based BiLSTMrsquos context vectors

Part-of-speech (POS) vectors

Morph-feat vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 61 123

Input Representation

Morp-feat Vectors

Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs IT It

Figure Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 62 123

Tree-stack LSTM

Model Components1 β-LSTM2 σ-LSTM3 Action-LSTM4 Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 63 123

β-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 64 123

β-LSTM

LSTM LSTM LSTM

wi+2wi+1wi

Figure Bufferrsquos β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 65 123

σ-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 66 123

σ-LSTM

LSTM LSTM LSTM

si si+1 si+2

Figure Stackrsquos σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 67 123

Action-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 68 123

Action-LSTM

LSTM LSTM LSTM

Figure Action-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 69 123

How do components of tree-stack LSTM are connected

Omer Kırnap (Koc University) MSc Thesis September 27 2018 70 123

Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 71 123

Tree-RNN (t-RNN)

t-RNN

Dependent word

Dependency Relation

Head word

Figure t-RNN

whead new = tanh(Wrnn lowast [whead old dl wdep] + brnn) (1)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 72 123
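Equation (1) can be implemented directly; the weight values and the size D below are illustrative:

```python
import numpy as np

rng = np.random.default_rng(0)
D = 64  # embedding size (illustrative)

# t-RNN parameters: the input is [head ; relation ; dependent], 3*D wide.
W_rnn = rng.normal(scale=0.1, size=(D, 3 * D))
b_rnn = np.zeros(D)

def trnn_compose(w_head_old, d_l, w_dep):
    """Eq. (1): fold a dependent and its relation label into the head embedding."""
    x = np.concatenate([w_head_old, d_l, w_dep])
    return np.tanh(W_rnn @ x + b_rnn)

new_head = trnn_compose(rng.normal(size=D), rng.normal(size=D), rng.normal(size=D))
assert new_head.shape == (D,)
```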

Tree-RNN with

1 Left Transition 2 Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 73 123

Left Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 74 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure Each embedding is initialized by concatenating POS, language and morph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 75 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure Stack's top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 76 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure t-RNN calculates the new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 77 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure β-LSTM recalculates its hidden state based on the new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 78 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure Tree-stack LSTM is ready to predict the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 79 123

Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 80 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure Each embedding is initialized by concatenating POS, language and morph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 81 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure Stack's top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 82 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure t-RNN calculates the new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 83 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure σ-LSTM recalculates its hidden state from the new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 84 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure Tree-stack LSTM is ready to predict the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 85 123
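The left_d and right_d rules on these slides, together with shift from the arc-hybrid system definition, can be sketched as plain list operations. This is an illustrative sketch: the token indices, dependency labels, and the ROOT-at-front convention are assumptions, not details from the slides.

```python
# A minimal arc-hybrid transition system following the left_d / right_d rules.
def shift(stack, buffer, arcs):
    stack.append(buffer.pop(0))

def left(stack, buffer, arcs, d):
    # left_d: buffer front b becomes head of stack top s with relation d
    s, b = stack.pop(), buffer[0]
    arcs.add((b, d, s))

def right(stack, buffer, arcs, d):
    # right_d: second stack item s becomes head of stack top t with relation d
    t, s = stack.pop(), stack[-1]
    arcs.add((s, d, t))

# Parse "economic news had": 0=ROOT 1=economic 2=news 3=had
stack, buffer, arcs = [], [0, 1, 2, 3], set()
shift(stack, buffer, arcs)          # push ROOT
shift(stack, buffer, arcs)          # push economic
left(stack, buffer, arcs, "amod")   # news <- economic
shift(stack, buffer, arcs)          # push news
left(stack, buffer, arcs, "nsubj")  # had <- news
shift(stack, buffer, arcs)          # push had
right(stack, buffer, arcs, "root")  # ROOT -> had
assert arcs == {(2, "amod", 1), (3, "nsubj", 2), (0, "root", 3)}
```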

Final overview of Tree-stack LSTM

[Figure: Tree-stack LSTM overview: β-LSTM, σ-LSTM and Action-LSTM hidden states are concatenated with the t-RNN head embeddings and fed to an MLP that predicts the next transition]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 86 123
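The concatenation feeding the MLP in this overview can be sketched as a small scorer. The sizes, the ReLU nonlinearity, and the unlabeled three-way action set below are illustrative assumptions, not details from the slides:

```python
import numpy as np

rng = np.random.default_rng(0)
FEAT, H, N_ACTIONS = 364, 128, 3   # sizes are illustrative

# One hidden-layer MLP scoring {shift, left, right} (dependency labels omitted)
W1 = rng.normal(scale=0.1, size=(H, FEAT)); b1 = np.zeros(H)
W2 = rng.normal(scale=0.1, size=(N_ACTIONS, H)); b2 = np.zeros(N_ACTIONS)

def decide(features):
    """Pick the next transition from the concatenated parser-state features."""
    h = np.maximum(0.0, W1 @ features + b1)    # ReLU hidden layer
    return int(np.argmax(W2 @ h + b2))

a = decide(rng.normal(size=FEAT))
assert a in (0, 1, 2)
```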

Overview

1 Introduction: Overview of Dependency Parsing; Transition Based Dependency Parsing

2 Related Work: Linear Models and their Drawbacks; Neural Network Models

3 Model: Language Model; MLP Parser; Tree-stack LSTM Parser

4 Results: MLP vs Tree-stack LSTM; Morphological Feature Embeddings; Static vs Dynamic Oracle Training; Transfer Learning

5 Conclusion

6 Future Work & Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 87 123

4 Results & Comparisons

Omer Kırnap (Koc University) MSc Thesis September 27 2018 88 123

Results & Comparisons

Dataset

CoNLL17:

Dependency parsing of 81 treebanks in 49 languages

All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations

Koc-University ranked 7th out of 33 participants (1st among transition based parsers)

CoNLL18:

Dependency parsing of 82 treebanks in 57 languages

All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations

Koc-University ranked 16th out of 30 participants (2nd among transition based parsers)

Changes from CoNLL17 to CoNLL18: 1. Train/test split change 2. Annotation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 89 123

MLP vs Tree-stack LSTM

The CoNLL 2018 committee released comparison results of CoNLL17 and CoNLL18 systems tested on the same test sets

Omer Kırnap (Koc University) MSc Thesis September 27 2018 90 123

MLP vs Tree-stack LSTM

Two possible problems with the official comparison:

1 If the annotation of the treebank is improved, the older parser is handicapped

2 If the training-test split has changed and the old training data are now in the test data, the old parser is favored undeservedly

Omer Kırnap (Koc University) MSc Thesis September 27 2018 91 123

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code        MLP    Tree-stack
ru taiga (10k)   58.89  60.55
hu szeged (20k)  66.21  68.18
tr imst (50k)    56.78  58.75
ar padt (120k)   67.83  68.14
en ewt (205k)    74.87  75.77
cs cac (473k)    83.39  83.57

Tree-stack LSTM outperforms MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 92 123

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 93 123

MLP Parser


Figure Initial model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 94 123

Only Action LSTM


Figure Only action LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 95 123

Only β-LSTM


Figure Only β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 96 123

Only σ-LSTM


Figure Only σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang Code   MLP    Only Action  Only-β  Only-σ
hu szeged   66.21  66.87        66.94   67.03
sv lines    71.12  72.05        72.17   72.45
tr imst     57.12  56.87        57.02   57.12
ar padt     67.83  66.67        66.89   66.92
cs cac      83.89  82.23        83.13   83.17
en ewt      75.54  75.43        75.56   75.67

Table Comparison between MLP and "Only" models

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN


Omer Kırnap (Koc University) MSc Thesis September 27 2018 99 123

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang Code            without t-RNN  with t-RNN
no nynorsklia (3k)   51.78          53.33
ru taiga (11k)       59.13          60.55
gl treegal (15k)     69.76          70.45
hu szeged (20k)      66.12          68.18
sv lines (49k)       74.04          75.46
tr imst (50k)        58.12          58.75
ar padt (120k)       68.04          68.14
en ewt (204k)        74.87          75.77
cs cac (473k)        82.89          83.57
cs pdt (1M)          81.17          81.164

t-RNN provides a comparative advantage for low-resource languages

Omer Kırnap (Koc University) MSc Thesis September 27 2018 100 123

Ablation Analysis

Overall results of ablation analysis

Lang        MLP    Only A  Only-β  Only-σ  w/o t-RNN  all
hu szeged   66.21  66.87   66.94   67.03   66.12      68.18
sv lines    71.12  72.05   72.17   74.04   72.17      75.46
tr imst     57.12  56.87   57.02   57.12   58.12      58.75
ar padt     67.83  66.67   66.89   66.92   68.04      68.14
cs cac      83.89  82.23   83.13   83.17   82.89      83.57
en ewt      75.54  75.43   75.56   75.67   74.87      75.77

Tree-stack LSTM beats other model variations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of Ablation Experiments

t-RNN's performance contribution increases when the training size decreases

σ-LSTM provides more useful information independent of dataset size

Interconnecting the model's components with t-RNN makes tree-stack LSTM more powerful for low-resource languages (ranked 10th overall and 2nd among transition based parsers)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What does Morphological Feature Embedding provide

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental Settings: We divide the CoNLL18 UD dataset into 4 parts based on the number of training tokens for each language to better understand our contributions

Languages having less than 20k tokens

Languages having more than 20k less than 50k tokens

Languages having more than 50k less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having less than 20k training tokens

Lang code      Morph-Feats  no Morph-Feats  # of tokens
no nynorsklia  51.13        53.33           3583
ru taiga       58.32        60.55           10479
sme giella     52.78        53.39           16385
la perseus     49.93        51.6            18184
ug udt         52.78        53.39           19262
sl sst         46.72        48.77           19473
hu szeged      66.23        68.18           20166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having between 50k and 100k training tokens

Lang code      Morph-Feats  no Morph-Feats  # of tokens
sv lines       72.18        74.81           48325
fr sequoia     84.36        82.17           50543
en gum         76.44        75.34           53686
ko gsd         73.74        72.54           56687
eu bdt         74.55        73.32           72974
nl lassysmall  76.7         75.8            75134
gl ctg         79.02        79.018          79327
lv lvtb        72.33        72.24           80666
id gsd         75.76        73.97           97531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having more than 100k training tokens

Lang code  Morph-Feats  no Morph-Feats  # of tokens
fa seraji  81.18        81.12           121064
bg btb     84.53        84.55           124336
en ewt     75.77        75.682          204585
ar padt    68.02        68.14           223881
de gsd     71.59        71.32           263804
ca ancora  85.89        85.874          417587
es ancora  84.99        84.78           444617
cs cac     83.57        83.63           472608
cs pdt     81.43        82.12           1173282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle: transitions using gold moves
Dynamic oracle: transitions using predicted moves

In both cases the log-probability of gold moves is maximized


Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123
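A minimal sketch of the difference between the two training regimes: the static oracle always follows gold moves, while the dynamic oracle sometimes follows the model's own predictions so the parser learns to recover from its mistakes. p_explore is an assumed hyperparameter, not a value from the slides.

```python
import random

def choose_training_move(gold_move, model_move, dynamic, p_explore=0.9):
    """Pick the transition actually applied while training.

    Static oracle: always follow the gold move.
    Dynamic oracle: with probability p_explore follow the model's own
    (possibly wrong) prediction; the gold move's log-probability is
    still what gets maximized in the loss.
    """
    if dynamic and random.random() < p_explore:
        return model_move
    return gold_move

assert choose_training_move("shift", "left", dynamic=False) == "shift"
```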

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning:

1 Using very limited data to train an LM for word and context vectors, and using them to train a parser from scratch

2 Using Facebook's word vectors to train a parser [Bojanowski et al., 2017]

3 Using my own word and context vectors trained on a different language from the same language family

4 Applying transfer learning with a pre-trained parser

Language      (1)           (2)    (3)    (4)
af afribooms  not provided  75.46  77.43  78.12
kk ktb        20.19         22.31  21.96  23.86
bxr bdt       7.64          9.76   9.93   8.98
kmr mg        20.12         22.57  22.78  23.39

Table LAS values for strategies (1), (2), (3) and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the most beneficial

Training an LM from scratch on very limited data does not yield useful word and context vectors

Our word and context vectors are still more useful than Facebook's [Bojanowski et al., 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition-based parsers can only build projective trees 6

6Figure fromhttpstplingfiluuse sarakurser5LN455-2014lectures5LN455-F8pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123
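Projectivity can be checked by testing whether any two dependency arcs cross; a small sketch, where heads is a list such that heads[i] gives the head of token i+1 (tokens are 1-based, 0 is the root):

```python
def is_projective(heads):
    """Check projectivity: no two dependency arcs may cross.

    heads[i] is the head of token i+1 (1-based tokens, 0 = root).
    """
    arcs = [(min(h, d), max(h, d)) for d, h in enumerate(heads, start=1)]
    for (l1, r1) in arcs:
        for (l2, r2) in arcs:
            if l1 < l2 < r1 < r2:   # the two arcs cross
                return False
    return True

assert is_projective([2, 0, 2])          # simple projective tree
assert not is_projective([3, 4, 0, 3])   # arcs 1<-3 and 2<-4 cross
```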

Projective vs Non-projective

We compared our model with the best model for different projectivity ratios

Language     Projectivity  Best (LAS)  Our (LAS)
grc perseus  90.7          79.39       55.03 (20)
eu bdt       95.13         84.22       74.13 (17)
hu szeged    97.8          82.66       68.18 (14)
da ddt       98.26         86.28       76.40 (17)
en gum       99.6          85.05       76.44 (15)
gl treegal   100           74.25       70.45 (10)
gl ctg       100           82.12       79.45 (14)

Table Our model's performance gap decreases as the projectivity ratio increases

7

7 From official results page and our projectivity table

Omer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusion: We introduced "Context, Word and Morph-feat" embeddings and showed their contribution in transition based dependency parsing

Our Tree-stack LSTM outperformed MLP by removing hand-crafted feature engineering

Tree-stack LSTM performed better with low resource languages

When the training dataset size increases, tree-stack LSTM loses its advantage

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization, morphological tagging and dependency parsing performed better. Some also jointly train a language model together with pre-trained embeddings.

Attention Mechanism

Applying attention between σ-LSTM, β-LSTM, or Action-LSTM states may bring a performance improvement

Morphological Features

Finding different ways to represent morphological features

Dynamic Oracle vs Beam Training

Although I tried both of them, I could not obtain a performance improvement. There may be convergence problems with our loss function, and other losses (e.g. CRF) may solve this problem.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap, Erenay Dayanık and Deniz Yuret. 2018. Tree-stack LSTM in Transition Based Dependency Parsing. In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap, Berkay Furkan Önder and Deniz Yuret. 2017. Parsing with Context Embeddings. In Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann, Carlos Gómez-Rodríguez and Giorgio Satta. 2011. Dynamic programming algorithms for transition-based dependency parsers. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Volume 1. Association for Computational Linguistics, pages 673-682.

S. Kübler, R. McDonald and J. Nivre. 2009. Dependency Parsing. Morgan & Claypool, US.

Chris Dyer, Miguel Ballesteros, Wang Ling, Austin Matthews and Noah A. Smith. 2015. Transition-based dependency parsing with stack long short-term memory. CoRR abs/1505.08075.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123

Page 7: Transition Based Dependency Parsing with Deep LearningTransition Based Dependency Parsing with Deep Learning Omer K rnap Ko˘c University okirnap@ku.edu.tr September 27, 2018 Omer

Introduction

Data-driven Dependency Parsing

Graph Based Algorithms

Using maximum spanning tree algorithms from graph theory

Transition Based Algorithms

Capitalizing on greedy stack based algorithms to build dependency treewith incremental steps in linear time

4

4From S Kbler R McDonald and J Nivre 2009 Dependency parsing Morgan ampClaypool US

Omer Kırnap (Koc University) MSc Thesis September 27 2018 7 123

Introduction

Transition Based Dependency Parsing

Transition System Abstract machine with a set of configurations(states) and transitions We use the ArcHybrid transition system[Kuhlmann et al 2011]

Configurations (σ β A)bull σ Stack of tree fragments initially emptybull β Buffer of words initially containing the whole sentencebull A Set of dependency arcs (head relation modifier) initially empty

Transitionsbull shift(σ b|βA) = (σ|b βA)bull leftd(σ|s b|βA) = (σ b|βA cup (b d s))bull rightd(σ|s|t βA) = (σ|s βA cup (s d t))

Omer Kırnap (Koc University) MSc Thesis September 27 2018 8 123

An example parsing of a sentence

Omer Kırnap (Koc University) MSc Thesis September 27 2018 9 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 10 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 11 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 12 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 13 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 14 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 15 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 16 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 17 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 18 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 19 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 20 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 21 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 22 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 23 123

Problem Definition

Find a model learning to decide correct transition from current state

Omer Kırnap (Koc University) MSc Thesis September 27 2018 24 123

2 Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 25 123

Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 26 123

Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 27 123

Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 28 123

Related Work

Neural Networks for Feature Conjunctions

Neural networks can handle feature conjunctions and nonlinearityHoweverImpractical for high dimensional inputs they scale linearly in inputdimensions (in both time and space assuming fixed number of hiddenunits)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 29 123

Related Work

Solution Using dense embeddings for input features

Omer Kırnap (Koc University) MSc Thesis September 27 2018 30 123

Overview

1 IntroductionOverview of Dependency ParsingTransition Based Dependency Parsing

2 Related WorkLinear Models and their DrawbacksNeural Network Models

3 ModelLanguage ModelMLP ParserTree-stack LSTM Parser

4 ResultsMLP vs Tree-stack LSTMMorphological Feature EmbeddingsStatic vs Dynamic Oracle TrainingTransfer Learning

5 Conclusion6 Future Work amp Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 31 123

3 Model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 32 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to UniversalDependencies

CoNLL17bull Koc-University team with MLP Parser using Context Embeddings

CoNLL18bull KParse team with Tree-stack LSTM Parser using Context and

Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 33 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to UniversalDependencies

CoNLL17

bull Koc-University team with MLP Parser using ContextEmbeddings

CoNLL18

bull KParse team with Tree-stack LSTM Parser using Context and Morph-featEmbeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 34 123

a Language Model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 35 123

Language Model (LM)

LM is used to obtain Context and Word embeddings with twocomponents

Character Based LSTM extracts word vectors

Word Based BiLSTM extracts context vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 36 123

Language Model - Word vectors

Character based LSTM generates word Vectors

Figure Character LSTM from Kırnap et al 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 37 123

Language Model - Context Vectors

Word based BiLSTM generates Context Vectors

Figure Word BiLSTM from Kırnap et al 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 38 123

b MLP Parser (CoNLL17)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 39 123

MLP Parser

MLP Parser consists of 4 components

Character Based LSTM extracts word vectors

Word Based BiLSTM extracts context vectors

Feature extractor describes current state

Decision module (MLP) decides the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 40 123

MLP Parser - Feature Extraction

Feature extractor describes current state

Figure Kırnap et al 2017Omer Kırnap (Koc University) MSc Thesis September 27 2018 41 123

MLP Parser - Decision Module

Decision module (MLP) decides the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 42 123

Experiments amp Dataset (MLP) CoNLL17

CoNLL17 Dataset

Dependency parsing of 81 treebanks in 49 languages

All treebanks use standardized annotationbull 17 universal part-of-speech tagsbull 37 universal dependency relations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 43 123

Experiments - Evaluation Metric

Labeled Attachment Score (LAS)The percentage of words correctly assigned both the correct syntactic headand the correct dependency label

Economic news hadGold Tree LAS 1

SBJATT

Economic news had

PREDOBJPred 1 LAS 0

Economic news had

OBJATTPred 2 LAS (frac12)100

Omer Kırnap (Koc University) MSc Thesis September 27 2018 44 123

Experiments (MLP)

CoNLL 2017 Results (all treebanks LAS)

Ranked 1st among transition based parsers 5

5Source CoNLL17 official results pageOmer Kırnap (Koc University) MSc Thesis September 27 2018 45 123

Contributions in CoNLL17

Omer Kırnap (Koc University) MSc Thesis September 27 2018 46 123

Context and Word Embeddings

Relative contributions of part-of-speech (p) word vector (v)context vector (c)

Feats Hungarian En-ParTUT Latvianp 636 766 559

v 735 759 63

c 722 76 635

v-c 76 79 676

p-c 78 825 706

p-v 766 808 677

p-fb 747 797 663

p-v-c 793 832 742

Omer Kırnap (Koc University) MSc Thesis September 27 2018 47 123

Context and Word Embeddings

Relative contributions of part-of-speech (p) word vector (v)context vector (c)

Feats Hungarian En-ParTUT Latvianp 636 766 559

v 735 759 63

c 722 76 635

v-c 76 79 676

p-c 78 825 706

p-v 766 808 677

p-fb 747 797 663

p-v-c 793 832 742

Context vectors provide independent contribution on top ofPOS tags

Omer Kırnap (Koc University) MSc Thesis September 27 2018 48 123

Context and Word embeddings

Feats Hungarian En-ParTUT Latvianp 636 766 559

v 735 759 63

c 722 76 635

v-c 76 79 676

p-c 78 825 706

p-v 766 808 677p-fb 747 797 663

p-v-c 793 832 742

Our BiLSTM language model word vectors perform betterthan FB vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 49 123

Context and Word embeddings

Feats Hungarian En-ParTUT Latvianp 636 766 559

v 735 759 63

c 722 76 635

v-c 76 79 676

p-c 78 825 706

p-v 766 808 677

p-fb 747 797 663

p-v-c 793 832 742

Both POS tags and context vectors have significantcontributions on top of word vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 50 123

Issues with MLP

However

Choosing correct state of parser still remains critical

We are unable to represent whole parsing history with featureextracting

Omer Kırnap (Koc University) MSc Thesis September 27 2018 51 123

Solution

Find a recurrent architecture such that it can summarize the parsinghistory as well as word sequences in a buffer and stack

Omer Kırnap (Koc University) MSc Thesis September 27 2018 52 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to UniversalDependencies

CoNLL17

bull Koc-University team with MLP Parser using Context Embeddings

CoNLL18

bull KParse team with Tree-stack LSTM Parser usingContext and Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 53 123

c Tree-stack LSTM Parser (CoNLL18)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 54 123

Related Work - Stack LSTM

Figure Stack LSTM [Dyer et al 2015]

Represent each component (σ β A) with an LSTMModifying head wordrsquos embedding with dependent embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 55 123

Problems with Stack LSTM

They only modify stackrsquos word embeddings

Hidden states of LSTMS are not updated unless reduce

Actions are not explicitly represented

They only used word2vec embeddings [Mikolov et al 2013]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 56 123

Our solution

We propose

Context embeddings should improve parsing accuracy

Dependency relations should be explicitly represented

Morphological Features of a word may enhance parsing accuracy

Omer Kırnap (Koc University) MSc Thesis September 27 2018 57 123

Tree-stack LSTM Overview

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

We propose Tree-stack LSTM model with 4 components

β-LSTMσ-LSTMAction-LSTMTree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 58 123

Tree-stack LSTM

Input Representation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 59 123

Input Representation

Action and Dependency Relation Embeddings

Every action is represented with continuous vector

Every dependency relation is represented with continuous vector

Omer Kırnap (Koc University) MSc Thesis September 27 2018 60 123

Input Representation

We do not include explicit feature extractor We initiated wordrepresentation by concatenating

Character Based LSTMrsquos word vectors

Word Based BiLSTMrsquos context vectors

Part-of-speech (POS) vectors

Morph-feat vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 61 123

Input Representation

Morp-feat Vectors

Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs IT It

Figure Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 62 123

Tree-stack LSTM

Model Components1 β-LSTM2 σ-LSTM3 Action-LSTM4 Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 63 123

β-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 64 123

β-LSTM

LSTM LSTM LSTM

wi+2wi+1wi

Figure Bufferrsquos β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 65 123

σ-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 66 123

σ-LSTM

LSTM LSTM LSTM

si si+1 si+2

Figure Stackrsquos σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 67 123

Action-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 68 123

Action-LSTM

LSTM LSTM LSTM

Figure Action-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 69 123

How do components of tree-stack LSTM are connected

Omer Kırnap (Koc University) MSc Thesis September 27 2018 70 123

Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 71 123

Tree-RNN (t-RNN)

t-RNN

Dependent word

Dependency Relation

Head word

Figure t-RNN

whead new = tanh(Wrnn lowast [whead old dl wdep] + brnn) (1)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 72 123

Tree-RNN with

1 Left Transition2 Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 73 123

Left Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 74 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM LSTM

Left transition

t-RNN

Dependency Relation

HeadDependent

Figure Each embedding initiated by concatenating POS language andmorph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 75 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

Figure Stackrsquos top LSTM is reducedOmer Kırnap (Koc University) MSc Thesis September 27 2018 76 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 77 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure β-LSTM recalculates its hidden based on new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 78 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure Tree-stack LSTM is ready to give new transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 79 123

Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 80 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure Each embedding initiated by concatenating POS language andmorph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 81 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure Stackrsquos top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 82 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 83 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure σ-LSTM recalculates its hidden from new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 84 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure Tree-stack LSTM is ready to give new transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 85 123

Final overview of Tree-stack LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 86 123

Overview

1 IntroductionOverview of Dependency ParsingTransition Based Dependency Parsing

2 Related WorkLinear Models and their DrawbacksNeural Network Models

3 ModelLanguage ModelMLP ParserTree-stack LSTM Parser

4 ResultsMLP vs Tree-stack LSTMMorphological Feature EmbeddingsStatic vs Dynamic Oracle TrainingTransfer Learning

5 Conclusion6 Future Work amp Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 87 123

4 Results amp Comparisons

Omer Kırnap (Koc University) MSc Thesis September 27 2018 88 123

Results & Comparisons

Dataset

CoNLL17:
• Dependency parsing of 81 treebanks in 49 languages
• All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations
• Koc-University ranked 7th out of 33 participants (1st among transition based parsers)

CoNLL18:
• Dependency parsing of 82 treebanks in 57 languages
• All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations
• Koc-University ranked 16th out of 30 participants (2nd among transition based parsers)

Changes between CoNLL17 and CoNLL18: 1. Train/test split change 2. Annotation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 89 123

MLP vs Tree-stack LSTM

CoNLL 2018 committee released comparison results of CoNLL17 andCoNLL18 systems tested under the same test sets

Omer Kırnap (Koc University) MSc Thesis September 27 2018 90 123

MLP vs Tree-stack LSTM

Two possible problems of the official comparison:

1. If the annotation of the treebank is improved, the older parser is handicapped.

2. If the training-test split has changed and the old training data are now in the test data, the old parser is favored undeservedly.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 91 123

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code (tokens)   MLP     Tree-stack
ru taiga (10k)       58.89   60.55
hu szeged (20k)      66.21   68.18
tr imst (50k)        56.78   58.75
ar padt (120k)       67.83   68.14
en ewt (205k)        74.87   75.77
cs cac (473k)        83.39   83.57

Tree-stack LSTM outperforms MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 92 123

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 93 123

MLP Parser

MLP

Figure Initial model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 94 123

Only Action LSTM

LSTM LSTM

Figure Only action LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 95 123

Only β-LSTM

LSTM LSTM LSTM

LSTM LSTM MLP

Figure Only β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 96 123

Only σ-LSTM

LSTM LSTM

LSTM LSTM MLP

Figure Only σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang Code   MLP     Only Action   Only-β   Only-σ
hu szeged   66.21   66.87         66.94    67.03
sv lines    71.12   72.05         72.17    72.45
tr imst     57.12   56.87         57.02    57.12
ar padt     67.83   66.67         66.89    66.92
cs cac      83.89   82.23         83.13    83.17
en ewt      75.54   75.43         75.56    75.67

Table: Comparison between MLP and "Only" models

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 99 123

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang Code            without t-RNN   with t-RNN
no nynorsklia (3k)   51.78           53.33
ru taiga (11k)       59.13           60.55
gl treegal (15k)     69.76           70.45
hu szeged (20k)      66.12           68.18
sv lines (49k)       74.04           75.46
tr imst (50k)        58.12           58.75
ar padt (120k)       68.04           68.14
en ewt (204k)        74.87           75.77
cs cac (473k)        82.89           83.57
cs pdt (1M)          81.17           81.164

t-RNN provides a comparative advantage for low-resource languages

Omer Kırnap (Koc University) MSc Thesis September 27 2018 100 123

Ablation Analysis

Overall results of ablation analysis

Lang        MLP     Only A   Only-β   Only-σ   w/o t-RNN   all
hu szeged   66.21   66.87    66.94    67.03    66.12       68.18
sv lines    71.12   72.05    72.17    74.04    72.17       75.46
tr imst     57.12   56.87    57.02    57.12    58.12       58.75
ar padt     67.83   66.67    66.89    66.92    68.04       68.14
cs cac      83.89   82.23    83.13    83.17    82.89       83.57
en ewt      75.54   75.43    75.56    75.67    74.87       75.77

Tree-stack LSTM beats other model variations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of Ablation Experiments

t-RNN's performance contribution increases as the training size decreases

σ-LSTM provides more useful information independent of dataset size

Interconnecting the model's components with t-RNN makes tree-stack LSTM more powerful for low-resource languages (ranked 10th overall and 2nd among transition based parsers)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What does Morphological Feature Embedding provide

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental Settings: We divide the CoNLL18 UD dataset (v2.2) into 4 parts based on the number of training tokens for each language, to better understand our contributions:

Languages having less than 20k tokens

Languages having more than 20k less than 50k tokens

Languages having more than 50k less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having less than 20k training tokens:

Lang code       Morph-Feats   no Morph-Feats   # of tokens
no nynorsklia   51.13         53.33            3,583
ru taiga        58.32         60.55            10,479
sme giella      52.78         53.39            16,385
la perseus      49.93         51.6             18,184
ug udt          52.78         53.39            19,262
sl sst          46.72         48.77            19,473
hu szeged       66.23         68.18            20,166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having between 50k and 100k training tokens:

Lang code       Morph-Feats   no Morph-Feats   # of tokens
sv lines        72.18         74.81            48,325
fr sequoia      84.36         82.17            50,543
en gum          76.44         75.34            53,686
ko gsd          73.74         72.54            56,687
eu bdt          74.55         73.32            72,974
nl lassysmall   76.7          75.8             75,134
gl ctg          79.02         79.018           79,327
lv lvtb         72.33         72.24            80,666
id gsd          75.76         73.97            97,531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having more than 100k training tokens:

Lang code   Morph-Feats   no Morph-Feats   # of tokens
fa seraji   81.18         81.12            121,064
bg btb      84.53         84.55            124,336
en ewt      75.77         75.682           204,585
ar padt     68.02         68.14            223,881
de gsd      71.59         71.32            263,804
ca ancora   85.89         85.874           417,587
es ancora   84.99         84.78            444,617
cs cac      83.57         83.63            472,608
cs pdt      81.43         82.12            1,173,282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle: transitions follow gold moves.
Dynamic oracle: transitions follow predicted moves.

In both cases, the log-probability of the gold moves is maximized.

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123
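The contrast can be made concrete in a few lines: in both regimes the loss targets the gold move, but the dynamic oracle advances the parser with its own prediction. This is an illustrative sketch, not the thesis code; `predict` is a stand-in for the trained model.

```python
def train_step(state, gold_move, predict, dynamic):
    # The loss is always computed on the gold move...
    loss_target = gold_move
    # ...but the next parser state follows either the gold move
    # (static oracle) or the model's own prediction (dynamic oracle).
    executed = predict(state) if dynamic else gold_move
    return loss_target, executed
```

Under the static oracle the parser only ever sees gold-derived states during training; the dynamic oracle exposes it to states reached by its own (possibly wrong) moves.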

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

What about languages with less than 20k training tokens?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning:

1. Using very limited data to train an LM for word and context vectors, and using them to train a parser from scratch
2. Using Facebook's word vectors to train a parser [Bojanowski et al. 2017]
3. Using my own word and context vectors trained on a different language from the same language family
4. Applying transfer learning with a pre-trained parser

Language       (1)            (2)     (3)     (4)
af afribooms   not provided   75.46   77.43   78.12
kk ktb         20.19          22.31   21.96   23.86
bxr bdt        7.64           9.76    9.93    8.98
kmr mg         20.12          22.57   22.78   23.39

Table LAS values for strategies (1) (2) (3) and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the most beneficial

From-scratch LM training does not yield useful word and context vectors

Our word and context vectors are still more useful than Facebook's [Bojanowski et al. 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition based parsers can only build projective trees 6

6Figure fromhttpstplingfiluuse sarakurser5LN455-2014lectures5LN455-F8pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123

Projective vs Non-projective

We compared our model with the best model at different projectivity ratios:

Language      Projectivity   Best (LAS)   Our (LAS)
grc perseus   90.7           79.39        55.03 (20)
eu bdt        95.13          84.22        74.13 (17)
hu szeged     97.8           82.66        68.18 (14)
da ddt        98.26          86.28        76.40 (17)
en gum        99.6           85.05        76.44 (15)
gl treegal    100            74.25        70.45 (10)
gl ctg        100            82.12        79.45 (14)

Table: Our model's performance gap decreases as the projectivity ratio increases

7

7From official results page and our projectivity tableOmer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusion: We introduced "Context, Word and Morph-feat" embeddings and showed their contribution in transition based dependency parsing.

Our Tree-stack LSTM outperformed the MLP by removing hand-crafted feature engineering.

Tree-stack LSTM performed better with low resource languages.

When the training dataset size increases, tree-stack LSTM loses its advantage.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization, morphological tagging and dependency parsing performed better. Some also jointly train a language model together with pre-trained embeddings.

Attention Mechanism

Applying attention between σ-LSTM, β-LSTM, or Action-LSTM states may bring performance improvements.

Morphological Features

Finding different ways to represent morphological features.

Dynamic Oracle vs Beam Training

Although I tried both of them, I could not obtain a performance improvement. There may be convergence problems with our loss function; other losses (e.g., CRF) may solve this problem.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap, Erenay Dayanık and Deniz Yuret. 2018. Tree-stack LSTM in Transition Based Dependency Parsing. In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap, Berkay Furkan Onder and Deniz Yuret. 2017. Parsing with Context Embeddings. In Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann, Carlos Gómez-Rodríguez and Giorgio Satta. 2011. Dynamic programming algorithms for transition-based dependency parsers. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Volume 1. Association for Computational Linguistics, pages 673-682.

S. Kübler, R. McDonald and J. Nivre. 2009. Dependency Parsing. Morgan & Claypool, US.

Chris Dyer, Miguel Ballesteros, Wang Ling, Austin Matthews and Noah A. Smith. 2015. Transition-based dependency parsing with stack long short-term memory. CoRR abs/1505.08075.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123


Introduction

Transition Based Dependency Parsing

Transition System: Abstract machine with a set of configurations (states) and transitions. We use the arc-hybrid transition system [Kuhlmann et al. 2011].

Configurations (σ, β, A):
• σ: Stack of tree fragments, initially empty
• β: Buffer of words, initially containing the whole sentence
• A: Set of dependency arcs (head, relation, modifier), initially empty

Transitions:
• shift(σ, b|β, A) = (σ|b, β, A)
• left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})
• right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Omer Kırnap (Koc University) MSc Thesis September 27 2018 8 123
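The three arc-hybrid transitions above can be sketched directly as list and set operations. This is a minimal illustration of the transition system, not the thesis implementation; stack, buffer, and arc set are plain Python values here.

```python
def shift(stack, buffer, arcs):
    # shift(σ, b|β, A) = (σ|b, β, A): move the front of the buffer onto the stack
    return stack + [buffer[0]], buffer[1:], arcs

def left(stack, buffer, arcs, d):
    # left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)}):
    # pop s from the stack, attach it to the buffer front b with relation d
    s, b = stack[-1], buffer[0]
    return stack[:-1], buffer, arcs | {(b, d, s)}

def right(stack, buffer, arcs, d):
    # right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)}):
    # pop t from the stack, attach it to the element s below it with relation d
    s, t = stack[-2], stack[-1]
    return stack[:-1], buffer, arcs | {(s, d, t)}
```

For example, shifting "had" and "news" and then applying a right transition yields the arc ("had", "obj", "news") with "had" left on the stack.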

An example parsing of a sentence

Omer Kırnap (Koc University) MSc Thesis September 27 2018 9 123


Problem Definition

Find a model that learns to decide the correct transition from the current state

Omer Kırnap (Koc University) MSc Thesis September 27 2018 24 123

2 Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 25 123

Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 26 123


Related Work

Neural Networks for Feature Conjunctions

Neural networks can handle feature conjunctions and nonlinearity. However, they are impractical for high-dimensional inputs: they scale linearly in input dimensions (in both time and space, assuming a fixed number of hidden units).

Omer Kırnap (Koc University) MSc Thesis September 27 2018 29 123

Related Work

Solution: Using dense embeddings for input features

Omer Kırnap (Koc University) MSc Thesis September 27 2018 30 123


3 Model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 32 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to Universal Dependencies:

CoNLL17
• Koc-University team with MLP Parser using Context Embeddings

CoNLL18
• KParse team with Tree-stack LSTM Parser using Context and Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 33 123


a Language Model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 35 123

Language Model (LM)

LM is used to obtain Context and Word embeddings, with two components:

Character Based LSTM extracts word vectors

Word Based BiLSTM extracts context vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 36 123
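The two LM components above can be sketched as follows. A plain tanh recurrence stands in for the LSTMs; all dimensions, weight initializations, and the toy character encoding are illustrative assumptions, not the thesis setup.

```python
import numpy as np

rng = np.random.default_rng(0)
CDIM, WDIM = 8, 8
Wc = rng.normal(size=(WDIM, CDIM))   # character -> hidden
Uc = rng.normal(size=(WDIM, WDIM))   # hidden -> hidden

def word_vector(word):
    # Character-based recurrence: read the word one character at a time;
    # the final hidden state is the word vector.
    h = np.zeros(WDIM)
    for ch in word:
        x = np.zeros(CDIM)
        x[ord(ch) % CDIM] = 1.0      # toy one-hot character "embedding"
        h = np.tanh(Wc @ x + Uc @ h)
    return h

Wf = rng.normal(size=(WDIM, WDIM))
Wb = rng.normal(size=(WDIM, WDIM))

def context_vectors(words):
    # Word-based bidirectional recurrence over the word vectors; each
    # context vector concatenates the forward and backward hidden states.
    vs = [word_vector(w) for w in words]
    fwd, h = [], np.zeros(WDIM)
    for v in vs:
        h = np.tanh(Wf @ v + h)
        fwd.append(h)
    bwd, h = [], np.zeros(WDIM)
    for v in reversed(vs):
        h = np.tanh(Wb @ v + h)
        bwd.append(h)
    bwd.reverse()
    return [np.concatenate([f, b]) for f, b in zip(fwd, bwd)]
```

The word vector depends only on the word's characters, while the context vector of the same word changes with the surrounding sentence.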

Language Model - Word vectors

Character based LSTM generates word Vectors

Figure Character LSTM from Kırnap et al 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 37 123

Language Model - Context Vectors

Word based BiLSTM generates Context Vectors

Figure Word BiLSTM from Kırnap et al 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 38 123

b MLP Parser (CoNLL17)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 39 123

MLP Parser

MLP Parser consists of 4 components

Character Based LSTM extracts word vectors

Word Based BiLSTM extracts context vectors

Feature extractor describes current state

Decision module (MLP) decides the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 40 123

MLP Parser - Feature Extraction

Feature extractor describes current state

Figure: Kırnap et al. 2017
Omer Kırnap (Koc University) MSc Thesis September 27 2018 41 123

MLP Parser - Decision Module

Decision module (MLP) decides the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 42 123

Experiments amp Dataset (MLP) CoNLL17

CoNLL17 Dataset

Dependency parsing of 81 treebanks in 49 languages

All treebanks use standardized annotation:
• 17 universal part-of-speech tags
• 37 universal dependency relations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 43 123

Experiments - Evaluation Metric

Labeled Attachment Score (LAS): the percentage of words correctly assigned both the correct syntactic head and the correct dependency label.

Example: "Economic news had ..."
Gold tree (labels ATT, SBJ): LAS = 100
Pred 1 (labels PRED, OBJ): LAS = 0
Pred 2 (labels OBJ, ATT): LAS = (1/2) × 100 = 50

Omer Kırnap (Koc University) MSc Thesis September 27 2018 44 123
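The LAS definition above reduces to a simple count. A minimal sketch, assuming `gold` and `pred` each map a token to its (head, label) pair — a hypothetical representation chosen for illustration.

```python
def las(gold, pred):
    # A token counts as correct only if both its predicted head
    # and its dependency label match the gold annotation.
    correct = sum(1 for tok, head_label in gold.items()
                  if pred.get(tok) == head_label)
    return 100.0 * correct / len(gold)
```

On the slide's example, a prediction that gets both labels wrong scores 0, and one that recovers one of the two gold arcs scores 50.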

Experiments (MLP)

CoNLL 2017 Results (all treebanks LAS)

Ranked 1st among transition based parsers 5

5 Source: CoNLL17 official results page
Omer Kırnap (Koc University) MSc Thesis September 27 2018 45 123

Contributions in CoNLL17

Omer Kırnap (Koc University) MSc Thesis September 27 2018 46 123

Context and Word Embeddings

Relative contributions of part-of-speech (p), word vector (v), and context vector (c):

Feats   Hungarian   En-ParTUT   Latvian
p       63.6        76.6        55.9
v       73.5        75.9        63.0
c       72.2        76.0        63.5
v-c     76.0        79.0        67.6
p-c     78.0        82.5        70.6
p-v     76.6        80.8        67.7
p-fb    74.7        79.7        66.3
p-v-c   79.3        83.2        74.2

Omer Kırnap (Koc University) MSc Thesis September 27 2018 47 123

Context and Word Embeddings

Relative contributions of part-of-speech (p), word vector (v), and context vector (c):

Feats   Hungarian   En-ParTUT   Latvian
p       63.6        76.6        55.9
v       73.5        75.9        63.0
c       72.2        76.0        63.5
v-c     76.0        79.0        67.6
p-c     78.0        82.5        70.6
p-v     76.6        80.8        67.7
p-fb    74.7        79.7        66.3
p-v-c   79.3        83.2        74.2

Context vectors provide an independent contribution on top of POS tags

Omer Kırnap (Koc University) MSc Thesis September 27 2018 48 123

Context and Word embeddings

Feats   Hungarian   En-ParTUT   Latvian
p       63.6        76.6        55.9
v       73.5        75.9        63.0
c       72.2        76.0        63.5
v-c     76.0        79.0        67.6
p-c     78.0        82.5        70.6
p-v     76.6        80.8        67.7
p-fb    74.7        79.7        66.3
p-v-c   79.3        83.2        74.2

Our BiLSTM language model word vectors perform better than the Facebook (fb) vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 49 123

Context and Word embeddings

Feats   Hungarian   En-ParTUT   Latvian
p       63.6        76.6        55.9
v       73.5        75.9        63.0
c       72.2        76.0        63.5
v-c     76.0        79.0        67.6
p-c     78.0        82.5        70.6
p-v     76.6        80.8        67.7
p-fb    74.7        79.7        66.3
p-v-c   79.3        83.2        74.2

Both POS tags and context vectors have significant contributions on top of word vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 50 123

Issues with MLP

However

Choosing the correct state representation of the parser remains critical

We are unable to represent the whole parsing history with feature extraction

Omer Kırnap (Koc University) MSc Thesis September 27 2018 51 123

Solution

Find a recurrent architecture that can summarize the parsing history as well as the word sequences in the buffer and stack

Omer Kırnap (Koc University) MSc Thesis September 27 2018 52 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to Universal Dependencies:

CoNLL17

• Koc-University team with MLP Parser using Context Embeddings

CoNLL18

• KParse team with Tree-stack LSTM Parser using Context and Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 53 123

c Tree-stack LSTM Parser (CoNLL18)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 54 123

Related Work - Stack LSTM

Figure Stack LSTM [Dyer et al 2015]

Represent each component (σ, β, A) with an LSTM. Modify the head word's embedding with the dependent's embedding.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 55 123

Problems with Stack LSTM

• They only modify the stack's word embeddings

• Hidden states of the LSTMs are not updated unless a reduce occurs

• Actions are not explicitly represented

• They only used word2vec embeddings [Mikolov et al. 2013]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 56 123

Our solution

We propose

Context embeddings should improve parsing accuracy

Dependency relations should be explicitly represented

Morphological Features of a word may enhance parsing accuracy

Omer Kırnap (Koc University) MSc Thesis September 27 2018 57 123

Tree-stack LSTM Overview

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

We propose the Tree-stack LSTM model with 4 components:

β-LSTM, σ-LSTM, Action-LSTM, Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 58 123

Tree-stack LSTM

Input Representation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 59 123

Input Representation

Action and Dependency Relation Embeddings

Every action is represented with a continuous vector

Every dependency relation is represented with a continuous vector

Omer Kırnap (Koc University) MSc Thesis September 27 2018 60 123

Input Representation

We do not include an explicit feature extractor. We initialize the word representation by concatenating:

Character Based LSTMrsquos word vectors

Word Based BiLSTMrsquos context vectors

Part-of-speech (POS) vectors

Morph-feat vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 61 123
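The initialization above is a plain concatenation of the four vectors. A sketch with arbitrary illustrative dimensions (the real dimensions are not specified on this slide):

```python
import numpy as np

def word_representation(word_vec, context_vec, pos_vec, morph_vec):
    # Concatenate char-LSTM word vector, BiLSTM context vector,
    # POS embedding, and morph-feat embedding into one input vector.
    return np.concatenate([word_vec, context_vec, pos_vec, morph_vec])
```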

Input Representation

Morph-feat Vectors

Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs IT It

Figure Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 62 123
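One way to picture a morph-feat embedding is to split the UD feature string on `|` and pool per-feature vectors. The hash-seeded lookup below is an illustrative assumption for a self-contained sketch, not the thesis method (which learns these embeddings).

```python
import hashlib
import numpy as np

DIM = 16  # illustrative embedding size

def feat_embedding(feat):
    # Deterministic stand-in for a learned embedding table:
    # seed a generator from the feature string, e.g. "Case=Nom".
    seed = int(hashlib.md5(feat.encode()).hexdigest(), 16) % (2**32)
    return np.random.default_rng(seed).normal(size=DIM)

def morph_feat_vector(feats):
    # Average the embeddings of the individual Key=Value features.
    parts = feats.split("|")
    return np.mean([feat_embedding(p) for p in parts], axis=0)
```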

Tree-stack LSTM

Model Components:
1. β-LSTM
2. σ-LSTM
3. Action-LSTM
4. Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 63 123

β-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 64 123

β-LSTM

LSTM LSTM LSTM

w_i  w_{i+1}  w_{i+2}

Figure: Buffer's β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 65 123

σ-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 66 123

σ-LSTM

LSTM LSTM LSTM

s_i  s_{i+1}  s_{i+2}

Figure: Stack's σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 67 123

Action-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 68 123

Action-LSTM

LSTM LSTM LSTM

Figure Action-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 69 123

How are the components of tree-stack LSTM connected?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 70 123

Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 71 123

Tree-RNN (t-RNN)

t-RNN

Dependent word

Dependency Relation

Head word

Figure t-RNN

w_head^new = tanh(W_rnn ∗ [w_head^old ; d_l ; w_dep] + b_rnn)    (1)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 72 123
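Equation (1) transcribed directly: the new head embedding is a tanh of an affine map over the concatenated [head; relation; dependent] vectors. The dimension and random initialization below are illustrative assumptions.

```python
import numpy as np

DIM = 8  # illustrative; all three inputs share this size here
rng = np.random.default_rng(0)
W_rnn = rng.normal(size=(DIM, 3 * DIM))
b_rnn = np.zeros(DIM)

def t_rnn(w_head, d_rel, w_dep):
    # w_head^new = tanh(W_rnn * [w_head^old ; d_l ; w_dep] + b_rnn)
    x = np.concatenate([w_head, d_rel, w_dep])
    return np.tanh(W_rnn @ x + b_rnn)
```

After each left/right transition, the output replaces the head's embedding, so a head accumulates information from all of its dependents.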

Tree-RNN with

1 Left Transition2 Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 73 123

Left Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 74 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

LSTM LSTM LSTM LSTM

Left transition

t-RNN

Dependency Relation

HeadDependent

Figure: Each embedding is initialized by concatenating POS, language, and morph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 75 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

Figure: Stack's top LSTM is reduced
Omer Kırnap (Koc University) MSc Thesis September 27 2018 76 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 77 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure β-LSTM recalculates its hidden based on new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 78 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure Tree-stack LSTM is ready to give new transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 79 123

Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 80 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

LSTM LSTM LSTM LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure: Each embedding is initialized by concatenating POS, language, and morph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 81 123


Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning1 Using very limited data to train LM for word and context vectors and

use them to train a parser from scratch2 Using Facebookrsquos word vectors to train a parser [Bojanowski et al

2017]3 Using my own word and context vectors trained with different

language but from the same language family4 Applying transfer learning with a pre-trained parser

Language (1) (2) (3) (4)af afribooms not provided 7546 7743 7812kk ktb 2019 2231 2196 2386bxr bdt 764 976 993 898

kmr mg 2012 2257 2278 2339

Table LAS values for strategies (1) (2) (3) and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the mostbeneficial

From scratch LM training does not bring useful word and contextvectors

Our word and context vectors are still more useful than Facebookrsquos[Bojanowski et al 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition Based Parser can only build projective trees 6

6Figure fromhttpstplingfiluuse sarakurser5LN455-2014lectures5LN455-F8pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123

Projective vs Non-projective

We compared our model with the best model for different projectivityratios

Language Projectiviy Best (LAS) Our (LAS)

grc perseus 907 7939 5503 (20)

eu bdt 9513 8422 7413 (17)

hu szeged 978 8266 6818 (14)

da ddt 9826 8628 7640 (17)

en gum 996 8505 7644 (15)

gl treegal 100 7425 7045 (10)

gl ctg 100 8212 7945 (14)

Table Our models performance gap decreases as the projectivity ratio increases

7

7From official results page and our projectivity tableOmer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusionWe introduced ldquoContext Word and Morph-featrdquo embeddings and showedtheir contribution in transition based dependency parsing

Our Tree-stack LSTM outperformed MLP by removing hand-craftedfeature engineering

Tree-stack LSTM performed better with low resource languages

When the training dataset size increases tree-stack LSTM losses itsadvantage

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization morphological taggingand dependency parsing performed better Some are also jointly trained alanguage model together with pre-trained embeddings

Attention Mechanism

Applying attention in between σ-LSTM states or β-LSTM orAction-LSTM may bring performance improvement

Morphological Features

Finding different way to represent morphological features

Dynamic Oracle vs Beam Training

Although I tried both of them I could not obtain performanceimprovement There may be convergence problems with our loss functionand another losses (CRF) may solve this problem

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap Erenay Dayanık and Deniz Yuret 2018 Tree-stackLSTM in Transition Based Dependency Parsing In Proceedings ofthe CoNLL 2018 Shared Task Multilingual Parsing from Raw Text toUniversal Dependencies

Omer Kırnap Berkay Furkan Onder and Deniz Yuret 2017 Parsingwith Context Embeddings In Proceedings of the CoNLL 2017 SharedTask Multilingual Parsing from Raw Text to Universal Dependencies

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann Carlos Gomez-Rodriguez and Giorgio Satta 2011Dynamic programming algorithms for transition-based dependencyparsers In Proceedings of the 49th Annual Meeting of theAssociation for Computational Linguistics Human LanguageTechnologies-Volume 1 Association for Computational Linguisticspages 673682

S Kbler R McDonald and J Nivre 2009 Dependency parsingMorgan amp Claypool US

Chris Dyer Miguel Ballesteros Wang Ling Austin Matthews andNoah A Smith 2015 Transition based dependency parsing withstack long-short term memory CoRR abs150508075

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123

  • Introduction
    • Overview of Dependency Parsing
    • Transition Based Dependency Parsing
      • Related Work
        • Linear Models and their Drawbacks
        • Neural Network Models
          • Model
            • Language Model
            • MLP Parser
            • Tree-stack LSTM Parser
              • Results
                • MLP vs Tree-stack LSTM
                • Morphological Feature Embeddings
                • Static vs Dynamic Oracle Training
                • Transfer Learning
                  • Conclusion
                  • Future Work amp Discussions
Page 9: Transition Based Dependency Parsing with Deep LearningTransition Based Dependency Parsing with Deep Learning Omer K rnap Ko˘c University okirnap@ku.edu.tr September 27, 2018 Omer

An example parsing of a sentence


Problem Definition

Find a model that learns to decide the correct transition from the current state.

2 Related Work



Related Work

Neural Networks for Feature Conjunctions

Neural networks can handle feature conjunctions and nonlinearity. However, they are impractical for high-dimensional inputs: they scale linearly in the input dimension (in both time and space, assuming a fixed number of hidden units).

Related Work

Solution: use dense embeddings for input features.


3 Model


Model Overview

2 shared tasks on Multilingual Parsing from Raw Text to Universal Dependencies:

CoNLL17
• Koç University team with the MLP Parser using Context Embeddings

CoNLL18
• KParse team with the Tree-stack LSTM Parser using Context and Morph-feat Embeddings


a Language Model


Language Model (LM)

The LM is used to obtain Context and Word embeddings with two components:

Character-based LSTM extracts word vectors

Word-based BiLSTM extracts context vectors
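As a rough illustration of the two-stage pipeline (not the thesis implementation), plain tanh RNN cells can stand in for the LSTMs: a character-level RNN turns each word into a word vector, and a word-level bidirectional RNN over those word vectors produces context vectors. All sizes and the one-hot character encoding are toy assumptions.

```python
import numpy as np

# Toy two-stage language model: char RNN -> word vectors,
# word-level BiRNN -> context vectors. tanh cells stand in for LSTMs.
rng = np.random.default_rng(0)
H = 6  # hidden size (toy assumption)

def rnn(inputs, Wx, Wh):
    h = np.zeros(H)
    for x in inputs:
        h = np.tanh(Wx @ x + Wh @ h)
    return h

# Character-level parameters: 256 one-hot character inputs.
Wx_c, Wh_c = rng.normal(size=(H, 256)) * 0.1, rng.normal(size=(H, H)) * 0.1

def word_vector(word):
    chars = [np.eye(256)[min(ord(c), 255)] for c in word]  # one-hot chars
    return rnn(chars, Wx_c, Wh_c)

# Word-level forward and backward parameters.
Wx_f, Wh_f = rng.normal(size=(H, H)) * 0.1, rng.normal(size=(H, H)) * 0.1
Wx_b, Wh_b = rng.normal(size=(H, H)) * 0.1, rng.normal(size=(H, H)) * 0.1

def context_vectors(sentence):
    wv = [word_vector(w) for w in sentence]
    # Re-running the RNN per position is O(n^2) but keeps the sketch short.
    fwd = [rnn(wv[: i + 1], Wx_f, Wh_f) for i in range(len(wv))]
    bwd = [rnn(wv[i:][::-1], Wx_b, Wh_b) for i in range(len(wv))]
    return [np.concatenate([f, b]) for f, b in zip(fwd, bwd)]

ctx = context_vectors(["Economic", "news", "had", "little", "effect"])
print(len(ctx), ctx[0].shape)  # 5 (12,)
```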

Language Model - Word vectors

Character-based LSTM generates word vectors

Figure Character LSTM from Kırnap et al 2017


Language Model - Context Vectors

Word-based BiLSTM generates context vectors

Figure Word BiLSTM from Kırnap et al 2017


b MLP Parser (CoNLL17)


MLP Parser

MLP Parser consists of 4 components

Character Based LSTM extracts word vectors

Word Based BiLSTM extracts context vectors

Feature extractor describes current state

Decision module (MLP) decides the next transition


MLP Parser - Feature Extraction

Feature extractor describes current state

Figure: Kırnap et al. 2017

MLP Parser - Decision Module

Decision module (MLP) decides the next transition


Experiments amp Dataset (MLP) CoNLL17

CoNLL17 Dataset

Dependency parsing of 81 treebanks in 49 languages

All treebanks use standardized annotation:
• 17 universal part-of-speech tags
• 37 universal dependency relations

Experiments - Evaluation Metric

Labeled Attachment Score (LAS)
The percentage of words correctly assigned both the correct syntactic head and the correct dependency label.

Example: "Economic news had"
Gold tree (arcs: ATT, SBJ): LAS = 1
Pred 1 (arcs: OBJ, PRED): LAS = 0
Pred 2 (arcs: ATT, OBJ): LAS = (1/2) · 100
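The metric itself fits in a few lines; a minimal sketch with a toy two-word example (head indices and label strings are illustrative):

```python
# Minimal sketch of Labeled Attachment Score (LAS): the percentage of
# words whose predicted head AND dependency label both match the gold
# annotation. heads[i] is the head index of word i; labels[i] its relation.
def las(gold_heads, gold_labels, pred_heads, pred_labels):
    assert len(gold_heads) == len(pred_heads)
    correct = sum(
        gh == ph and gl == pl
        for gh, gl, ph, pl in zip(gold_heads, gold_labels, pred_heads, pred_labels)
    )
    return 100.0 * correct / len(gold_heads)

# Two scored words: both heads right, one label wrong -> LAS = 50.0
print(las([2, 3], ["ATT", "SBJ"], [2, 3], ["ATT", "OBJ"]))  # 50.0
```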

Experiments (MLP)

CoNLL 2017 Results (all treebanks LAS)

Ranked 1st among transition based parsers 5

5. Source: CoNLL17 official results page

Contributions in CoNLL17


Context and Word Embeddings

Relative contributions of part-of-speech (p), word vector (v), context vector (c):

Feats   Hungarian  En-ParTUT  Latvian
p       63.6       76.6       55.9
v       73.5       75.9       63
c       72.2       76         63.5
v-c     76         79         67.6
p-c     78         82.5       70.6
p-v     76.6       80.8       67.7
p-fb    74.7       79.7       66.3
p-v-c   79.3       83.2       74.2

Context and Word Embeddings

Context vectors provide an independent contribution on top of POS tags.

Context and Word embeddings

Our BiLSTM language model word vectors perform better than FB vectors.

Context and Word embeddings

Both POS tags and context vectors have significant contributions on top of word vectors.

Issues with MLP

However:

Choosing the correct state representation of the parser still remains critical

We are unable to represent the whole parsing history with feature extraction

Solution

Find a recurrent architecture that can summarize the parsing history as well as the word sequences in the buffer and stack.


c Tree-stack LSTM Parser (CoNLL18)


Related Work - Stack LSTM

Figure Stack LSTM [Dyer et al 2015]

Represent each component (σ, β, A) with an LSTM
Modify the head word's embedding with the dependent's embedding

Problems with Stack LSTM

They only modify the stack's word embeddings

Hidden states of the LSTMs are not updated unless a reduce occurs

Actions are not explicitly represented

They only used word2vec embeddings [Mikolov et al. 2013]

Our solution

We propose

Context embeddings should improve parsing accuracy

Dependency relations should be explicitly represented

Morphological Features of a word may enhance parsing accuracy


Tree-stack LSTM Overview

Figure: Tree-stack LSTM architecture (β-, σ-, and Action-LSTM outputs are concatenated and fed to an MLP; the t-RNN composes the head word, dependent word, and dependency relation).

We propose the Tree-stack LSTM model with 4 components:
β-LSTM
σ-LSTM
Action-LSTM
Tree-RNN

Tree-stack LSTM

Input Representation


Input Representation

Action and Dependency Relation Embeddings

Every action is represented with a continuous vector

Every dependency relation is represented with a continuous vector

Input Representation

We do not include an explicit feature extractor. We initialize the word representation by concatenating:

Character-based LSTM's word vectors

Word-based BiLSTM's context vectors

Part-of-speech (POS) vectors

Morph-feat vectors

Input Representation

Morph-feat Vectors

Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs IT It

Figure: Morph-feat Embeddings
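One plausible reading of the figure, sketched in code: each key=value feature gets its own vector, and a word's morph-feat embedding combines its feature vectors. The dimension and the choice of summing are illustrative assumptions, not the thesis implementation.

```python
import numpy as np

# Toy morph-feat embedding: one vector per key=value feature, summed.
rng = np.random.default_rng(0)
DIM = 8                 # embedding size (toy assumption)
feat_table = {}

def feat_vec(feat):
    if feat not in feat_table:          # lazily allocate a vector per feature
        feat_table[feat] = rng.normal(size=DIM)
    return feat_table[feat]

def morph_embed(feat_string):
    feats = feat_string.split("|")      # e.g. "Case=Nom|Gender=Neut|..."
    return sum(feat_vec(f) for f in feats)

v = morph_embed("Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs")
print(v.shape)  # (8,)
```

Words sharing a feature (e.g. `Case=Nom`) then share that feature's vector, so morphological information is pooled across the vocabulary.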

Tree-stack LSTM

Model Components:
1. β-LSTM
2. σ-LSTM
3. Action-LSTM
4. Tree-RNN

β-LSTM

Figure: Tree-stack LSTM architecture (β-LSTM component).

β-LSTM

Figure: Buffer's β-LSTM running over buffer words w_i, w_{i+1}, w_{i+2}.

σ-LSTM

Figure: Tree-stack LSTM architecture (σ-LSTM component).

σ-LSTM

Figure: Stack's σ-LSTM running over stack words s_i, s_{i+1}, s_{i+2}.

Action-LSTM

Figure: Tree-stack LSTM architecture (Action-LSTM component).

Action-LSTM

Figure: Action-LSTM running over the transition history.

How are the components of tree-stack LSTM connected?

Tree-RNN


Tree-RNN (t-RNN)

The t-RNN combines the head word, the dependent word, and the dependency relation:

w_head_new = tanh(W_rnn · [w_head_old; d_l; w_dep] + b_rnn)    (1)

Figure: t-RNN
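Eq. (1) can be sketched directly; the dimension here is a toy assumption, and the weights are random stand-ins for trained parameters:

```python
import numpy as np

# Sketch of the t-RNN composition in Eq. (1): the head word's new
# embedding is a nonlinear function of the old head embedding, the
# dependency-label embedding d_l, and the dependent's embedding.
rng = np.random.default_rng(0)
DIM = 4                                    # embedding size (toy assumption)
W_rnn = rng.normal(size=(DIM, 3 * DIM))    # maps the concatenation back to DIM
b_rnn = np.zeros(DIM)

def t_rnn(w_head_old, d_l, w_dep):
    x = np.concatenate([w_head_old, d_l, w_dep])
    return np.tanh(W_rnn @ x + b_rnn)

new_head = t_rnn(rng.normal(size=DIM), rng.normal(size=DIM), rng.normal(size=DIM))
print(new_head.shape)  # (4,)
```

Because the output has the same dimension as a word embedding, the composed head can be pushed back onto the stack and composed again as further dependents attach.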

Tree-RNN with

1. Left Transition
2. Right Transition

Left Transition


Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: Each embedding is initialized by concatenating POS, language, and morph-feat embeddings.

Transitions - Left

Figure: Stack's top LSTM is reduced.

Transitions - Left

Figure: t-RNN calculates the new head embedding.

Transitions - Left

Figure: β-LSTM recalculates its hidden state based on the new input.

Transitions - Left

Figure: Tree-stack LSTM is ready for the next transition.

Right Transition


Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: Each embedding is initialized by concatenating POS, language, and morph-feat embeddings.

Transitions - Right

Figure: Stack's top LSTM is reduced.

Transitions - Right

Figure: t-RNN calculates the new head embedding.

Transitions - Right

Figure: σ-LSTM recalculates its hidden state from the new input.

Transitions - Right

Figure: Tree-stack LSTM is ready for the next transition.
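Putting the two reduce transitions side by side, here is a hedged sketch of the state updates in the slide notation, with arcs stored as (head, label, dependent) triples; the embedding and LSTM updates are deliberately omitted:

```python
# left_d attaches the stack top s as a dependent of the buffer front b;
# right_d attaches the stack top t as a dependent of the element s below it.
def left_arc(stack, buffer, arcs, d):
    s = stack.pop()          # dependent leaves the stack
    b = buffer[0]            # head stays at the front of the buffer
    arcs.add((b, d, s))

def right_arc(stack, buffer, arcs, d):
    t = stack.pop()          # dependent leaves the stack
    s = stack[-1]            # head stays on the stack
    arcs.add((s, d, t))

def shift(stack, buffer):
    stack.append(buffer.pop(0))

stack, buffer, arcs = [0], [1, 2, 3], set()
shift(stack, buffer)                 # stack [0, 1], buffer [2, 3]
left_arc(stack, buffer, arcs, "nsubj")
print(stack, buffer, sorted(arcs))   # [0] [2, 3] [(2, 'nsubj', 1)]
```

In the full model, each of these updates also triggers the t-RNN composition of the new head embedding and a recomputation of the affected LSTM hidden states.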

Final overview of Tree-stack LSTM

Figure: Tree-stack LSTM architecture with all components connected (σ-, β-, and Action-LSTM outputs concatenated and fed to an MLP).


4 Results & Comparisons

Results amp Comparisons

Dataset

CoNLL17:
• Dependency parsing of 81 treebanks in 49 languages
• All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations
• Koç University ranked 7th out of 33 participants (1st among transition-based parsers)

CoNLL18:
• Dependency parsing of 82 treebanks in 57 languages
• All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations
• Koç University ranked 16th out of 30 participants (2nd among transition-based parsers)

Changes from CoNLL17 to CoNLL18: 1. train/test split change, 2. annotation

MLP vs Tree-stack LSTM

The CoNLL 2018 committee released comparison results of CoNLL17 and CoNLL18 systems tested on the same test sets.

MLP vs Tree-stack LSTM

Two possible problems with the official comparison:

1. If the annotation of the treebank has improved, the older parser is handicapped.

2. If the training-test split has changed and the old training data are now in the test data, the old parser is favored undeservedly.

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models:

Lang Code        MLP    Tree-stack
ru taiga (10k)   58.89  60.55
hu szeged (20k)  66.21  68.18
tr imst (50k)    56.78  58.75
ar padt (120k)   67.83  68.14
en ewt (205k)    74.87  75.77
cs cac (473k)    83.39  83.57

Tree-stack LSTM outperforms MLP

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM


MLP Parser

Figure: Initial model (MLP only).

Only Action LSTM

Figure: Only Action-LSTM.

Only β-LSTM

Figure: Only β-LSTM.

Only σ-LSTM

Figure: Only σ-LSTM.

Ablation Analysis Results

Lang Code   MLP    Only Action  Only-β  Only-σ
hu szeged   66.21  66.87        66.94   67.03
sv lines    71.12  72.05        72.17   72.45
tr imst     57.12  56.87        57.02   57.12
ar padt     67.83  66.67        66.89   66.92
cs cac      83.89  82.23        83.13   83.17
en ewt      75.54  75.43        75.56   75.67

Table: Comparison between MLP and "Only" models

Ablation of t-RNN

Figure: Tree-stack LSTM architecture (t-RNN component).

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN:

Lang Code           without t-RNN  with t-RNN
no nynorsklia (3k)  51.78          53.33
ru taiga (11k)      59.13          60.55
gl treegal (15k)    69.76          70.45
hu szeged (20k)     66.12          68.18
sv lines (49k)      74.04          75.46
tr imst (50k)       58.12          58.75
ar padt (120k)      68.04          68.14
en ewt (204k)       74.87          75.77
cs cac (473k)       82.89          83.57
cs pdt (1M)         81.17          81.164

t-RNN provides a comparative advantage for low-resource languages

Ablation Analysis

Overall results of the ablation analysis:

Lang       MLP    Only A  Only-β  Only-σ  w/o t-RNN  all
hu szeged  66.21  66.87   66.94   67.03   66.12      68.18
sv lines   71.12  72.05   72.17   74.04   72.17      75.46
tr imst    57.12  56.87   57.02   57.12   58.12      58.75
ar padt    67.83  66.67   66.89   66.92   68.04      68.14
cs cac     83.89  82.23   83.13   83.17   82.89      83.57
en ewt     75.54  75.43   75.56   75.67   74.87      75.77

Tree-stack LSTM beats the other model variations

Ablation Analysis

Conclusions of the Ablation Experiments

t-RNN's performance contribution increases as the training size decreases

σ-LSTM provides more useful information independent of dataset size

Interconnecting the model's components with t-RNN makes tree-stack LSTM more powerful for low-resource languages (ranked 10th overall and 2nd among transition-based parsers)

What does Morphological Feature Embedding provide?

Contribution of Morph-feat Embeddings

Experimental Settings
We divide the CoNLL18 UD dataset (v2.2) into 4 parts based on the number of training tokens per language, to better understand our contributions:

Languages having less than 20k tokens

Languages having more than 20k, less than 50k tokens

Languages having more than 50k, less than 100k tokens

Languages having 100k tokens or more

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having less than 20k training tokens:

Lang code      Morph-Feats  no Morph-Feats  # of tokens
no nynorsklia  51.13        53.33           3,583
ru taiga       58.32        60.55           10,479
sme giella     52.78        53.39           16,385
la perseus     49.93        51.6            18,184
ug udt         52.78        53.39           19,262
sl sst         46.72        48.77           19,473
hu szeged      66.23        68.18           20,166

Not useful for languages having less than 20k training tokens

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having between 50k and 100k training tokens:

Lang code      Morph-Feats  no Morph-Feats  # of tokens
sv lines       72.18        74.81           48,325
fr sequoia     84.36        82.17           50,543
en gum         76.44        75.34           53,686
ko gsd         73.74        72.54           56,687
eu bdt         74.55        73.32           72,974
nl lassysmall  76.7         75.8            75,134
gl ctg         79.02        79.018          79,327
lv lvtb        72.33        72.24           80,666
id gsd         75.76        73.97           97,531

Beneficial for languages with 50k-100k training tokens

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having more than 100k training tokens:

Lang code  Morph-Feats  no Morph-Feats  # of tokens
fa seraji  81.18        81.12           121,064
bg btb     84.53        84.55           124,336
en ewt     75.77        75.682          204,585
ar padt    68.02        68.14           223,881
de gsd     71.59        71.32           263,804
ca ancora  85.89        85.874          417,587
es ancora  84.99        84.78           444,617
cs cac     83.57        83.63           472,608
cs pdt     81.43        82.12           1,173,282

Neutral for languages having more than 100k training tokens

Static vs Dynamic Oracle Training

Static oracle: transitions follow the gold moves
Dynamic oracle: transitions follow the predicted moves

In both cases, the log-probability of the gold moves is maximized.

Figure: Tree-stack LSTM architecture.
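The distinction can be sketched in a toy training loop: the loss always targets the gold move, and only the move that is *followed* differs. All callbacks here (maximize_logp, predict, and friends) are hypothetical stand-ins, not the thesis API.

```python
import random

# Static vs. dynamic oracle training: same training signal (the gold
# move), different state trajectory.
def train_sentence(maximize_logp, predict, gold_oracle, state,
                   apply_move, is_final, dynamic=False, explore=0.1):
    while not is_final(state):
        gold = gold_oracle(state)
        maximize_logp(state, gold)       # loss is always on the gold move
        if dynamic and random.random() < explore:
            move = predict(state)        # dynamic oracle: follow the model
        else:
            move = gold                  # static oracle: follow the gold move
        state = apply_move(state, move)
    return state

# Toy run: the "state" is a counter and parsing ends after 3 moves.
final = train_sentence(
    maximize_logp=lambda s, m: None,
    predict=lambda s: "shift",
    gold_oracle=lambda s: "shift",
    state=0,
    apply_move=lambda s, m: s + 1,
    is_final=lambda s: s >= 3,
    dynamic=True,
)
print(final)  # 3
```

Following predicted moves exposes the model to states it will actually reach at test time, which is the usual motivation for dynamic oracles.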

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k


Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k


Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k


How about languages with less than 20k training tokens?

Transfer Learning

There are 4 possible types of transfer learning:
1. Using very limited data to train an LM for word and context vectors, and using them to train a parser from scratch
2. Using Facebook's word vectors to train a parser [Bojanowski et al. 2017]
3. Using my own word and context vectors, trained on a different language from the same language family
4. Applying transfer learning with a pre-trained parser

Language      (1)           (2)    (3)    (4)
af afribooms  not provided  75.46  77.43  78.12
kk ktb        20.19         22.31  21.96  23.86
bxr bdt       7.64          9.76   9.93   8.98
kmr mg        20.12         22.57  22.78  23.39

Table: LAS values for strategies (1), (2), (3), and (4)

Transfer Learning

Conclusions of the Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the most beneficial

Training the LM from scratch does not yield useful word and context vectors

Our word and context vectors are still more useful than Facebook's [Bojanowski et al. 2017]

Projectivity

A transition-based parser can only build projective trees.6

6. Figure from https://stp.lingfil.uu.se/~sara/kurser/5LN455-2014/lectures/5LN455-F8.pdf
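Projectivity can be tested by checking whether any two dependency arcs cross; a minimal sketch:

```python
# A tree is projective iff no two dependency arcs cross. heads[i] gives
# the head of word i+1 (words are 1-indexed; 0 denotes the artificial root).
def is_projective(heads):
    arcs = [(min(h, d), max(h, d)) for d, h in zip(range(1, len(heads) + 1), heads)]
    for l1, r1 in arcs:
        for l2, r2 in arcs:
            if l1 < l2 < r1 < r2:   # the two arcs cross
                return False
    return True

print(is_projective([2, 0, 2]))     # True: word 2 is the root of a simple tree
print(is_projective([3, 4, 0, 3]))  # False: arcs (1,3) and (2,4) cross
```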

Projective vs Non-projective

We compared our model with the best model for different projectivity ratios:

Language     Projectivity  Best (LAS)  Ours (LAS)
grc perseus  90.7          79.39       55.03 (20)
eu bdt       95.13         84.22       74.13 (17)
hu szeged    97.8          82.66       68.18 (14)
da ddt       98.26         86.28       76.40 (17)
en gum       99.6          85.05       76.44 (15)
gl treegal   100           74.25       70.45 (10)
gl ctg       100           82.12       79.45 (14)

Table: Our model's performance gap decreases as the projectivity ratio increases.7

7. From the official results page and our projectivity table

Conclusions


Conclusion

In conclusion:
We introduced "Context, Word, and Morph-feat" embeddings and showed their contribution to transition-based dependency parsing.

Our Tree-stack LSTM outperformed the MLP by removing hand-crafted feature engineering.

Tree-stack LSTM performed better on low-resource languages.

As the training dataset size increases, tree-stack LSTM loses its advantage.

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization morphological taggingand dependency parsing performed better Some are also jointly trained alanguage model together with pre-trained embeddings

Attention Mechanism

Applying attention in between σ-LSTM states or β-LSTM orAction-LSTM may bring performance improvement

Morphological Features

Finding different way to represent morphological features

Dynamic Oracle vs Beam Training

Although I tried both of them I could not obtain performanceimprovement There may be convergence problems with our loss functionand another losses (CRF) may solve this problem

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap Erenay Dayanık and Deniz Yuret 2018 Tree-stackLSTM in Transition Based Dependency Parsing In Proceedings ofthe CoNLL 2018 Shared Task Multilingual Parsing from Raw Text toUniversal Dependencies

Omer Kırnap Berkay Furkan Onder and Deniz Yuret 2017 Parsingwith Context Embeddings In Proceedings of the CoNLL 2017 SharedTask Multilingual Parsing from Raw Text to Universal Dependencies

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann Carlos Gomez-Rodriguez and Giorgio Satta 2011Dynamic programming algorithms for transition-based dependencyparsers In Proceedings of the 49th Annual Meeting of theAssociation for Computational Linguistics Human LanguageTechnologies-Volume 1 Association for Computational Linguisticspages 673682

S Kbler R McDonald and J Nivre 2009 Dependency parsingMorgan amp Claypool US

Chris Dyer Miguel Ballesteros Wang Ling Austin Matthews andNoah A Smith 2015 Transition based dependency parsing withstack long-short term memory CoRR abs150508075

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123

  • Introduction
    • Overview of Dependency Parsing
    • Transition Based Dependency Parsing
      • Related Work
        • Linear Models and their Drawbacks
        • Neural Network Models
          • Model
            • Language Model
            • MLP Parser
            • Tree-stack LSTM Parser
              • Results
                • MLP vs Tree-stack LSTM
                • Morphological Feature Embeddings
                • Static vs Dynamic Oracle Training
                • Transfer Learning
                  • Conclusion
                  • Future Work amp Discussions
Page 10: Transition Based Dependency Parsing with Deep LearningTransition Based Dependency Parsing with Deep Learning Omer K rnap Ko˘c University okirnap@ku.edu.tr September 27, 2018 Omer

Omer Kırnap (Koc University) MSc Thesis September 27 2018 10 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 11 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 12 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 13 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 14 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 15 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 16 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 17 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 18 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 19 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 20 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 21 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 22 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 23 123

Problem Definition

Find a model that learns to decide the correct transition from the current state

Omer Kırnap (Koc University) MSc Thesis September 27 2018 24 123

2 Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 25 123

Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 26 123

Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 27 123

Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 28 123

Related Work

Neural Networks for Feature Conjunctions

Neural networks can handle feature conjunctions and nonlinearity. However, they are impractical for high-dimensional inputs: they scale linearly in input dimensions (in both time and space, assuming a fixed number of hidden units)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 29 123

Related Work

Solution: Using dense embeddings for input features

Omer Kırnap (Koc University) MSc Thesis September 27 2018 30 123
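The scaling argument above is the motivation for dense inputs: instead of a one-hot vector whose length is the sum of all vocabulary sizes, each feature indexes a small embedding table and the results are concatenated. A minimal sketch with toy sizes (not the thesis settings):

```python
import random

# Toy vocabulary sizes for illustration only; real vocabularies are far larger.
N_WORDS, N_POS, DIM = 1000, 17, 8
random.seed(0)
word_emb = [[random.gauss(0, 1) for _ in range(DIM)] for _ in range(N_WORDS)]
pos_emb = [[random.gauss(0, 1) for _ in range(DIM)] for _ in range(N_POS)]

def featurize(word_id, pos_id):
    # Concatenated dense vectors replace a (N_WORDS + N_POS)-dim one-hot input.
    return word_emb[word_id] + pos_emb[pos_id]

x = featurize(123, 5)
assert len(x) == 2 * DIM  # 16 dimensions instead of 1017
```

Feature conjunctions no longer need to be enumerated by hand: the hidden layer can learn them from the concatenated dense input.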

Overview

1 Introduction
   Overview of Dependency Parsing
   Transition Based Dependency Parsing

2 Related Work
   Linear Models and their Drawbacks
   Neural Network Models

3 Model
   Language Model
   MLP Parser
   Tree-stack LSTM Parser

4 Results
   MLP vs Tree-stack LSTM
   Morphological Feature Embeddings
   Static vs Dynamic Oracle Training
   Transfer Learning

5 Conclusion

6 Future Work & Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 31 123

3 Model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 32 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to Universal Dependencies

CoNLL17
• Koc-University team with MLP Parser using Context Embeddings

CoNLL18
• KParse team with Tree-stack LSTM Parser using Context and Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 33 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to Universal Dependencies

CoNLL17
• Koc-University team with MLP Parser using Context Embeddings

CoNLL18
• KParse team with Tree-stack LSTM Parser using Context and Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 34 123

a Language Model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 35 123

Language Model (LM)

The LM is used to obtain Context and Word embeddings with two components:

Character Based LSTM extracts word vectors

Word Based BiLSTM extracts context vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 36 123

Language Model - Word vectors

Character based LSTM generates word Vectors

Figure Character LSTM from Kırnap et al 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 37 123

Language Model - Context Vectors

Word based BiLSTM generates Context Vectors

Figure Word BiLSTM from Kırnap et al 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 38 123

b MLP Parser (CoNLL17)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 39 123

MLP Parser

MLP Parser consists of 4 components

Character Based LSTM extracts word vectors

Word Based BiLSTM extracts context vectors

Feature extractor describes current state

Decision module (MLP) decides the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 40 123

MLP Parser - Feature Extraction

Feature extractor describes current state

Figure: Kırnap et al 2017
Omer Kırnap (Koc University) MSc Thesis September 27 2018 41 123

MLP Parser - Decision Module

Decision module (MLP) decides the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 42 123

Experiments amp Dataset (MLP) CoNLL17

CoNLL17 Dataset

Dependency parsing of 81 treebanks in 49 languages

All treebanks use standardized annotation
• 17 universal part-of-speech tags
• 37 universal dependency relations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 43 123

Experiments - Evaluation Metric

Labeled Attachment Score (LAS): the percentage of words assigned both the correct syntactic head and the correct dependency label

[Figure: gold tree for "Economic news had" and two predicted trees — Pred 1: LAS = 0; Pred 2: LAS = (1/2) · 100 = 50]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 44 123
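LAS can be computed directly from (head, label) pairs. A small sketch with a toy three-word example (the trees are illustrative, not the exact ones from the figure):

```python
def las(gold, pred):
    """Labeled Attachment Score: % of words with correct head AND label."""
    assert len(gold) == len(pred)
    correct = sum(1 for g, p in zip(gold, pred) if g == p)
    return 100.0 * correct / len(gold)

# Toy gold tree for "Economic news had": each entry is (head_index, label).
gold  = [(2, "ATT"), (3, "SBJ"), (0, "PRED")]
pred1 = [(3, "OBJ"), (1, "SBJ"), (1, "PRED")]   # every arc wrong
pred2 = [(2, "ATT"), (3, "OBJ"), (0, "PRED")]   # one label wrong

assert las(gold, pred1) == 0.0
assert round(las(gold, pred2), 1) == 66.7
```

Note that a correct head with a wrong label counts as an error, which is what distinguishes LAS from the unlabeled attachment score (UAS).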

Experiments (MLP)

CoNLL 2017 Results (all treebanks LAS)

Ranked 1st among transition based parsers 5

5 Source: CoNLL17 official results page
Omer Kırnap (Koc University) MSc Thesis September 27 2018 45 123

Contributions in CoNLL17

Omer Kırnap (Koc University) MSc Thesis September 27 2018 46 123

Context and Word Embeddings

Relative contributions of part-of-speech (p), word vector (v), context vector (c)

Feats  Hungarian  En-ParTUT  Latvian
p      63.6       76.6       55.9
v      73.5       75.9       63.0
c      72.2       76.0       63.5
v-c    76.0       79.0       67.6
p-c    78.0       82.5       70.6
p-v    76.6       80.8       67.7
p-fb   74.7       79.7       66.3
p-v-c  79.3       83.2       74.2

Omer Kırnap (Koc University) MSc Thesis September 27 2018 47 123

Context and Word Embeddings

Relative contributions of part-of-speech (p), word vector (v), context vector (c)

Feats  Hungarian  En-ParTUT  Latvian
p      63.6       76.6       55.9
v      73.5       75.9       63.0
c      72.2       76.0       63.5
v-c    76.0       79.0       67.6
p-c    78.0       82.5       70.6
p-v    76.6       80.8       67.7
p-fb   74.7       79.7       66.3
p-v-c  79.3       83.2       74.2

Context vectors provide an independent contribution on top of POS tags

Omer Kırnap (Koc University) MSc Thesis September 27 2018 48 123

Context and Word embeddings

Feats  Hungarian  En-ParTUT  Latvian
p      63.6       76.6       55.9
v      73.5       75.9       63.0
c      72.2       76.0       63.5
v-c    76.0       79.0       67.6
p-c    78.0       82.5       70.6
p-v    76.6       80.8       67.7
p-fb   74.7       79.7       66.3
p-v-c  79.3       83.2       74.2

Our BiLSTM language model word vectors perform better than FB vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 49 123

Context and Word embeddings

Feats  Hungarian  En-ParTUT  Latvian
p      63.6       76.6       55.9
v      73.5       75.9       63.0
c      72.2       76.0       63.5
v-c    76.0       79.0       67.6
p-c    78.0       82.5       70.6
p-v    76.6       80.8       67.7
p-fb   74.7       79.7       66.3
p-v-c  79.3       83.2       74.2

Both POS tags and context vectors have significant contributions on top of word vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 50 123

Issues with MLP

However

Choosing the correct state representation of the parser remains critical

We are unable to represent the whole parsing history with feature extraction

Omer Kırnap (Koc University) MSc Thesis September 27 2018 51 123

Solution

Find a recurrent architecture that can summarize the parsing history as well as the word sequences in the buffer and stack

Omer Kırnap (Koc University) MSc Thesis September 27 2018 52 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to Universal Dependencies

CoNLL17
• Koc-University team with MLP Parser using Context Embeddings

CoNLL18
• KParse team with Tree-stack LSTM Parser using Context and Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 53 123

c Tree-stack LSTM Parser (CoNLL18)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 54 123

Related Work - Stack LSTM

Figure Stack LSTM [Dyer et al 2015]

Represent each component (σ, β, A) with an LSTM. Modify the head word's embedding with the dependent's embedding.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 55 123

Problems with Stack LSTM

They only modify the stack's word embeddings

Hidden states of the LSTMs are not updated except on reduce

Actions are not explicitly represented

They only used word2vec embeddings [Mikolov et al 2013]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 56 123

Our solution

We propose

Context embeddings should improve parsing accuracy

Dependency relations should be explicitly represented

Morphological Features of a word may enhance parsing accuracy

Omer Kırnap (Koc University) MSc Thesis September 27 2018 57 123

Tree-stack LSTM Overview

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

We propose Tree-stack LSTM model with 4 components

1 β-LSTM
2 σ-LSTM
3 Action-LSTM
4 Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 58 123

Tree-stack LSTM

Input Representation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 59 123

Input Representation

Action and Dependency Relation Embeddings

Every action is represented with continuous vector

Every dependency relation is represented with continuous vector

Omer Kırnap (Koc University) MSc Thesis September 27 2018 60 123

Input Representation

We do not include an explicit feature extractor. We initialize the word representation by concatenating:

Character Based LSTM's word vectors

Word Based BiLSTM's context vectors

Part-of-speech (POS) vectors

Morph-feat vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 61 123
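The four-way concatenation above can be sketched directly; the dimensions below are illustrative, not the thesis hyperparameters:

```python
def word_representation(word_vec, context_vec, pos_vec, feat_vec):
    # Parser input for one word: char-LSTM word vector, BiLSTM context vector,
    # POS embedding, and morph-feat embedding, concatenated end to end.
    return word_vec + context_vec + pos_vec + feat_vec

w = word_representation([0.1] * 4, [0.2] * 6, [0.3] * 2, [0.4] * 2)
assert len(w) == 4 + 6 + 2 + 2
```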

Input Representation

Morph-feat Vectors

Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs IT It

Figure Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 62 123
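One way to realize a morph-feat embedding from a UD FEATS string is to give every Feature=Value pair its own vector and combine them; the summation below is an illustrative choice, not necessarily the thesis's exact scheme:

```python
import random

DIM = 8
random.seed(1)
feat_vocab = {}  # "Feature=Value" -> embedding vector, grown on first sight

def feat_embedding(pair):
    if pair not in feat_vocab:
        feat_vocab[pair] = [random.gauss(0, 1) for _ in range(DIM)]
    return feat_vocab[pair]

def morph_feat_vector(feats):
    # "_" marks an empty FEATS column in CoNLL-U.
    pairs = feats.split("|") if feats != "_" else []
    vec = [0.0] * DIM
    for p in pairs:
        vec = [a + b for a, b in zip(vec, feat_embedding(p))]
    return vec

v = morph_feat_vector("Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs")
assert len(v) == DIM and len(feat_vocab) == 5
```

Because each Feature=Value pair has its own vector, rare feature combinations still map to a meaningful representation built from their parts.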

Tree-stack LSTM

Model Components
1 β-LSTM
2 σ-LSTM
3 Action-LSTM
4 Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 63 123

β-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 64 123

β-LSTM

LSTM LSTM LSTM

wi+2wi+1wi

Figure: Buffer's β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 65 123

σ-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 66 123

σ-LSTM

LSTM LSTM LSTM

si si+1 si+2

Figure: Stack's σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 67 123

Action-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 68 123

Action-LSTM

LSTM LSTM LSTM

Figure Action-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 69 123

How are the components of tree-stack LSTM connected?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 70 123

Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 71 123

Tree-RNN (t-RNN)

t-RNN

Dependent word

Dependency Relation

Head word

Figure t-RNN

w_head_new = tanh(W_rnn · [w_head_old ; d_l ; w_dep] + b_rnn)   (1)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 72 123
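Equation (1) in pure Python, with illustrative sizes; W and b stand in for the trainable t-RNN parameters:

```python
import math
import random

random.seed(2)
D = 4  # embedding size (illustrative)
W = [[random.gauss(0, 0.1) for _ in range(3 * D)] for _ in range(D)]
b = [0.0] * D

def t_rnn(head, rel, dep):
    # new head = tanh(W · [w_head_old ; d_l ; w_dep] + b)
    x = head + rel + dep
    return [math.tanh(sum(wi * xi for wi, xi in zip(row, x)) + bi)
            for row, bi in zip(W, b)]

new_head = t_rnn([0.5] * D, [0.1] * D, [-0.2] * D)
assert len(new_head) == D and all(-1.0 <= v <= 1.0 for v in new_head)
```

The composed vector replaces the head's old embedding, so later transitions see a head representation that already encodes its collected dependents.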

Tree-RNN with

1 Left Transition
2 Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 73 123

Left Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 74 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

LSTM LSTM LSTM LSTM

Left transition

t-RNN

Dependency Relation

HeadDependent

Figure: Each embedding is initialized by concatenating POS, language, and morph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 75 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

Figure: Stack's top LSTM is reduced
Omer Kırnap (Koc University) MSc Thesis September 27 2018 76 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 77 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure: β-LSTM recalculates its hidden state based on the new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 78 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure: Tree-stack LSTM is ready to predict the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 79 123

Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 80 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

LSTM LSTM LSTM LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure: Each embedding is initialized by concatenating POS, language, and morph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 81 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure: Stack's top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 82 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 83 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure: σ-LSTM recalculates its hidden state from the new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 84 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure: Tree-stack LSTM is ready to predict the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 85 123
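The two transitions above can be written as state updates on (stack σ, buffer β, arc set A). Word ids below are toy values, with 0 standing for ROOT:

```python
def left(stack, buffer, arcs, d):
    # left_d: (σ|s, b|β, A) -> (σ, b|β, A ∪ {(b, d, s)})
    s = stack.pop()               # dependent: top of the stack
    arcs.add((buffer[0], d, s))   # head: front of the buffer

def right(stack, buffer, arcs, d):
    # right_d: (σ|s|t, β, A) -> (σ|s, β, A ∪ {(s, d, t)})
    t = stack.pop()               # dependent: top of the stack
    arcs.add((stack[-1], d, t))   # head: the new stack top

stack, buffer, arcs = [0, 1, 2], [3, 4], set()
left(stack, buffer, arcs, "amod")     # word 3 becomes head of word 2
right(stack, buffer, arcs, "nsubj")   # ROOT (0) becomes head of word 1
assert arcs == {(3, "amod", 2), (0, "nsubj", 1)} and stack == [0]
```

In the full parser each transition also triggers the t-RNN head update and the corresponding σ-LSTM and β-LSTM recalculations shown on the preceding slides.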

Final overview of Tree-stack LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 86 123

Overview

1 Introduction
   Overview of Dependency Parsing
   Transition Based Dependency Parsing

2 Related Work
   Linear Models and their Drawbacks
   Neural Network Models

3 Model
   Language Model
   MLP Parser
   Tree-stack LSTM Parser

4 Results
   MLP vs Tree-stack LSTM
   Morphological Feature Embeddings
   Static vs Dynamic Oracle Training
   Transfer Learning

5 Conclusion

6 Future Work & Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 87 123

4 Results & Comparisons

Omer Kırnap (Koc University) MSc Thesis September 27 2018 88 123

Results & Comparisons

Dataset

CoNLL17
• Dependency parsing of 81 treebanks in 49 languages
• All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations
• Koc-University ranked 7th out of 33 participants (1st among transition based parsers)

CoNLL18
• Dependency parsing of 82 treebanks in 57 languages
• All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations
• Koc-University ranked 16th out of 30 participants (2nd among transition based parsers)

Differences between CoNLL17 and CoNLL18: 1. Train/test split change 2. Annotation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 89 123

MLP vs Tree-stack LSTM

The CoNLL 2018 committee released comparison results of CoNLL17 and CoNLL18 systems tested on the same test sets

Omer Kırnap (Koc University) MSc Thesis September 27 2018 90 123

MLP vs Tree-stack LSTM

2 possible problems with the official comparison:

1 If the annotation of the treebank has improved, the older parser is handicapped

2 If the training-test split has changed and the old training data are now in the test data, the old parser is favored undeservedly

Omer Kırnap (Koc University) MSc Thesis September 27 2018 91 123

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code        MLP    Tree-stack
ru taiga (10k)   58.89  60.55
hu szeged (20k)  66.21  68.18
tr imst (50k)    56.78  58.75
ar padt (120k)   67.83  68.14
en ewt (205k)    74.87  75.77
cs cac (473k)    83.39  83.57

Tree-stack LSTM outperforms MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 92 123

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 93 123

MLP Parser

MLP

Figure Initial model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 94 123

Only Action LSTM

LSTM LSTM

Figure Only action LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 95 123

Only β-LSTM

LSTM LSTM LSTM

LSTM LSTM MLP

Figure Only β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 96 123

Only σ-LSTM

LSTM LSTM

LSTM LSTM MLP

Figure Only σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang Code  MLP    Only Action  Only-β  Only-σ
hu szeged  66.21  66.87        66.94   67.03
sv lines   71.12  72.05        72.17   72.45
tr imst    57.12  56.87        57.02   57.12
ar padt    67.83  66.67        66.89   66.92
cs cac     83.89  82.23        83.13   83.17
en ewt     75.54  75.43        75.56   75.67

Table: Comparison between MLP and "Only" models

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 99 123

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang Code           without t-RNN  with t-RNN
no nynorsklia (3k)  51.78          53.33
ru taiga (11k)      59.13          60.55
gl treegal (15k)    69.76          70.45
hu szeged (20k)     66.12          68.18
sv lines (49k)      74.04          75.46
tr imst (50k)       58.12          58.75
ar padt (120k)      68.04          68.14
en ewt (204k)       74.87          75.77
cs cac (473k)       82.89          83.57
cs pdt (1M)         81.17          81.164

t-RNN provides a comparative advantage for low-resource languages

Omer Kırnap (Koc University) MSc Thesis September 27 2018 100 123

Ablation Analysis

Overall results of ablation analysis

Lang       MLP    Only A  Only-β  Only-σ  w/o t-RNN  all
hu szeged  66.21  66.87   66.94   67.03   66.12      68.18
sv lines   71.12  72.05   72.17   74.04   72.17      75.46
tr imst    57.12  56.87   57.02   57.12   58.12      58.75
ar padt    67.83  66.67   66.89   66.92   68.04      68.14
cs cac     83.89  82.23   83.13   83.17   82.89      83.57
en ewt     75.54  75.43   75.56   75.67   74.87      75.77

Tree-stack LSTM beats other model variations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of Ablation Experiments

t-RNN's performance contribution increases as the training size decreases

σ-LSTM provides more useful information independent of dataset size

Interconnecting the model's components with t-RNN makes tree-stack LSTM more powerful for low-resource languages (ranked 10th overall and 2nd among transition based parsers)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What does Morphological Feature Embedding provide

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental Settings
We divide the CoNLL18 UD dataset into 4 parts based on the number of training tokens for each language to better understand our contributions:

Languages having less than 20k tokens

Languages having more than 20k, less than 50k tokens

Languages having more than 50k, less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having less than 20k training tokens

Lang code      Morph-Feats  no Morph-Feats  # of tokens
no nynorsklia  51.13        53.33           3583
ru taiga       58.32        60.55           10479
sme giella     52.78        53.39           16385
la perseus     49.93        51.6            18184
ug udt         52.78        53.39           19262
sl sst         46.72        48.77           19473
hu szeged      66.23        68.18           20166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having between 50k and 100k tokens

Lang code    Morph-Feats  no Morph-Feats  # of tokens
sv lines     72.18        74.81           48325
fr sequoia   84.36        82.17           50543
en gum       76.44        75.34           53686
ko gsd       73.74        72.54           56687
eu bdt       74.55        73.32           72974
nl lassymal  76.7         75.8            75134
gl ctg       79.02        79.018          79327
lv lvtb      72.33        72.24           80666
id gsd       75.76        73.97           97531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having more than 100k training tokens

Lang code  Morph-Feats  no Morph-Feats  # of tokens
fa seraji  81.18        81.12           121064
bg btb     84.53        84.55           124336
en ewt     75.77        75.682          204585
ar padt    68.02        68.14           223881
de gsd     71.59        71.32           263804
ca ancora  85.89        85.874          417587
es ancora  84.99        84.78           444617
cs cac     83.57        83.63           472608
cs pdt     81.43        82.12           1173282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle: transitions follow gold moves
Dynamic oracle: transitions follow predicted moves

In both cases, log p of the gold moves is maximized

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123
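The distinction can be sketched as a single training step; `model` with `logp`/`predict` is a stand-in interface for illustration, not the thesis implementation:

```python
import math
import random

def train_step(state, model, gold_transition, dynamic, explore=0.9):
    gold = gold_transition(state)
    loss = -model.logp(state, gold)        # maximize log p(gold | state)
    if dynamic and random.random() < explore:
        follow = model.predict(state)      # dynamic oracle: follow the model
    else:
        follow = gold                      # static oracle: follow the gold move
    return loss, follow

class DummyModel:                          # stand-in for the real parser
    def logp(self, state, action):
        return math.log(0.5)
    def predict(self, state):
        return "shift"

loss, follow = train_step(None, DummyModel(), lambda s: "left", dynamic=False)
assert follow == "left" and loss > 0
```

Both variants backpropagate the same gold-move loss; they differ only in which transition is actually executed, so the dynamic oracle exposes the model to states it will reach at test time.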

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning:

1 Using very limited data to train an LM for word and context vectors, and using them to train a parser from scratch

2 Using Facebook's word vectors to train a parser [Bojanowski et al 2017]

3 Using my own word and context vectors trained with a different language from the same language family

4 Applying transfer learning with a pre-trained parser

Language      (1)           (2)    (3)    (4)
af afribooms  not provided  75.46  77.43  78.12
kk ktb        20.19         22.31  21.96  23.86
bxr bdt       7.64          9.76   9.93   8.98
kmr mg        20.12         22.57  22.78  23.39

Table: LAS values for strategies (1), (2), (3), and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the most beneficial

From-scratch LM training does not yield useful word and context vectors

Our word and context vectors are still more useful than Facebook's [Bojanowski et al 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

A transition based parser can only build projective trees 6

6 Figure from https://stp.lingfil.uu.se/~sara/kurser/5LN455-2014/lectures/5LN455-F8.pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123
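A tree is projective iff no two dependency arcs cross. A minimal check over a head list (with 0 reserved for ROOT), illustrative rather than the thesis code:

```python
def is_projective(heads):
    """heads[i] is the head of word i+1; 0 denotes ROOT. True iff no arcs cross."""
    arcs = [(min(h, i + 1), max(h, i + 1)) for i, h in enumerate(heads)]
    for a, b in arcs:
        for c, d in arcs:
            if a < c < b < d:   # arc (c, d) crosses arc (a, b)
                return False
    return True

assert is_projective([2, 0, 2])          # simple projective tree
assert not is_projective([3, 4, 0, 3])   # arcs (1,3) and (2,4) cross
```

Running this check over a treebank gives the projectivity ratios used to compare parsers on the next slide.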

Projective vs Non-projective

We compared our model with the best model at different projectivity ratios

Language     Projectivity %  Best (LAS)  Our (LAS)
grc perseus  90.7            79.39       55.03 (20)
eu bdt       95.13           84.22       74.13 (17)
hu szeged    97.8            82.66       68.18 (14)
da ddt       98.26           86.28       76.40 (17)
en gum       99.6            85.05       76.44 (15)
gl treegal   100             74.25       70.45 (10)
gl ctg       100             82.12       79.45 (14)

Table: Our model's performance gap decreases as the projectivity ratio increases

7 From the official results page and our projectivity table
Omer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusion:
We introduced "Context, Word, and Morph-feat" embeddings and showed their contribution to transition based dependency parsing

Our Tree-stack LSTM outperformed MLP by removing hand-crafted feature engineering

Tree-stack LSTM performed better with low resource languages

When the training dataset size increases, tree-stack LSTM loses its advantage

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization, morphological tagging, and dependency parsing performed better. Some also jointly train a language model together with pre-trained embeddings.

Attention Mechanism

Applying attention between σ-LSTM, β-LSTM, or Action-LSTM states may bring a performance improvement

Morphological Features

Finding a different way to represent morphological features

Dynamic Oracle vs Beam Training

Although I tried both of them, I could not obtain a performance improvement. There may be convergence problems with our loss function, and other losses (e.g., CRF) may solve this problem.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap, Erenay Dayanık, and Deniz Yuret. 2018. Tree-stack LSTM in Transition Based Dependency Parsing. In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap, Berkay Furkan Önder, and Deniz Yuret. 2017. Parsing with Context Embeddings. In Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann, Carlos Gómez-Rodríguez, and Giorgio Satta. 2011. Dynamic programming algorithms for transition-based dependency parsers. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Volume 1. Association for Computational Linguistics, pages 673–682.

S. Kübler, R. McDonald, and J. Nivre. 2009. Dependency Parsing. Morgan & Claypool, US.

Chris Dyer, Miguel Ballesteros, Wang Ling, Austin Matthews, and Noah A. Smith. 2015. Transition-based dependency parsing with stack long short-term memory. CoRR abs/1505.08075.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123

  • Introduction
    • Overview of Dependency Parsing
    • Transition Based Dependency Parsing
      • Related Work
        • Linear Models and their Drawbacks
        • Neural Network Models
          • Model
            • Language Model
            • MLP Parser
            • Tree-stack LSTM Parser
              • Results
                • MLP vs Tree-stack LSTM
                • Morphological Feature Embeddings
                • Static vs Dynamic Oracle Training
                • Transfer Learning
                  • Conclusion
                  • Future Work amp Discussions
Page 11: Transition Based Dependency Parsing with Deep LearningTransition Based Dependency Parsing with Deep Learning Omer K rnap Ko˘c University okirnap@ku.edu.tr September 27, 2018 Omer

Omer Kırnap (Koc University) MSc Thesis September 27 2018 11 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 12 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 13 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 14 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 15 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 16 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 17 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 18 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 19 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 20 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 21 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 22 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 23 123

Problem Definition

Find a model learning to decide correct transition from current state

Omer Kırnap (Koc University) MSc Thesis September 27 2018 24 123

2 Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 25 123

Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 26 123

Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 27 123

Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 28 123

Related Work

Neural Networks for Feature Conjunctions

Neural networks can handle feature conjunctions and nonlinearityHoweverImpractical for high dimensional inputs they scale linearly in inputdimensions (in both time and space assuming fixed number of hiddenunits)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 29 123

Related Work

Solution Using dense embeddings for input features

Omer Kırnap (Koc University) MSc Thesis September 27 2018 30 123

Overview

1 IntroductionOverview of Dependency ParsingTransition Based Dependency Parsing

2 Related WorkLinear Models and their DrawbacksNeural Network Models

3 ModelLanguage ModelMLP ParserTree-stack LSTM Parser

4 ResultsMLP vs Tree-stack LSTMMorphological Feature EmbeddingsStatic vs Dynamic Oracle TrainingTransfer Learning

5 Conclusion6 Future Work amp Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 31 123

3 Model

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to Universal Dependencies:

CoNLL17
• Koc-University team with MLP Parser using Context Embeddings

CoNLL18
• KParse team with Tree-stack LSTM Parser using Context and Morph-feat Embeddings


a Language Model

Language Model (LM)

The LM is used to obtain Context and Word embeddings, with two components:

Character Based LSTM extracts word vectors.

Word Based BiLSTM extracts context vectors.

Language Model - Word Vectors

Character based LSTM generates word vectors.

Figure: Character LSTM, from Kırnap et al. 2017

Language Model - Context Vectors

Word based BiLSTM generates context vectors.

Figure: Word BiLSTM, from Kırnap et al. 2017
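As a rough sketch of how a character-based recurrence yields a vector for any word, including unseen ones (a plain tanh recurrence stands in for the LSTM; the character inventory and all sizes are made-up assumptions):

```python
import numpy as np

rng = np.random.default_rng(1)
hidden = 32
# Hypothetical character inventory; a real model uses the training charset.
chars = {c: i for i, c in enumerate("abcdefghijklmnopqrstuvwxyz")}
E = rng.normal(scale=0.1, size=(len(chars), 16))   # character embeddings
Wx = rng.normal(scale=0.1, size=(hidden, 16))
Wh = rng.normal(scale=0.1, size=(hidden, hidden))

def word_vector(word):
    """Run a simple tanh recurrence over the characters; the final hidden
    state serves as the word's vector, so no word is out-of-vocabulary."""
    h = np.zeros(hidden)
    for c in word:
        h = np.tanh(Wx @ E[chars[c]] + Wh @ h)
    return h

print(word_vector("economic").shape)  # (32,)
```

The word-level BiLSTM then runs over these word vectors to produce a context vector per token.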

b MLP Parser (CoNLL17)

MLP Parser

The MLP Parser consists of 4 components:

Character Based LSTM extracts word vectors.

Word Based BiLSTM extracts context vectors.

Feature extractor describes the current state.

Decision module (MLP) decides the next transition.

MLP Parser - Feature Extraction

Feature extractor describes the current state.

Figure: Kırnap et al. 2017

MLP Parser - Decision Module

Decision module (MLP) decides the next transition.
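The decision module can be sketched as a one-hidden-layer network scoring transitions from the extracted state features; the dimensions and the three-way transition set below are illustrative assumptions, not the parser's actual configuration:

```python
import numpy as np

rng = np.random.default_rng(2)
feat_dim, hidden, n_transitions = 120, 64, 3  # illustrative sizes

W1 = rng.normal(scale=0.1, size=(hidden, feat_dim))
b1 = np.zeros(hidden)
W2 = rng.normal(scale=0.1, size=(n_transitions, hidden))
b2 = np.zeros(n_transitions)

TRANSITIONS = ["shift", "left-arc", "right-arc"]

def next_transition(state_features):
    """One hidden layer with tanh, then a softmax over parser transitions."""
    h = np.tanh(W1 @ state_features + b1)
    scores = W2 @ h + b2
    probs = np.exp(scores - scores.max())
    probs /= probs.sum()
    return TRANSITIONS[int(np.argmax(probs))]

print(next_transition(rng.normal(size=feat_dim)))
```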

Experiments & Dataset (MLP) CoNLL17

CoNLL17 Dataset

Dependency parsing of 81 treebanks in 49 languages.

All treebanks use standardized annotation:
• 17 universal part-of-speech tags
• 37 universal dependency relations

Experiments - Evaluation Metric

Labeled Attachment Score (LAS): the percentage of words correctly assigned both the correct syntactic head and the correct dependency label.

Figure: for "Economic news had", the gold tree (arcs ATT, SBJ) has LAS 1; a prediction with arcs OBJ and PRED has LAS 0; a prediction with arcs ATT and OBJ has LAS (1/2) · 100 = 50.
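The metric can be stated directly in code; the `las` helper and the toy (head, label) annotations below are illustrative:

```python
def las(gold, pred):
    """Labeled Attachment Score: percentage of tokens whose predicted
    (head, label) pair exactly matches the gold pair."""
    assert len(gold) == len(pred)
    correct = sum(g == p for g, p in zip(gold, pred))
    return 100.0 * correct / len(gold)

# (head index, label) per dependent token for "Economic news had":
gold  = [(2, "ATT"), (3, "SBJ")]
pred2 = [(2, "ATT"), (3, "OBJ")]   # heads right, one label wrong
print(las(gold, pred2))  # 50.0
```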

Experiments (MLP)

CoNLL 2017 Results (all treebanks, LAS)

Ranked 1st among transition based parsers.5

5 Source: CoNLL17 official results page

Contributions in CoNLL17

Context and Word Embeddings

Relative contributions of part-of-speech (p), word vector (v), context vector (c):

Feats   Hungarian   En-ParTUT   Latvian
p       63.6        76.6        55.9
v       73.5        75.9        63
c       72.2        76          63.5
v-c     76          79          67.6
p-c     78          82.5        70.6
p-v     76.6        80.8        67.7
p-fb    74.7        79.7        66.3
p-v-c   79.3        83.2        74.2

Context and Word Embeddings

(Same table as above.)

Context vectors provide an independent contribution on top of POS tags.

Context and Word Embeddings

(Same table as above.)

Our BiLSTM language model word vectors perform better than the FB vectors (compare p-v and p-fb).

Context and Word Embeddings

(Same table as above.)

Both POS tags and context vectors have significant contributions on top of word vectors.

Issues with MLP

However:

Choosing the correct state representation of the parser remains critical.

We are unable to represent the whole parsing history with feature extraction.

Solution

Find a recurrent architecture that can summarize the parsing history as well as the word sequences in the buffer and stack.


c Tree-stack LSTM Parser (CoNLL18)

Related Work - Stack LSTM

Figure: Stack LSTM [Dyer et al 2015]

Represent each component (σ, β, A) with an LSTM; the head word's embedding is modified with the dependent's embedding.

Problems with Stack LSTM

They only modify the stack's word embeddings.

Hidden states of the LSTMs are not updated unless a reduce occurs.

Actions are not explicitly represented.

They only used word2vec embeddings [Mikolov et al 2013].

Our solution

We propose:

Context embeddings should improve parsing accuracy.

Dependency relations should be explicitly represented.

Morphological features of a word may enhance parsing accuracy.

Tree-stack LSTM Overview

Figure: Tree-stack LSTM architecture: β-LSTM, σ-LSTM, and Action-LSTM outputs are concatenated and fed to an MLP, while a t-RNN combines head word, dependent word, and dependency relation embeddings.

We propose the Tree-stack LSTM model with 4 components:

β-LSTM
σ-LSTM
Action-LSTM
Tree-RNN

Tree-stack LSTM

Input Representation

Action and Dependency Relation Embeddings

Every action is represented with a continuous vector.

Every dependency relation is represented with a continuous vector.

Input Representation

We do not include an explicit feature extractor. We initialize each word representation by concatenating:

Character Based LSTM's word vectors

Word Based BiLSTM's context vectors

Part-of-speech (POS) vectors

Morph-feat vectors
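A sketch of the concatenation, with made-up dimensions standing in for the actual hyperparameters:

```python
import numpy as np

# Illustrative dimensions, not the thesis's actual hyperparameters.
word_vec    = np.zeros(350)  # from the character LSTM
context_vec = np.zeros(300)  # from the word BiLSTM
pos_vec     = np.zeros(50)   # learned POS embedding
feat_vec    = np.zeros(50)   # morph-feat embedding

# One dense vector per token, no hand-crafted feature templates.
token_repr = np.concatenate([word_vec, context_vec, pos_vec, feat_vec])
print(token_repr.shape)  # (750,)
```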

Input Representation

Morph-feat Vectors

Figure: Morph-feat embeddings, e.g. "It" with Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs

Tree-stack LSTM

Model Components:
1. β-LSTM
2. σ-LSTM
3. Action-LSTM
4. Tree-RNN

β-LSTM

Figure: the β-LSTM highlighted within the Tree-stack LSTM architecture.

Figure: Buffer's β-LSTM, an LSTM running over the buffer words wi, wi+1, wi+2.

σ-LSTM

Figure: the σ-LSTM highlighted within the Tree-stack LSTM architecture.

Figure: Stack's σ-LSTM, an LSTM running over the stack words si, si+1, si+2.

Action-LSTM

Figure: the Action-LSTM highlighted within the Tree-stack LSTM architecture.

Figure: Action-LSTM, an LSTM running over the transition history.

How are the components of the tree-stack LSTM connected?

Tree-RNN

Tree-RNN (t-RNN)

Figure: t-RNN combining the head word, dependent word, and dependency relation.

w_head_new = tanh(W_rnn · [w_head_old ; d_l ; w_dep] + b_rnn)   (1)
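Equation (1) can be sketched directly; `W_rnn`, `b_rnn`, and the dimensions below are illustrative stand-ins for the learned parameters:

```python
import numpy as np

rng = np.random.default_rng(3)
dim, rel_dim = 64, 16  # illustrative sizes
W_rnn = rng.normal(scale=0.1, size=(dim, 2 * dim + rel_dim))
b_rnn = np.zeros(dim)

def trnn(head, rel, dep):
    """Eq. (1): fold a dependent and its relation into the head's embedding."""
    return np.tanh(W_rnn @ np.concatenate([head, rel, dep]) + b_rnn)

new_head = trnn(rng.normal(size=dim), rng.normal(size=rel_dim), rng.normal(size=dim))
print(new_head.shape)  # (64,)
```

Each attachment thus updates the head's representation in place, so the stack and buffer LSTMs see the subtree, not just the head word.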

Tree-RNN with:

1. Left Transition
2. Right Transition

Left Transition

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: Each embedding is initiated by concatenating POS, language, and morph-feat embeddings.

Figure: Stack's top LSTM is reduced.

Figure: t-RNN calculates the new head embedding.

Figure: β-LSTM recalculates its hidden state based on the new input.

Figure: Tree-stack LSTM is ready to give the new transition.

Right Transition

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: Each embedding is initiated by concatenating POS, language, and morph-feat embeddings.

Figure: Stack's top LSTM is reduced.

Figure: t-RNN calculates the new head embedding.

Figure: σ-LSTM recalculates its hidden state from the new input.

Figure: Tree-stack LSTM is ready to give the new transition.
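The two transition rules above can be sketched as pure functions on a (stack, buffer, arcs) state, following the formulas (the token strings are illustrative):

```python
def left_arc(stack, buffer, arcs, d):
    """left_d: pop stack top s; it becomes a d-dependent of buffer front b."""
    s, b = stack[-1], buffer[0]
    return stack[:-1], buffer, arcs | {(b, d, s)}

def right_arc(stack, buffer, arcs, d):
    """right_d: pop stack top t; it becomes a d-dependent of s below it."""
    s, t = stack[-2], stack[-1]
    return stack[:-1], buffer, arcs | {(s, d, t)}

stack, buffer, arcs = ["news", "had"], ["effect"], set()
stack, buffer, arcs = right_arc(stack, buffer, arcs, "SBJ")
print(stack, arcs)  # ['news'] {('news', 'SBJ', 'had')}
```

Each arc added here is the point where the t-RNN would recompute the head's embedding.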

Final overview of Tree-stack LSTM

Figure: the complete Tree-stack LSTM: β-LSTM, σ-LSTM, and Action-LSTM outputs are concatenated and fed to an MLP, with the t-RNN composing head word, dependent word, and dependency relation embeddings.

Overview

1. Introduction: Overview of Dependency Parsing; Transition Based Dependency Parsing

2. Related Work: Linear Models and their Drawbacks; Neural Network Models

3. Model: Language Model; MLP Parser; Tree-stack LSTM Parser

4. Results: MLP vs Tree-stack LSTM; Morphological Feature Embeddings; Static vs Dynamic Oracle Training; Transfer Learning

5. Conclusion

6. Future Work & Discussions

4 Results & Comparisons

Results & Comparisons

Dataset

CoNLL17:
• Dependency parsing of 81 treebanks in 49 languages
• All treebanks use standardized annotation: 17 universal part-of-speech tags and 37 universal dependency relations
• Koc-University ranked 7th out of 33 participants (1st among transition based parsers)

CoNLL18:
• Dependency parsing of 82 treebanks in 57 languages
• All treebanks use standardized annotation: 17 universal part-of-speech tags and 37 universal dependency relations
• Koc-University ranked 16th out of 30 participants (2nd among transition based parsers)

Changes from CoNLL17 to CoNLL18: 1. train/test split changes; 2. annotation changes.

MLP vs Tree-stack LSTM

The CoNLL 2018 committee released comparison results of CoNLL17 and CoNLL18 systems tested under the same test sets.

Two possible problems with the official comparison:

1. If the annotation of the treebank has been improved, the older parser is handicapped.

2. If the training-test split has changed and the old training data are now in the test data, the old parser is favored undeservedly.

Experiments with the same train-test datasets to compare models:

Lang Code        MLP     Tree-stack
ru taiga (10k)   58.89   60.55
hu szeged (20k)  66.21   68.18
tr imst (50k)    56.78   58.75
ar padt (120k)   67.83   68.14
en ewt (205k)    74.87   75.77
cs cac (473k)    83.39   83.57

Tree-stack LSTM outperforms MLP.

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM:

MLP Parser. Figure: the initial model.

Only Action LSTM. Figure: only the action LSTM.

Only β-LSTM. Figure: only the β-LSTM.

Only σ-LSTM. Figure: only the σ-LSTM.

Ablation Analysis Results

Lang Code   MLP     Only Action   Only-β   Only-σ
hu szeged   66.21   66.87         66.94    67.03
sv lines    71.12   72.05         72.17    72.45
tr imst     57.12   56.87         57.02    57.12
ar padt     67.83   66.67         66.89    66.92
cs cac      83.89   82.23         83.13    83.17
en ewt      75.54   75.43         75.56    75.67

Table: Comparison between MLP and "Only" models.

Ablation of t-RNN

Figure: the t-RNN within the Tree-stack LSTM architecture.

Comparison of stack-LSTMs with and without t-RNN:

Lang Code            without t-RNN   with t-RNN
no nynorsklia (3k)   51.78           53.33
ru taiga (11k)       59.13           60.55
gl treegal (15k)     69.76           70.45
hu szeged (20k)      66.12           68.18
sv lines (49k)       74.04           75.46
tr imst (50k)        58.12           58.75
ar padt (120k)       68.04           68.14
en ewt (204k)        74.87           75.77
cs cac (473k)        82.89           83.57
cs pdt (1M)          81.17           81.164

t-RNN provides a comparative advantage for low-resource languages.

Ablation Analysis

Overall results of ablation analysis:

Lang       MLP     Only A   Only-β   Only-σ   w/o t-RNN   all
hu szeged  66.21   66.87    66.94    67.03    66.12       68.18
sv lines   71.12   72.05    72.17    74.04    72.17       75.46
tr imst    57.12   56.87    57.02    57.12    58.12       58.75
ar padt    67.83   66.67    66.89    66.92    68.04       68.14
cs cac     83.89   82.23    83.13    83.17    82.89       83.57
en ewt     75.54   75.43    75.56    75.67    74.87       75.77

Tree-stack LSTM beats the other model variations.

Conclusions of Ablation Experiments:

t-RNN's performance contribution increases as the training size decreases.

σ-LSTM provides more useful information, independent of dataset size.

Interconnecting the model's components with t-RNN makes tree-stack LSTM more powerful for low-resource languages (ranked 10th overall and 2nd among transition based parsers).

What does Morphological Feature Embedding provide?

Contribution of Morph-feat Embeddings

Experimental Settings: We divide the CoNLL18 UD dataset (v2.2) into 4 parts based on the number of training tokens per language, to better understand our contributions:

Languages having less than 20k tokens

Languages having more than 20k but less than 50k tokens

Languages having more than 50k but less than 100k tokens

Languages having 100k tokens or more

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having less than 20k training tokens:

Lang code      Morph-Feats   no Morph-Feats   # of tokens
no nynorsklia  51.13         53.33            3583
ru taiga       58.32         60.55            10479
sme giella     52.78         53.39            16385
la perseus     49.93         51.6             18184
ug udt         52.78         53.39            19262
sl sst         46.72         48.77            19473
hu szeged      66.23         68.18            20166

Not useful for languages having less than 20k training tokens.

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having between 50k and 100k training tokens:

Lang code      Morph-Feats   no Morph-Feats   # of tokens
sv lines       72.18         74.81            48325
fr sequoia     84.36         82.17            50543
en gum         76.44         75.34            53686
ko gsd         73.74         72.54            56687
eu bdt         74.55         73.32            72974
nl lassysmall  76.7          75.8             75134
gl ctg         79.02         79.018           79327
lv lvtb        72.33         72.24            80666
id gsd         75.76         73.97            97531

Beneficial for languages with 50k-100k training tokens.

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having more than 100k training tokens:

Lang code   Morph-Feats   no Morph-Feats   # of tokens
fa seraji   81.18         81.12            121064
bg btb      84.53         84.55            124336
en ewt      75.77         75.682           204585
ar padt     68.02         68.14            223881
de gsd      71.59         71.32            263804
ca ancora   85.89         85.874           417587
es ancora   84.99         84.78            444617
cs cac      83.57         83.63            472608
cs pdt      81.43         82.12            1173282

Neutral for languages having more than 100k training tokens.

Static vs Dynamic Oracle Training

Static oracle: transitions follow the gold moves.
Dynamic oracle: transitions follow the predicted moves.

In both cases the log probability of the gold moves is maximized.

Figure: the Tree-stack LSTM architecture trained under both regimes.
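The difference between the two regimes can be sketched as a single training step; the toy scorer and random predictor below stand in for the actual network:

```python
import math
import random

random.seed(0)
MOVES = ["shift", "left-arc", "right-arc"]

def log_prob(state, move):          # toy scorer standing in for the model
    return math.log(0.5 if move == state["gold"] else 0.25)

def apply(state, move):
    return {**state, "history": state["history"] + [move]}

def train_step(state, gold_move, dynamic):
    """Both oracles maximize log p(gold move); they differ only in which
    move advances the parser state during training."""
    loss = -log_prob(state, gold_move)
    predicted = random.choice(MOVES)    # stand-in for the model's argmax
    next_move = predicted if dynamic else gold_move
    return loss, apply(state, next_move)

s = {"gold": "shift", "history": []}
loss, s2 = train_step(s, "shift", dynamic=False)
print(round(loss, 3), s2["history"])  # 0.693 ['shift']
```

With `dynamic=True` the parser explores states its own mistakes produce, while the loss still targets the gold move.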

Static vs Dynamic Oracle Training

Figure: Results are very close for training tokens less than 20k.

Figure: Results are very close for training tokens between 20k and 50k.

Figure: Results are very close for training tokens more than 50k.

How about languages with less than 20k training tokens?

Transfer Learning

There are 4 possible types of transfer learning:

1. Using very limited data to train an LM for word and context vectors, and using them to train a parser from scratch
2. Using Facebook's word vectors to train a parser [Bojanowski et al 2017]
3. Using my own word and context vectors trained on a different language from the same language family
4. Applying transfer learning with a pre-trained parser

Language      (1)            (2)     (3)     (4)
af afribooms  not provided   75.46   77.43   78.12
kk ktb        20.19          22.31   21.96   23.86
bxr bdt       7.64           9.76    9.93    8.98
kmr mg        20.12          22.57   22.78   23.39

Table: LAS values for strategies (1), (2), (3) and (4).

Transfer Learning

Conclusions of Transfer Learning Experiments:

Applying transfer learning with a pre-trained parser is the most beneficial.

Training the LM from scratch on limited data does not yield useful word and context vectors.

Our word and context vectors are still more useful than Facebook's [Bojanowski et al 2017].

Projectivity

Transition based parsers can only build projective trees.6

6 Figure from https://stp.lingfil.uu.se/~sara/kurser/5LN455-2014/lectures/5LN455-F8.pdf

Projective vs Non-projective

We compared our model with the best model for different projectivity ratios:

Language      Projectivity   Best (LAS)   Our (LAS)
grc perseus   90.7           79.39        55.03 (20)
eu bdt        95.13          84.22        74.13 (17)
hu szeged     97.8           82.66        68.18 (14)
da ddt        98.26          86.28        76.40 (17)
en gum        99.6           85.05        76.44 (15)
gl treegal    100            74.25        70.45 (10)
gl ctg        100            82.12        79.45 (14)

Table: Our model's performance gap decreases as the projectivity ratio increases.7

7 From the official results page and our projectivity table.

Conclusion

In conclusion, we introduced "Context, Word and Morph-feat" embeddings and showed their contribution in transition based dependency parsing.

Our Tree-stack LSTM outperformed the MLP while removing hand-crafted feature engineering.

Tree-stack LSTM performed better on low-resource languages.

As the training dataset size increases, tree-stack LSTM loses its advantage.

Future Research Directions

End-to-End Training

Systems that are jointly trained for tokenization, morphological tagging, and dependency parsing performed better. Some also jointly train a language model together with pre-trained embeddings.

Attention Mechanism

Applying attention between σ-LSTM states, β-LSTM states, or Action-LSTM states may bring a performance improvement.

Morphological Features

Finding a different way to represent morphological features.

Dynamic Oracle vs Beam Training

Although I tried both of them, I could not obtain a performance improvement. There may be convergence problems with our loss function; other losses (e.g. a CRF loss) may solve this problem.

Publications

Omer Kırnap, Erenay Dayanık, and Deniz Yuret. 2018. Tree-stack LSTM in Transition Based Dependency Parsing. In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap, Berkay Furkan Onder, and Deniz Yuret. 2017. Parsing with Context Embeddings. In Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

References

Marco Kuhlmann, Carlos Gómez-Rodríguez, and Giorgio Satta. 2011. Dynamic programming algorithms for transition-based dependency parsers. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Volume 1, pages 673-682. Association for Computational Linguistics.

S. Kübler, R. McDonald, and J. Nivre. 2009. Dependency Parsing. Morgan & Claypool, US.

Chris Dyer, Miguel Ballesteros, Wang Ling, Austin Matthews, and Noah A. Smith. 2015. Transition-based dependency parsing with stack long short-term memory. CoRR abs/1505.08075.

Thank you for your attention!

Questions?

Page 12: Transition Based Dependency Parsing with Deep LearningTransition Based Dependency Parsing with Deep Learning Omer K rnap Ko˘c University okirnap@ku.edu.tr September 27, 2018 Omer

Omer Kırnap (Koc University) MSc Thesis September 27 2018 12 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 13 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 14 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 15 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 16 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 17 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 18 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 19 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 20 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 21 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 22 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 23 123

Problem Definition

Find a model learning to decide correct transition from current state

Omer Kırnap (Koc University) MSc Thesis September 27 2018 24 123

2 Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 25 123

Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 26 123

Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 27 123

Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 28 123

Related Work

Neural Networks for Feature Conjunctions

Neural networks can handle feature conjunctions and nonlinearityHoweverImpractical for high dimensional inputs they scale linearly in inputdimensions (in both time and space assuming fixed number of hiddenunits)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 29 123

Related Work

Solution Using dense embeddings for input features

Omer Kırnap (Koc University) MSc Thesis September 27 2018 30 123

Overview

1 IntroductionOverview of Dependency ParsingTransition Based Dependency Parsing

2 Related WorkLinear Models and their DrawbacksNeural Network Models

3 ModelLanguage ModelMLP ParserTree-stack LSTM Parser

4 ResultsMLP vs Tree-stack LSTMMorphological Feature EmbeddingsStatic vs Dynamic Oracle TrainingTransfer Learning

5 Conclusion6 Future Work amp Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 31 123

3 Model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 32 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to UniversalDependencies

CoNLL17bull Koc-University team with MLP Parser using Context Embeddings

CoNLL18bull KParse team with Tree-stack LSTM Parser using Context and

Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 33 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to UniversalDependencies

CoNLL17

bull Koc-University team with MLP Parser using ContextEmbeddings

CoNLL18

bull KParse team with Tree-stack LSTM Parser using Context and Morph-featEmbeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 34 123

a Language Model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 35 123

Language Model (LM)

LM is used to obtain Context and Word embeddings with twocomponents

Character Based LSTM extracts word vectors

Word Based BiLSTM extracts context vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 36 123

Language Model - Word vectors

Character based LSTM generates word Vectors

Figure Character LSTM from Kırnap et al 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 37 123

Language Model - Context Vectors

Word based BiLSTM generates Context Vectors

Figure Word BiLSTM from Kırnap et al 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 38 123

b MLP Parser (CoNLL17)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 39 123

MLP Parser

MLP Parser consists of 4 components

Character Based LSTM extracts word vectors

Word Based BiLSTM extracts context vectors

Feature extractor describes current state

Decision module (MLP) decides the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 40 123

MLP Parser - Feature Extraction

Feature extractor describes current state

Figure Kırnap et al 2017Omer Kırnap (Koc University) MSc Thesis September 27 2018 41 123

MLP Parser - Decision Module

Decision module (MLP) decides the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 42 123

Experiments amp Dataset (MLP) CoNLL17

CoNLL17 Dataset

Dependency parsing of 81 treebanks in 49 languages

All treebanks use standardized annotationbull 17 universal part-of-speech tagsbull 37 universal dependency relations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 43 123

Experiments - Evaluation Metric

Labeled Attachment Score (LAS)The percentage of words correctly assigned both the correct syntactic headand the correct dependency label

Economic news hadGold Tree LAS 1

SBJATT

Economic news had

PREDOBJPred 1 LAS 0

Economic news had

OBJATTPred 2 LAS (frac12)100

Omer Kırnap (Koc University) MSc Thesis September 27 2018 44 123

Experiments (MLP)

CoNLL 2017 Results (all treebanks LAS)

Ranked 1st among transition based parsers 5

5Source CoNLL17 official results pageOmer Kırnap (Koc University) MSc Thesis September 27 2018 45 123

Contributions in CoNLL17

Omer Kırnap (Koc University) MSc Thesis September 27 2018 46 123

Context and Word Embeddings

Relative contributions of part-of-speech (p) word vector (v)context vector (c)

Feats Hungarian En-ParTUT Latvianp 636 766 559

v 735 759 63

c 722 76 635

v-c 76 79 676

p-c 78 825 706

p-v 766 808 677

p-fb 747 797 663

p-v-c 793 832 742

Omer Kırnap (Koc University) MSc Thesis September 27 2018 47 123

Context and Word Embeddings

Relative contributions of part-of-speech (p) word vector (v)context vector (c)

Feats Hungarian En-ParTUT Latvianp 636 766 559

v 735 759 63

c 722 76 635

v-c 76 79 676

p-c 78 825 706

p-v 766 808 677

p-fb 747 797 663

p-v-c 793 832 742

Context vectors provide independent contribution on top ofPOS tags

Omer Kırnap (Koc University) MSc Thesis September 27 2018 48 123

Context and Word embeddings

Feats Hungarian En-ParTUT Latvianp 636 766 559

v 735 759 63

c 722 76 635

v-c 76 79 676

p-c 78 825 706

p-v 766 808 677p-fb 747 797 663

p-v-c 793 832 742

Our BiLSTM language model word vectors perform betterthan FB vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 49 123

Context and Word embeddings

Feats Hungarian En-ParTUT Latvianp 636 766 559

v 735 759 63

c 722 76 635

v-c 76 79 676

p-c 78 825 706

p-v 766 808 677

p-fb 747 797 663

p-v-c 793 832 742

Both POS tags and context vectors have significantcontributions on top of word vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 50 123

Issues with MLP

However

Choosing correct state of parser still remains critical

We are unable to represent whole parsing history with featureextracting

Omer Kırnap (Koc University) MSc Thesis September 27 2018 51 123

Solution

Find a recurrent architecture such that it can summarize the parsinghistory as well as word sequences in a buffer and stack

Omer Kırnap (Koc University) MSc Thesis September 27 2018 52 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to UniversalDependencies

CoNLL17

bull Koc-University team with MLP Parser using Context Embeddings

CoNLL18

bull KParse team with Tree-stack LSTM Parser usingContext and Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 53 123

c Tree-stack LSTM Parser (CoNLL18)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 54 123

Related Work - Stack LSTM

Figure Stack LSTM [Dyer et al 2015]

Represent each component (σ, β, A) with an LSTM
Modify the head word's embedding with the dependent's embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 55 123

Problems with Stack LSTM

They only modify the stack's word embeddings

Hidden states of the LSTMs are not updated unless a reduce occurs

Actions are not explicitly represented

They only used word2vec embeddings [Mikolov et al 2013]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 56 123

Our solution

We propose

Context embeddings should improve parsing accuracy

Dependency relations should be explicitly represented

Morphological Features of a word may enhance parsing accuracy

Omer Kırnap (Koc University) MSc Thesis September 27 2018 57 123

Tree-stack LSTM Overview

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

We propose Tree-stack LSTM model with 4 components

β-LSTM
σ-LSTM
Action-LSTM
Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 58 123

Tree-stack LSTM

Input Representation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 59 123

Input Representation

Action and Dependency Relation Embeddings

Every action is represented with a continuous vector

Every dependency relation is represented with a continuous vector

Omer Kırnap (Koc University) MSc Thesis September 27 2018 60 123

Input Representation

We do not include an explicit feature extractor. We initialize the word representation by concatenating:

Character-based LSTM's word vectors

Word-based BiLSTM's context vectors

Part-of-speech (POS) vectors

Morph-feat vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 61 123
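As a concrete sketch of the concatenation above (dimensions are made-up toy sizes, not the ones used in the thesis):

```python
import numpy as np

# Toy dimensions, chosen only for illustration.
WORD_DIM, CTX_DIM, POS_DIM, FEAT_DIM = 4, 6, 3, 3

def input_representation(word_vec, context_vec, pos_vec, feat_vec):
    # One input embedding per token: char-LSTM word vector, BiLSTM
    # context vector, POS vector, and morph-feat vector, concatenated
    # into a single dense representation.
    return np.concatenate([word_vec, context_vec, pos_vec, feat_vec])

rep = input_representation(np.zeros(WORD_DIM), np.zeros(CTX_DIM),
                           np.zeros(POS_DIM), np.zeros(FEAT_DIM))
```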

Input Representation

Morph-feat Vectors

Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs IT It

Figure Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 62 123
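One plausible reading of the figure is that each Feature=Value pair from the CoNLL-U FEATS string gets its own embedding and the pairs are combined into one vector; the sketch below sums them (the exact combination used in the thesis may differ, and the dimension is a toy size):

```python
import numpy as np

rng = np.random.default_rng(0)
DIM = 8
# Hypothetical embedding table keyed by "Feature=Value" strings;
# in training these would be learned parameters.
feat_embeddings = {}

def morph_feat_vector(feats):
    # Turn a FEATS string like "Case=Nom|Gender=Neut|Number=Sing"
    # into a single vector by summing one embedding per pair.
    if feats == "_":
        return np.zeros(DIM)
    vecs = []
    for pair in feats.split("|"):
        if pair not in feat_embeddings:
            feat_embeddings[pair] = rng.normal(size=DIM)
        vecs.append(feat_embeddings[pair])
    return np.sum(vecs, axis=0)

v = morph_feat_vector("Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs")
```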

Tree-stack LSTM

Model Components
1 β-LSTM
2 σ-LSTM
3 Action-LSTM
4 Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 63 123

β-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 64 123

β-LSTM

LSTM LSTM LSTM

wi+2wi+1wi

Figure Buffer's β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 65 123

σ-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 66 123

σ-LSTM

LSTM LSTM LSTM

si si+1 si+2

Figure Stack's σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 67 123

Action-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 68 123

Action-LSTM

LSTM LSTM LSTM

Figure Action-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 69 123

How are the components of tree-stack LSTM connected?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 70 123

Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 71 123

Tree-RNN (t-RNN)

t-RNN

Dependent word

Dependency Relation

Head word

Figure t-RNN

w_head_new = tanh(W_rnn · [w_head_old ; d_l ; w_dep] + b_rnn)    (1)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 72 123
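A minimal numerical sketch of Eq. (1); random weights stand in for the learned W_rnn and b_rnn, and the dimensions are illustrative:

```python
import numpy as np

D, R = 6, 3                               # toy word / relation embedding sizes
rng = np.random.default_rng(1)
W_rnn = rng.normal(size=(D, 2 * D + R))   # learned in practice; random here
b_rnn = np.zeros(D)

def t_rnn(w_head_old, d_l, w_dep):
    # Eq. (1): compose head, relation and dependent embeddings into a
    # new head embedding with the same size as a word embedding.
    x = np.concatenate([w_head_old, d_l, w_dep])
    return np.tanh(W_rnn @ x + b_rnn)

new_head = t_rnn(np.ones(D), np.ones(R), np.ones(D))
```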

Tree-RNN with

1 Left Transition2 Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 73 123

Left Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 74 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

LSTM LSTM LSTM LSTM

Left transition

t-RNN

Dependency Relation

HeadDependent

Figure Each embedding is initialized by concatenating POS, language, and morph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 75 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

Figure Stack's top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 76 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 77 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure β-LSTM recalculates its hidden based on new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 78 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure Tree-stack LSTM is ready to give new transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 79 123

Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 80 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

LSTM LSTM LSTM LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure Each embedding is initialized by concatenating POS, language, and morph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 81 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure Stack's top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 82 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 83 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure σ-LSTM recalculates its hidden from new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 84 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure Tree-stack LSTM is ready to give new transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 85 123
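The two reduce transitions can be written directly against a (stack, buffer, arcs) parser state. This is a schematic sketch of the formal definitions above, storing each arc as a (head, label, dependent) triple:

```python
def left_arc(stack, buffer, arcs, d):
    # left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)}):
    # the stack top s becomes a d-dependent of the buffer front b.
    s = stack.pop()
    arcs.add((buffer[0], d, s))

def right_arc(stack, buffer, arcs, d):
    # right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)}):
    # the stack top t becomes a d-dependent of s below it.
    t = stack.pop()
    arcs.add((stack[-1], d, t))

stack, buffer, arcs = ["news"], ["had"], set()
left_arc(stack, buffer, arcs, "SBJ")
# now stack == [] and arcs == {("had", "SBJ", "news")}
```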

Final overview of Tree-stack LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 86 123

Overview

1 Introduction
  Overview of Dependency Parsing
  Transition Based Dependency Parsing

2 Related Work
  Linear Models and their Drawbacks
  Neural Network Models

3 Model
  Language Model
  MLP Parser
  Tree-stack LSTM Parser

4 Results
  MLP vs Tree-stack LSTM
  Morphological Feature Embeddings
  Static vs Dynamic Oracle Training
  Transfer Learning

5 Conclusion
6 Future Work & Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 87 123

4 Results & Comparisons

Omer Kırnap (Koc University) MSc Thesis September 27 2018 88 123

Results & Comparisons

Dataset

CoNLL17
  Dependency parsing of 81 treebanks in 49 languages
  All treebanks use standardized annotation
    17 universal part-of-speech tags
    37 universal dependency relations
  Koc-University ranked 7th out of 33 participants (1st among transition based parsers)

CoNLL18
  Dependency parsing of 82 treebanks in 57 languages
  All treebanks use standardized annotation
    17 universal part-of-speech tags
    37 universal dependency relations
  Koc-University ranked 16th out of 30 participants (2nd among transition based parsers)

Differences between CoNLL17 and CoNLL18: 1 Train/test split change 2 Annotation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 89 123

MLP vs Tree-stack LSTM

The CoNLL 2018 committee released comparison results of CoNLL17 and CoNLL18 systems tested on the same test sets

Omer Kırnap (Koc University) MSc Thesis September 27 2018 90 123

MLP vs Tree-stack LSTM

Two possible problems with the official comparison:

1 If the annotation of the treebank has been improved, the older parser is handicapped

2 If the training-test split has changed and the old training data are now in the test data, the old parser is favored undeservedly

Omer Kırnap (Koc University) MSc Thesis September 27 2018 91 123

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code        MLP    Tree-stack
ru taiga (10k)   58.89  60.55
hu szeged (20k)  66.21  68.18
tr imst (50k)    56.78  58.75
ar padt (120k)   67.83  68.14
en ewt (205k)    74.87  75.77
cs cac (473k)    83.39  83.57

Tree-stack LSTM outperforms MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 92 123

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 93 123

MLP Parser

MLP

Figure Initial model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 94 123

Only Action LSTM

LSTM LSTM

Figure Only action LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 95 123

Only β-LSTM

LSTM LSTM LSTM

LSTM LSTM MLP

Figure Only β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 96 123

Only σ-LSTM

LSTM LSTM

LSTM LSTM MLP

Figure Only σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang Code   MLP    Only Action  Only-β  Only-σ
hu szeged   66.21  66.87        66.94   67.03
sv lines    71.12  72.05        72.17   72.45
tr imst     57.12  56.87        57.02   57.12
ar padt     67.83  66.67        66.89   66.92

cs cac      83.89  82.23        83.13   83.17

en ewt      75.54  75.43        75.56   75.67

Table Comparison between MLP and "Only" models

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 99 123

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang Code            without t-RNN  with t-RNN
no nynorsklia (3k)   51.78          53.33
ru taiga (11k)       59.13          60.55
gl treegal (15k)     69.76          70.45
hu szeged (20k)      66.12          68.18
sv lines (49k)       74.04          75.46
tr imst (50k)        58.12          58.75
ar padt (120k)       68.04          68.14
en ewt (204k)        74.87          75.77
cs cac (473k)        82.89          83.57
cs pdt (1M)          81.17          81.164

t-RNN provides a comparative advantage for low-resource languages

Omer Kırnap (Koc University) MSc Thesis September 27 2018 100 123

Ablation Analysis

Overall results of ablation analysis

Lang       MLP    Only A  Only-β  Only-σ  w/o t-RNN  all
hu szeged  66.21  66.87   66.94   67.03   66.12      68.18
sv lines   71.12  72.05   72.17   74.04   72.17      75.46
tr imst    57.12  56.87   57.02   57.12   58.12      58.75
ar padt    67.83  66.67   66.89   66.92   68.04      68.14
cs cac     83.89  82.23   83.13   83.17   82.89      83.57
en ewt     75.54  75.43   75.56   75.67   74.87      75.77

Tree-stack LSTM beats other model variations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of Ablation Experiments

t-RNN's performance contribution increases as the training size decreases

σ-LSTM provides more useful information independent of dataset size

Interconnecting the model's components with t-RNN makes tree-stack LSTM more powerful for low-resource languages (ranked 10th overall and 2nd among transition based parsers)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What does Morphological Feature Embedding provide

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental Settings
We divide the CoNLL18 UD 2.2 dataset into 4 parts based on the number of training tokens for each language to better understand our contributions:

Languages having less than 20k tokens

Languages having more than 20k and less than 50k tokens

Languages having more than 50k and less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having less than 20k training tokens

Lang code      Morph-Feats  no Morph-Feats  # of tokens
no nynorsklia  51.13        53.33           3,583
ru taiga       58.32        60.55           10,479
sme giella     52.78        53.39           16,385
la perseus     49.93        51.60           18,184
ug udt         52.78        53.39           19,262
sl sst         46.72        48.77           19,473
hu szeged      66.23        68.18           20,166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having between 50k and 100k tokens

Lang code      Morph-Feats  no Morph-Feats  # of tokens
sv lines       72.18        74.81           48,325
fr sequoia     84.36        82.17           50,543
en gum         76.44        75.34           53,686
ko gsd         73.74        72.54           56,687
eu bdt         74.55        73.32           72,974
nl lassysmall  76.70        75.80           75,134
gl ctg         79.02        79.018          79,327
lv lvtb        72.33        72.24           80,666
id gsd         75.76        73.97           97,531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having more than 100k training tokens

Lang code  Morph-Feats  no Morph-Feats  # of tokens
fa seraji  81.18        81.12           121,064
bg btb     84.53        84.55           124,336
en ewt     75.77        75.682          204,585
ar padt    68.02        68.14           223,881
de gsd     71.59        71.32           263,804
ca ancora  85.89        85.874          417,587
es ancora  84.99        84.78           444,617
cs cac     83.57        83.63           472,608
cs pdt     81.43        82.12           1,173,282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle: transitions using gold moves
Dynamic oracle: transitions using predicted moves

In both cases the log-probability of gold moves is maximized

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123
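The difference between the two regimes can be sketched as a single decision per step: the loss always maximizes log p of the gold move, but the move used to advance the parser differs. This is a schematic sketch only (the `scores` dict and `explore` rate are stand-ins, not the thesis's exact schedule):

```python
import random

def next_move(gold_move, scores, dynamic, explore=0.9):
    # `scores` stands in for the MLP's output over transitions.
    # Static oracle: always advance with the gold move.
    # Dynamic oracle: sometimes advance with the model's prediction,
    # while the loss still targets the gold move.
    if dynamic and random.random() < explore:
        return max(scores, key=scores.get)   # follow the model's prediction
    return gold_move                         # follow the gold derivation

random.seed(0)
static_move = next_move("shift", {"shift": 0.1, "left": 0.9}, dynamic=False)
dynamic_move = next_move("shift", {"shift": 0.1, "left": 0.9}, dynamic=True, explore=1.0)
```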

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning:
1 Using very limited data to train an LM for word and context vectors, then using them to train a parser from scratch
2 Using Facebook's word vectors to train a parser [Bojanowski et al 2017]
3 Using my own word and context vectors trained on a different language from the same language family
4 Applying transfer learning with a pre-trained parser

Language      (1)           (2)    (3)    (4)
af afribooms  not provided  75.46  77.43  78.12
kk ktb        20.19         22.31  21.96  23.86
bxr bdt       7.64          9.76   9.93   8.98
kmr mg        20.12         22.57  22.78  23.39

Table LAS values for strategies (1), (2), (3), and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the most beneficial

From-scratch LM training does not yield useful word and context vectors

Our word and context vectors are still more useful than Facebook's [Bojanowski et al 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

A transition-based parser can only build projective trees 6

6Figure fromhttpstplingfiluuse sarakurser5LN455-2014lectures5LN455-F8pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123
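Projectivity can be checked directly from the head indices: a tree is projective iff no two arcs cross. A small sketch (the `heads` encoding, with index 0 as an unused root placeholder, is an assumption for illustration):

```python
def is_projective(heads):
    # heads[i] is the head index of token i; tokens are 1..n and
    # heads[0] is an unused placeholder for the artificial root.
    arcs = [(min(h, d), max(h, d)) for d, h in enumerate(heads[1:], start=1)]
    for (l1, r1) in arcs:
        for (l2, r2) in arcs:
            if l1 < l2 < r1 < r2:   # strictly interleaved spans cross
                return False
    return True
```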

Projective vs Non-projective

We compared our model with the best model for different projectivity ratios

Language     Projectivity  Best (LAS)  Our (LAS)
grc perseus  90.7          79.39       55.03 (20)
eu bdt       95.13         84.22       74.13 (17)
hu szeged    97.8          82.66       68.18 (14)
da ddt       98.26         86.28       76.40 (17)
en gum       99.6          85.05       76.44 (15)
gl treegal   100           74.25       70.45 (10)
gl ctg       100           82.12       79.45 (14)

Table Our model's performance gap decreases as the projectivity ratio increases

7

7 From official results page and our projectivity table
Omer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusion:
We introduced "Context, Word, and Morph-feat" embeddings and showed their contribution in transition based dependency parsing

Our Tree-stack LSTM outperformed the MLP by removing hand-crafted feature engineering

Tree-stack LSTM performed better on low-resource languages

When the training dataset size increases, tree-stack LSTM loses its advantage

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization, morphological tagging, and dependency parsing performed better. Some also jointly train a language model together with pre-trained embeddings

Attention Mechanism

Applying attention across the σ-LSTM, β-LSTM, or Action-LSTM states may bring a performance improvement

Morphological Features

Finding a different way to represent morphological features

Dynamic Oracle vs Beam Training

Although I tried both of them, I could not obtain a performance improvement. There may be convergence problems with our loss function, and other losses (e.g., CRF) may solve this problem

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap Erenay Dayanık and Deniz Yuret 2018 Tree-stackLSTM in Transition Based Dependency Parsing In Proceedings ofthe CoNLL 2018 Shared Task Multilingual Parsing from Raw Text toUniversal Dependencies

Omer Kırnap Berkay Furkan Onder and Deniz Yuret 2017 Parsingwith Context Embeddings In Proceedings of the CoNLL 2017 SharedTask Multilingual Parsing from Raw Text to Universal Dependencies

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann, Carlos Gómez-Rodríguez, and Giorgio Satta. 2011. Dynamic programming algorithms for transition-based dependency parsers. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Volume 1. Association for Computational Linguistics, pages 673–682.

S. Kübler, R. McDonald, and J. Nivre. 2009. Dependency Parsing. Morgan & Claypool, US.

Chris Dyer, Miguel Ballesteros, Wang Ling, Austin Matthews, and Noah A. Smith. 2015. Transition-based dependency parsing with stack long short-term memory. CoRR abs/1505.08075.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123


Problem Definition

Find a model that learns to decide the correct transition from the current state

Omer Kırnap (Koc University) MSc Thesis September 27 2018 24 123

2 Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 25 123

Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 26 123

Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 27 123

Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 28 123

Related Work

Neural Networks for Feature Conjunctions

Neural networks can handle feature conjunctions and nonlinearity.
However, they are impractical for high-dimensional inputs: they scale linearly in input dimensions (in both time and space, assuming a fixed number of hidden units)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 29 123

Related Work

Solution: Using dense embeddings for input features

Omer Kırnap (Koc University) MSc Thesis September 27 2018 30 123
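The contrast can be sketched in a few lines: a one-hot feature vector grows with the vocabulary, while a dense embedding lookup stays a fixed small size. The tag set and dimension below are toy stand-ins:

```python
import numpy as np

POS_TAGS = {"NOUN": 0, "VERB": 1, "ADJ": 2}   # tiny toy vocabulary
rng = np.random.default_rng(2)
E = rng.normal(size=(len(POS_TAGS), 4))       # 4-dim dense embeddings (learned in practice)

def one_hot(tag):
    # Sparse representation: dimension grows with the vocabulary.
    v = np.zeros(len(POS_TAGS))
    v[POS_TAGS[tag]] = 1.0
    return v

def dense(tag):
    # Dense representation: fixed small size, independent of vocabulary.
    return E[POS_TAGS[tag]]
```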

Overview

1 Introduction
  Overview of Dependency Parsing
  Transition Based Dependency Parsing

2 Related Work
  Linear Models and their Drawbacks
  Neural Network Models

3 Model
  Language Model
  MLP Parser
  Tree-stack LSTM Parser

4 Results
  MLP vs Tree-stack LSTM
  Morphological Feature Embeddings
  Static vs Dynamic Oracle Training
  Transfer Learning

5 Conclusion
6 Future Work & Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 31 123

3 Model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 32 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to Universal Dependencies

CoNLL17
• Koc-University team with MLP Parser using Context Embeddings

CoNLL18
• KParse team with Tree-stack LSTM Parser using Context and Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 33 123


a Language Model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 35 123

Language Model (LM)

The LM is used to obtain context and word embeddings, with two components:

Character Based LSTM extracts word vectors

Word Based BiLSTM extracts context vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 36 123

Language Model - Word vectors

Character-based LSTM generates word vectors

Figure Character LSTM from Kırnap et al 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 37 123

Language Model - Context Vectors

Word-based BiLSTM generates context vectors

Figure Word BiLSTM from Kırnap et al 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 38 123

b MLP Parser (CoNLL17)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 39 123

MLP Parser

MLP Parser consists of 4 components

Character Based LSTM extracts word vectors

Word Based BiLSTM extracts context vectors

Feature extractor describes current state

Decision module (MLP) decides the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 40 123

MLP Parser - Feature Extraction

Feature extractor describes current state

Figure Kırnap et al 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 41 123

MLP Parser - Decision Module

Decision module (MLP) decides the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 42 123

Experiments amp Dataset (MLP) CoNLL17

CoNLL17 Dataset

Dependency parsing of 81 treebanks in 49 languages

All treebanks use standardized annotation
• 17 universal part-of-speech tags
• 37 universal dependency relations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 43 123

Experiments - Evaluation Metric

Labeled Attachment Score (LAS): the percentage of words assigned both the correct syntactic head and the correct dependency label

Example for "Economic news had":
Gold tree (arcs SBJ, ATT): LAS 1
Pred 1 (arcs PRED, OBJ): LAS 0
Pred 2 (arcs OBJ, ATT): LAS (1/2) · 100 = 50

Omer Kırnap (Koc University) MSc Thesis September 27 2018 44 123
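The metric is a per-word comparison of (head, label) pairs; a small sketch reproducing the slide's numbers (the arc labels are the slide's, the encoding is an illustrative assumption):

```python
def las(gold, pred):
    # gold/pred hold one (head, label) pair per word.
    correct = sum(g == p for g, p in zip(gold, pred))
    return 100.0 * correct / len(gold)

gold = [("had", "SBJ"), ("news", "ATT")]    # news→had (SBJ), Economic→news (ATT)
pred1 = [("news", "PRED"), ("had", "OBJ")]  # both attachments wrong
pred2 = [("had", "SBJ"), ("news", "OBJ")]   # head and label right for one word
# las(gold, pred1) → 0.0, las(gold, pred2) → 50.0
```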

Experiments (MLP)

CoNLL 2017 Results (all treebanks LAS)

Ranked 1st among transition based parsers 5

5 Source: CoNLL17 official results page
Omer Kırnap (Koc University) MSc Thesis September 27 2018 45 123

Contributions in CoNLL17

Omer Kırnap (Koc University) MSc Thesis September 27 2018 46 123

Context and Word Embeddings

Relative contributions of part-of-speech (p) word vector (v)context vector (c)

Feats Hungarian En-ParTUT Latvianp 636 766 559

v 735 759 63

c 722 76 635

v-c 76 79 676

p-c 78 825 706

p-v 766 808 677

p-fb 747 797 663

p-v-c 793 832 742

Omer Kırnap (Koc University) MSc Thesis September 27 2018 47 123

Context and Word Embeddings

Relative contributions of part-of-speech (p) word vector (v)context vector (c)

Feats Hungarian En-ParTUT Latvianp 636 766 559

v 735 759 63

c 722 76 635

v-c 76 79 676

p-c 78 825 706

p-v 766 808 677

p-fb 747 797 663

p-v-c 793 832 742

Context vectors provide independent contribution on top ofPOS tags

Omer Kırnap (Koc University) MSc Thesis September 27 2018 48 123

Context and Word embeddings

Feats Hungarian En-ParTUT Latvianp 636 766 559

v 735 759 63

c 722 76 635

v-c 76 79 676

p-c 78 825 706

p-v 766 808 677p-fb 747 797 663

p-v-c 793 832 742

Our BiLSTM language model word vectors perform betterthan FB vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 49 123

Context and Word embeddings

Feats Hungarian En-ParTUT Latvianp 636 766 559

v 735 759 63

c 722 76 635

v-c 76 79 676

p-c 78 825 706

p-v 766 808 677

p-fb 747 797 663

p-v-c 793 832 742

Both POS tags and context vectors have significantcontributions on top of word vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 50 123

Issues with MLP

However

Choosing correct state of parser still remains critical

We are unable to represent whole parsing history with featureextracting

Omer Kırnap (Koc University) MSc Thesis September 27 2018 51 123

Solution

Find a recurrent architecture such that it can summarize the parsinghistory as well as word sequences in a buffer and stack

Omer Kırnap (Koc University) MSc Thesis September 27 2018 52 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to UniversalDependencies

CoNLL17

bull Koc-University team with MLP Parser using Context Embeddings

CoNLL18

bull KParse team with Tree-stack LSTM Parser usingContext and Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 53 123

c Tree-stack LSTM Parser (CoNLL18)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 54 123

Related Work - Stack LSTM

Figure Stack LSTM [Dyer et al 2015]

Represent each component (σ β A) with an LSTMModifying head wordrsquos embedding with dependent embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 55 123

Problems with Stack LSTM

They only modify stackrsquos word embeddings

Hidden states of LSTMS are not updated unless reduce

Actions are not explicitly represented

They only used word2vec embeddings [Mikolov et al 2013]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 56 123

Our solution

We propose

Context embeddings should improve parsing accuracy

Dependency relations should be explicitly represented

Morphological Features of a word may enhance parsing accuracy

Omer Kırnap (Koc University) MSc Thesis September 27 2018 57 123

Tree-stack LSTM Overview

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

We propose Tree-stack LSTM model with 4 components

β-LSTMσ-LSTMAction-LSTMTree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 58 123

Tree-stack LSTM

Input Representation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 59 123

Input Representation

Action and Dependency Relation Embeddings

Every action is represented with continuous vector

Every dependency relation is represented with continuous vector

Omer Kırnap (Koc University) MSc Thesis September 27 2018 60 123

Input Representation

We do not include explicit feature extractor We initiated wordrepresentation by concatenating

Character Based LSTMrsquos word vectors

Word Based BiLSTMrsquos context vectors

Part-of-speech (POS) vectors

Morph-feat vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 61 123

Input Representation

Morp-feat Vectors

Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs IT It

Figure Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 62 123

Tree-stack LSTM

Model Components1 β-LSTM2 σ-LSTM3 Action-LSTM4 Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 63 123

β-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 64 123

β-LSTM

LSTM LSTM LSTM

wi+2wi+1wi

Figure Bufferrsquos β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 65 123

σ-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 66 123

σ-LSTM

LSTM LSTM LSTM

si si+1 si+2

Figure Stackrsquos σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 67 123

Action-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 68 123

Action-LSTM

LSTM LSTM LSTM

Figure Action-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 69 123

How do components of tree-stack LSTM are connected

Omer Kırnap (Koc University) MSc Thesis September 27 2018 70 123

Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 71 123

Tree-RNN (t-RNN)

t-RNN

Dependent word

Dependency Relation

Head word

Figure t-RNN

whead new = tanh(Wrnn lowast [whead old dl wdep] + brnn) (1)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 72 123

Tree-RNN with

1 Left Transition2 Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 73 123

Left Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 74 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM LSTM

Left transition

t-RNN

Dependency Relation

HeadDependent

Figure Each embedding initiated by concatenating POS language andmorph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 75 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

Figure Stackrsquos top LSTM is reducedOmer Kırnap (Koc University) MSc Thesis September 27 2018 76 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 77 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure β-LSTM recalculates its hidden based on new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 78 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure Tree-stack LSTM is ready to give new transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 79 123

Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 80 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure Each embedding initiated by concatenating POS language andmorph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 81 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure Stack's top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 82 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 83 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure σ-LSTM recalculates its hidden state from the new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 84 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure Tree-stack LSTM is ready to predict the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 85 123
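Symmetrically, the right-arc update right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)}) pops the stack top and attaches it to the element below it. A sketch with token indices, embedding updates again omitted:

```python
def right_arc(stack, buffer, arcs, d):
    # right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)}):
    # the stack top t becomes a dependent of the element s below it; t is popped.
    t = stack.pop()      # dependent, removed from the stack
    s = stack[-1]        # head, stays on the stack
    arcs.add((s, d, t))  # arc (head, label, dependent)
    return stack, buffer, arcs

stack, buffer, arcs = [0, 1, 4], [5], set()
stack, buffer, arcs = right_arc(stack, buffer, arcs, "obj")
```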

Final overview of Tree-stack LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 86 123
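A minimal sketch of the final scoring step in the overview above: the σ-, β- and action-LSTM summaries are concatenated and fed to an MLP that outputs a distribution over candidate transitions. All sizes below are assumptions for illustration.

```python
import numpy as np

def score_transitions(h_sigma, h_beta, h_action, params):
    # concatenate the per-component summaries, then a one-hidden-layer MLP
    x = np.concatenate([h_sigma, h_beta, h_action])
    W1, b1, W2, b2 = params
    h = np.tanh(W1 @ x + b1)
    logits = W2 @ h + b2
    e = np.exp(logits - logits.max())  # stable softmax
    return e / e.sum()                 # probability over candidate transitions

rng = np.random.default_rng(1)
params = (rng.normal(size=(8, 12)), np.zeros(8),   # 12 = three summaries of size 4
          rng.normal(size=(3, 8)), np.zeros(3))    # 3 candidate transitions
p = score_transitions(rng.normal(size=4), rng.normal(size=4),
                      rng.normal(size=4), params)
```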

Overview

1 Introduction
  Overview of Dependency Parsing
  Transition Based Dependency Parsing

2 Related Work
  Linear Models and their Drawbacks
  Neural Network Models

3 Model
  Language Model
  MLP Parser
  Tree-stack LSTM Parser

4 Results
  MLP vs Tree-stack LSTM
  Morphological Feature Embeddings
  Static vs Dynamic Oracle Training
  Transfer Learning

5 Conclusion
6 Future Work & Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 87 123

4 Results & Comparisons

Omer Kırnap (Koc University) MSc Thesis September 27 2018 88 123

Results & Comparisons

Dataset

CoNLL17:
  Dependency parsing of 81 treebanks in 49 languages
  All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations
  Koc-University ranked 7th out of 33 participants (1st among transition based parsers)

CoNLL18:
  Dependency parsing of 82 treebanks in 57 languages
  All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations
  Koc-University ranked 16th out of 30 participants (2nd among transition based parsers)

Differences between CoNLL17 and CoNLL18:
1 Train/test split change
2 Annotation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 89 123

MLP vs Tree-stack LSTM

The CoNLL 2018 committee released comparison results of CoNLL17 and CoNLL18 systems tested on the same test sets.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 90 123

MLP vs Tree-stack LSTM

Two possible problems with the official comparison:

1 If the annotation of the treebank is improved, the older parser is handicapped

2 If the training-test split has changed and the old training data are now in the test data, the old parser is favored undeservedly

Omer Kırnap (Koc University) MSc Thesis September 27 2018 91 123

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code        MLP    Tree-stack
ru taiga (10k)   58.89  60.55
hu szeged (20k)  66.21  68.18
tr imst (50k)    56.78  58.75
ar padt (120k)   67.83  68.14
en ewt (205k)    74.87  75.77
cs cac (473k)    83.39  83.57

Tree-stack LSTM outperforms MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 92 123
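The scores in these tables are Labeled Attachment Scores (LAS): the percentage of words that receive both the correct syntactic head and the correct dependency label. A sketch of the computation:

```python
def las(gold, pred):
    # Labeled Attachment Score: fraction of words whose predicted
    # (head, label) pair exactly matches the gold one, as a percentage.
    assert len(gold) == len(pred)
    correct = sum(1 for g, p in zip(gold, pred) if g == p)
    return 100.0 * correct / len(gold)

# (head, label) per word: one of three words gets a wrong label
gold = [(2, "amod"), (3, "nsubj"), (0, "root")]
pred = [(2, "amod"), (3, "obj"),   (0, "root")]
score = las(gold, pred)   # 2 of 3 words fully correct
```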

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 93 123

MLP Parser

MLP

Figure Initial model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 94 123

Only Action LSTM

LSTM LSTM

Figure Only action LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 95 123

Only β-LSTM

LSTM LSTM LSTM

LSTM LSTM MLP

Figure Only β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 96 123

Only σ-LSTM

LSTM LSTM

LSTM LSTM MLP

Figure Only σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang Code   MLP    Only Action  Only-β  Only-σ
hu szeged   66.21  66.87        66.94   67.03
sv lines    71.12  72.05        72.17   72.45
tr imst     57.12  56.87        57.02   57.12
ar padt     67.83  66.67        66.89   66.92
cs cac      83.89  82.23        83.13   83.17
en ewt      75.54  75.43        75.56   75.67

Table Comparison between MLP and "Only" models

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 99 123

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang Code           without t-RNN  with t-RNN
no nynorsklia (3k)  51.78          53.33
ru taiga (11k)      59.13          60.55
gl treegal (15k)    69.76          70.45
hu szeged (20k)     66.12          68.18
sv lines (49k)      74.04          75.46
tr imst (50k)       58.12          58.75
ar padt (120k)      68.04          68.14
en ewt (204k)       74.87          75.77
cs cac (473k)       82.89          83.57
cs pdt (1M)         81.17          81.164

t-RNN provides a comparative advantage for low-resource languages

Omer Kırnap (Koc University) MSc Thesis September 27 2018 100 123

Ablation Analysis

Overall results of ablation analysis

Lang        MLP    Only A  Only-β  Only-σ  w/o t-RNN  all
hu szeged   66.21  66.87   66.94   67.03   66.12      68.18
sv lines    71.12  72.05   72.17   74.04   72.17      75.46
tr imst     57.12  56.87   57.02   57.12   58.12      58.75
ar padt     67.83  66.67   66.89   66.92   68.04      68.14
cs cac      83.89  82.23   83.13   83.17   82.89      83.57
en ewt      75.54  75.43   75.56   75.67   74.87      75.77

Tree-stack LSTM beats other model variations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of Ablation Experiments

t-RNN's performance contribution increases when the training size decreases

σ-LSTM provides more useful information, independent of dataset size

Interconnecting the model's components with t-RNN makes tree-stack LSTM more powerful for low-resource languages (ranked 10th overall and 2nd among transition based parsers)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What does Morphological Feature Embedding provide

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123
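As a concrete reminder, a UD FEATS string such as Case=Nom|Gender=Neut|Number=Sing is mapped to a fixed-size morph-feat vector. One simple scheme (a sketch only; not necessarily the exact composition used in the thesis) sums one learned embedding per key=value pair:

```python
import numpy as np

def morph_feat_vector(feats, table, dim=4):
    # Sum one embedding per key=value pair of a UD FEATS string.
    # `table` maps feature strings to vectors; unseen features get zeros.
    vec = np.zeros(dim)
    if feats == "_":                     # UD marks "no features" with "_"
        return vec
    for pair in feats.split("|"):
        vec += table.get(pair, np.zeros(dim))
    return vec

# toy embedding table (an assumption for illustration)
table = {"Case=Nom": np.ones(4), "Number=Sing": 2 * np.ones(4)}
v = morph_feat_vector("Case=Nom|Number=Sing", table)
```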

Contribution of Morph-feat Embeddings

Experimental Settings

We divide the CoNLL18 UD v2.2 dataset into 4 parts based on the number of training tokens for each language, to better understand our contributions:

Languages having less than 20k tokens

Languages having more than 20k less than 50k tokens

Languages having more than 50k less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having less than 20k training tokens

Lang code      Morph-Feats  no Morph-Feats  # of tokens
no nynorsklia  51.13        53.33           3,583
ru taiga       58.32        60.55           10,479
sme giella     52.78        53.39           16,385
la perseus     49.93        51.6            18,184
ug udt         52.78        53.39           19,262
sl sst         46.72        48.77           19,473
hu szeged      66.23        68.18           20,166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having between 50k and 100k training tokens

Lang code      Morph-Feats  no Morph-Feats  # of tokens
sv lines       72.18        74.81           48,325
fr sequoia     84.36        82.17           50,543
en gum         76.44        75.34           53,686
ko gsd         73.74        72.54           56,687
eu bdt         74.55        73.32           72,974
nl lassysmall  76.7         75.8            75,134
gl ctg         79.02        79.018          79,327
lv lvtb        72.33        72.24           80,666
id gsd         75.76        73.97           97,531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having more than 100k training tokens

Lang code  Morph-Feats  no Morph-Feats  # of tokens
fa seraji  81.18        81.12           121,064
bg btb     84.53        84.55           124,336
en ewt     75.77        75.682          204,585
ar padt    68.02        68.14           223,881
de gsd     71.59        71.32           263,804
ca ancora  85.89        85.874          417,587
es ancora  84.99        84.78           444,617
cs cac     83.57        83.63           472,608
cs pdt     81.43        82.12           1,173,282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle: transitions using gold moves
Dynamic oracle: transitions using predicted moves

In both cases, log p of gold moves is maximized

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123
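The two regimes can be contrasted in one schematic training step (assumed interfaces, not the thesis's actual training code): both maximize log p of the gold move, but the dynamic oracle sometimes executes the model's own prediction to generate the next training configuration.

```python
import math
import random

def train_step(gold_move, probs, dynamic, explore=0.1):
    # Both oracles maximize log p(gold move); they differ only in which
    # move is *executed* to reach the next training configuration.
    loss = -math.log(probs[gold_move])
    if dynamic and random.random() < explore:
        executed = max(probs, key=probs.get)  # follow the model (dynamic oracle)
    else:
        executed = gold_move                  # follow the gold derivation (static)
    return loss, executed

probs = {"shift": 0.7, "left-nsubj": 0.2, "right-obj": 0.1}
loss, move = train_step("shift", probs, dynamic=False)
```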

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning:

1 Using very limited data to train an LM for word and context vectors, and using them to train a parser from scratch
2 Using Facebook's word vectors to train a parser [Bojanowski et al 2017]
3 Using my own word and context vectors trained on a different language from the same language family
4 Applying transfer learning with a pre-trained parser

Language      (1)           (2)    (3)    (4)
af afribooms  not provided  75.46  77.43  78.12
kk ktb        20.19         22.31  21.96  23.86
bxr bdt       7.64          9.76   9.93   8.98
kmr mg        20.12         22.57  22.78  23.39

Table LAS values for strategies (1), (2), (3) and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the most beneficial

Training an LM from scratch on very limited data does not yield useful word and context vectors

Our word and context vectors are still more useful than Facebook's [Bojanowski et al 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

A transition based parser can only build projective trees 6

6 Figure from https://stp.lingfil.uu.se/~sara/kurser/5LN455-2014/lectures/5LN455-F8.pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123
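Projectivity can be checked directly: a tree is projective iff no two arcs cross when drawn above the sentence. A sketch, with heads[i] giving the head of 1-based word i and 0 denoting the artificial root:

```python
def is_projective(heads):
    # heads[i] = head of word i+1 (words are 1-based, 0 is the root).
    # A dependency tree is projective iff no two arcs cross.
    arcs = [(min(h, d), max(h, d)) for d, h in enumerate(heads, start=1)]
    for l1, r1 in arcs:
        for l2, r2 in arcs:
            if l1 < l2 < r1 < r2:   # strictly interleaved endpoints: crossing
                return False
    return True

projective = is_projective([2, 0, 2])      # simple tree, no crossing arcs
crossing = is_projective([3, 4, 0, 3])     # arcs (1,3) and (2,4) cross
```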

Projective vs Non-projective

We compared our model with the best model for different projectivity ratios

Language     Projectivity  Best (LAS)  Our (LAS)
grc perseus  90.7          79.39       55.03 (20)
eu bdt       95.13         84.22       74.13 (17)
hu szeged    97.8          82.66       68.18 (14)
da ddt       98.26         86.28       76.40 (17)
en gum       99.6          85.05       76.44 (15)
gl treegal   100           74.25       70.45 (10)
gl ctg       100           82.12       79.45 (14)

Table Our model's performance gap decreases as the projectivity ratio increases

7

7 From official results page and our projectivity table
Omer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusion:

We introduced "Context, Word and Morph-feat" embeddings and showed their contribution in transition based dependency parsing

Our Tree-stack LSTM outperformed the MLP while removing hand-crafted feature engineering

Tree-stack LSTM performed better with low-resource languages

When the training dataset size increases, tree-stack LSTM loses its advantage

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization, morphological tagging and dependency parsing performed better. Some also jointly train a language model together with pre-trained embeddings.

Attention Mechanism

Applying attention between σ-LSTM, β-LSTM or Action-LSTM states may bring a performance improvement.

Morphological Features

Finding a different way to represent morphological features.

Dynamic Oracle vs Beam Training

Although I tried both of them, I could not obtain a performance improvement. There may be convergence problems with our loss function, and other losses (e.g., CRF) may solve this problem.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap, Erenay Dayanık and Deniz Yuret. 2018. Tree-stack LSTM in Transition Based Dependency Parsing. In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap, Berkay Furkan Onder and Deniz Yuret. 2017. Parsing with Context Embeddings. In Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann, Carlos Gómez-Rodríguez and Giorgio Satta. 2011. Dynamic programming algorithms for transition-based dependency parsers. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Volume 1. Association for Computational Linguistics, pages 673–682.

S. Kübler, R. McDonald and J. Nivre. 2009. Dependency Parsing. Morgan & Claypool, US.

Chris Dyer, Miguel Ballesteros, Wang Ling, Austin Matthews and Noah A. Smith. 2015. Transition-based dependency parsing with stack long short-term memory. CoRR abs/1505.08075.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123

Tree-RNN (t-RNN)

t-RNN

Dependent word

Dependency Relation

Head word

Figure t-RNN

whead new = tanh(Wrnn lowast [whead old dl wdep] + brnn) (1)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 72 123

Tree-RNN with

1 Left Transition2 Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 73 123

Left Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 74 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM LSTM

Left transition

t-RNN

Dependency Relation

HeadDependent

Figure Each embedding initiated by concatenating POS language andmorph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 75 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

Figure Stackrsquos top LSTM is reducedOmer Kırnap (Koc University) MSc Thesis September 27 2018 76 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 77 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure β-LSTM recalculates its hidden based on new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 78 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure Tree-stack LSTM is ready to give new transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 79 123

Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 80 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure Each embedding initiated by concatenating POS language andmorph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 81 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure Stackrsquos top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 82 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 83 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure σ-LSTM recalculates its hidden from new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 84 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure Tree-stack LSTM is ready to give new transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 85 123

Final overview of Tree-stack LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 86 123

Overview

1 IntroductionOverview of Dependency ParsingTransition Based Dependency Parsing

2 Related WorkLinear Models and their DrawbacksNeural Network Models

3 ModelLanguage ModelMLP ParserTree-stack LSTM Parser

4 ResultsMLP vs Tree-stack LSTMMorphological Feature EmbeddingsStatic vs Dynamic Oracle TrainingTransfer Learning

5 Conclusion6 Future Work amp Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 87 123

4 Results amp Comparisons

Omer Kırnap (Koc University) MSc Thesis September 27 2018 88 123

Results amp Comparisons

Dataset

Dependency parsing of 81

treebanks in 49 languages

All treebanks use standardized

annotation

17 universal

part-of-speech tags

37 universal dependency

relations

Koc-University ranked 7th out

of 33 participants (1st among

transition based parsers)

Dependency parsing of 82

treebanks in 57 languages

All treebanks use

standardized annotation

17 universal

part-of-speech tags

37 universal dependency

relations

Koc-University ranked 16th

out of 30 participants (2nd

among transition based

parsers)

CoNLL17 CoNLL181 Traintest split change 2 Annotation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 89 123

MLP vs Tree-stack LSTM

CoNLL 2018 committee released comparison results of CoNLL17 andCoNLL18 systems tested under the same test sets

Omer Kırnap (Koc University) MSc Thesis September 27 2018 90 123

MLP vs Tree-stack LSTM

2 possible problems of official comparison

1 If the annotation of the tree bank is improved the older parser ishandicapped

2 If the training-test split has changed and the old training data arenow in test data the old parser is favored undeservedly

Omer Kırnap (Koc University) MSc Thesis September 27 2018 91 123

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code MLP Tree-stackru taiga (10k) 5889 6055hu szeged (20k) 6621 6818tr imst (50k) 5678 5875ar padt (120k) 6783 6814en ewt (205k) 7487 7577cs cac (473k) 8339 8357

Tree-stack LSTM outperforms MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 92 123

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 93 123

MLP Parser

MLP

Figure Initial model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 94 123

Only Action LSTM

LSTM LSTM

Figure Only action LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 95 123

Only β-LSTM

LSTM LSTM LSTM

LSTM LSTM MLP

Figure Only β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 96 123

Only σ-LSTM

LSTM LSTM

LSTM LSTM MLP

Figure Only σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang Code MLP Only Action Only-β Only-σhu szeged 6621 6687 6694 6703sv lines 7112 7205 7217 7245tr imst 5712 5687 5702 5712ar padt 6783 6667 6689 6692

cs cac 8389 8223 8313 8317

en ewt 7554 7543 7556 7567

Table Comparison between MLP and rdquoOnlyrdquo models

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 99 123

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang Code without t-RNN with t-RNNno nynorsklia (3k) 5178 5333ru taiga (11k) 5913 6055gl treegal (15k) 6976 7045hu szeged (20k) 6612 6818sv lines (49k) 7404 7546tr imst (50k) 5812 5875

ar padt (120k) 6804 6814

en ewt (204k) 7487 7577

cs cac (473k) 8289 8357

cs pdt (1M) 8117 81164

t-RNN provides comparative advantage for low-resourcelanguages

Omer Kırnap (Koc University) MSc Thesis September 27 2018 100 123

Ablation Analysis

Overall results of ablation analysis

Lang MLP Only A Only-β Only-σ wot-RNN allhu szeged 6621 6687 6694 6703 6612 6818sv lines 7112 7205 7217 7404 7217 7546tr imst 5712 5687 5702 5712 5812 5875ar padt 6783 6667 6689 6692 6804 6814cs cac 8389 8223 8313 8317 8289 8357en ewt 7554 7543 7556 7567 7487 7577

Tree-stack LSTM beats other model variations


Ablation Analysis

Conclusions of Ablation Experiments

t-RNN's performance contribution increases when the training size decreases

σ-LSTM provides more useful information independent of dataset size

Interconnecting the model's components with t-RNN makes tree-stack LSTM more powerful for low-resource languages (ranked 10th overall and 2nd among transition based parsers)


What does Morphological Feature Embedding provide


Contribution of Morph-feat Embeddings

Experimental Settings: we divide the CoNLL18 UD 2.2 dataset into 4 parts based on the number of training tokens for each language, to better understand our contributions

Languages having less than 20k tokens

Languages having more than 20k less than 50k tokens

Languages having more than 50k less than 100k tokens

Languages having 100k tokens or more


Contribution of Morph-feat embeddings

Morph-feat experiments for languages having less than 20k training tokens:

Lang code | Morph-Feats | no Morph-Feats | # of tokens
no nynorsklia | 51.13 | 53.33 | 3,583
ru taiga | 58.32 | 60.55 | 10,479
sme giella | 52.78 | 53.39 | 16,385
la perseus | 49.93 | 51.6 | 18,184
ug udt | 52.78 | 53.39 | 19,262
sl sst | 46.72 | 48.77 | 19,473
hu szeged | 66.23 | 68.18 | 20,166

Not useful for languages having less than 20k training tokens


Contribution of Morph-feat embeddings

Morph-feat experiments for languages having between 50k and 100k training tokens:

Lang code | Morph-Feats | no Morph-Feats | # of tokens
sv lines | 72.18 | 74.81 | 48,325
fr sequoia | 84.36 | 82.17 | 50,543
en gum | 76.44 | 75.34 | 53,686
ko gsd | 73.74 | 72.54 | 56,687
eu bdt | 74.55 | 73.32 | 72,974
nl lassysmall | 76.7 | 75.8 | 75,134
gl ctg | 79.02 | 79.018 | 79,327
lv lvtb | 72.33 | 72.24 | 80,666
id gsd | 75.76 | 73.97 | 97,531

Beneficial for languages with 50k-100k training tokens


Contribution of Morph-feat embeddings

Morph-feat experiments for languages having more than 100k training tokens:

Lang code | Morph-Feats | no Morph-Feats | # of tokens
fa seraji | 81.18 | 81.12 | 121,064
bg btb | 84.53 | 84.55 | 124,336
en ewt | 75.77 | 75.682 | 204,585
ar padt | 68.02 | 68.14 | 223,881
de gsd | 71.59 | 71.32 | 263,804
ca ancora | 85.89 | 85.874 | 417,587
es ancora | 84.99 | 84.78 | 444,617
cs cac | 83.57 | 83.63 | 472,608
cs pdt | 81.43 | 82.12 | 1,173,282

Neutral for languages having more than 100k training tokens


Static vs Dynamic Oracle Training

Static oracle: transitions follow the gold moves
Dynamic oracle: transitions follow the predicted moves

In both cases, the log-probability of the gold moves is maximized
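The difference between the two regimes is only in which move is followed during training; the loss is the negative log-probability of the gold move in both. A schematic sketch with a stub scoring model (the model and states are placeholders, not the thesis implementation):

```python
import math
import random

MOVES = ["shift", "left", "right"]

def move_probs(history):
    # Stub for the parser's scoring model: a deterministic softmax over
    # pseudo-scores that depend only on the transition history so far.
    rnd = random.Random("|".join(history))
    scores = [rnd.uniform(-1, 1) for _ in MOVES]
    z = sum(math.exp(s) for s in scores)
    return {m: math.exp(s) / z for m, s in zip(MOVES, scores)}

def sequence_loss(gold_moves, oracle):
    """Sum of -log p(gold move) along a derivation.
    oracle='static'  -> the followed move is always the gold one;
    oracle='dynamic' -> the followed move is the model's argmax prediction."""
    history, loss = [], 0.0
    for gold in gold_moves:
        p = move_probs(history)
        loss += -math.log(p[gold])            # gold move maximized either way
        followed = gold if oracle == "static" else max(p, key=p.get)
        history.append(followed)              # the two regimes diverge here
    return loss

gold = ["shift", "shift", "left", "right"]
static_loss = sequence_loss(gold, "static")
dynamic_loss = sequence_loss(gold, "dynamic")
```

The dynamic-oracle pass visits states produced by the model's own (possibly wrong) moves, so at test time the parser has seen states resembling its own errors.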

[Figure: Tree-stack LSTM architecture used in both training regimes]


Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k


Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k


Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k


How about languages with less than 20k training tokens?


Transfer Learning

There are 4 possible types of transfer learning:
1. Using very limited data to train an LM for word and context vectors, and using them to train a parser from scratch
2. Using Facebook's word vectors to train a parser [Bojanowski et al. 2017]
3. Using my own word and context vectors trained on a different language from the same language family
4. Applying transfer learning with a pre-trained parser

Language | (1) | (2) | (3) | (4)
af afribooms | not provided | 75.46 | 77.43 | 78.12
kk ktb | 20.19 | 22.31 | 21.96 | 23.86
bxr bdt | 7.64 | 9.76 | 9.93 | 8.98
kmr mg | 20.12 | 22.57 | 22.78 | 23.39

Table: LAS values for strategies (1), (2), (3) and (4)


Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the mostbeneficial

From-scratch LM training does not yield useful word and context vectors

Our word and context vectors are still more useful than Facebook's [Bojanowski et al. 2017]


Projectivity

Transition-based parsers can only build projective trees.6

6. Figure from https://stp.lingfil.uu.se/~sara/kurser/5LN455-2014/lectures/5LN455-F8.pdf


Projective vs Non-projective

We compared our model with the best model for different projectivity ratios

Language | Projectivity | Best (LAS) | Ours (LAS)
grc perseus | 90.7 | 79.39 | 55.03 (20)
eu bdt | 95.13 | 84.22 | 74.13 (17)
hu szeged | 97.8 | 82.66 | 68.18 (14)
da ddt | 98.26 | 86.28 | 76.40 (17)
en gum | 99.6 | 85.05 | 76.44 (15)
gl treegal | 100 | 74.25 | 70.45 (10)
gl ctg | 100 | 82.12 | 79.45 (14)

Table: Our model's performance gap decreases as the projectivity ratio increases 7

7. From the official results page and our projectivity table

Conclusions


Conclusion

In conclusion:
We introduced "Context, Word and Morph-feat" embeddings and showed their contribution in transition based dependency parsing

Our Tree-stack LSTM outperformed MLP by removing hand-crafted feature engineering

Tree-stack LSTM performed better with low-resource languages

As the training dataset size increases, tree-stack LSTM loses its advantage


Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization, morphological tagging and dependency parsing performed better. Some also jointly train a language model together with pre-trained embeddings.

Attention Mechanism

Applying attention between σ-LSTM, β-LSTM or Action-LSTM states may bring a performance improvement.

Morphological Features

Finding different ways to represent morphological features.

Dynamic Oracle vs Beam Training

Although I tried both of them, I could not obtain a performance improvement. There may be convergence problems with our loss function, and other losses (e.g. CRF) may solve this problem.


Publications

Omer Kırnap, Erenay Dayanık and Deniz Yuret. 2018. Tree-stack LSTM in Transition Based Dependency Parsing. In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap, Berkay Furkan Onder and Deniz Yuret. 2017. Parsing with Context Embeddings. In Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.


References

Marco Kuhlmann, Carlos Gómez-Rodríguez and Giorgio Satta. 2011. Dynamic programming algorithms for transition-based dependency parsers. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Volume 1, Association for Computational Linguistics, pages 673-682.

S. Kübler, R. McDonald and J. Nivre. 2009. Dependency Parsing. Morgan & Claypool, US.

Chris Dyer, Miguel Ballesteros, Wang Ling, Austin Matthews and Noah A. Smith. 2015. Transition-based dependency parsing with stack long short-term memory. CoRR abs/1505.08075.


Thank you for your attention


Questions




Problem Definition

Find a model that learns to decide the correct transition from the current state


2 Related Work


Related Work


Related Work


Related Work


Related Work

Neural Networks for Feature Conjunctions

Neural networks can handle feature conjunctions and nonlinearity. However, they are impractical for high-dimensional inputs: they scale linearly in input dimensions (in both time and space, assuming a fixed number of hidden units).


Related Work

Solution: use dense embeddings for input features
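The scaling argument can be made concrete: a one-hot conjunction of two categorical features grows the input dimension multiplicatively, while dense embeddings keep it additive. A minimal illustration (all sizes below are made up for the example, not from the thesis):

```python
# Sketch: input dimensionality of one-hot feature conjunctions vs dense embeddings.
# Vocabulary and embedding sizes are illustrative assumptions.
n_words, n_pos = 50_000, 17

# One-hot (word, pos) conjunction: one indicator per pair -> multiplicative blow-up.
onehot_conjunction_dim = n_words * n_pos

# Dense embeddings: look up each feature separately and concatenate -> additive.
word_dim, pos_dim = 100, 16
dense_input_dim = word_dim + pos_dim

print(onehot_conjunction_dim)  # 850000
print(dense_input_dim)         # 116
```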


Overview

1. Introduction: Overview of Dependency Parsing; Transition Based Dependency Parsing
2. Related Work: Linear Models and their Drawbacks; Neural Network Models
3. Model: Language Model; MLP Parser; Tree-stack LSTM Parser
4. Results: MLP vs Tree-stack LSTM; Morphological Feature Embeddings; Static vs Dynamic Oracle Training; Transfer Learning
5. Conclusion
6. Future Work & Discussions


3 Model


Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to Universal Dependencies:

CoNLL17:
- Koc-University team with MLP Parser using Context Embeddings

CoNLL18:
- KParse team with Tree-stack LSTM Parser using Context and Morph-feat Embeddings


Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to Universal Dependencies:

CoNLL17:
- Koc-University team with MLP Parser using Context Embeddings

CoNLL18:
- KParse team with Tree-stack LSTM Parser using Context and Morph-feat Embeddings


a Language Model


Language Model (LM)

The LM is used to obtain Context and Word embeddings, with two components:

Character Based LSTM extracts word vectors

Word Based BiLSTM extracts context vectors
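A minimal sketch of these two components, with toy dimensions and a simplified recurrence standing in for the LSTM cells (deterministic per-character embeddings are an assumption for illustration): a character-level pass yields a word vector, and a word-level forward/backward pass yields context vectors.

```python
import math
import random

DIM = 4  # toy hidden size

def step(h, x):
    # One simplified recurrent step (a stand-in for an LSTM cell):
    # the new hidden state mixes the previous hidden state and the input.
    return [math.tanh(0.5 * hi + 0.5 * xi) for hi, xi in zip(h, x)]

def char_vec(ch):
    rnd = random.Random(ord(ch))  # deterministic per-character embedding
    return [rnd.uniform(-1, 1) for _ in range(DIM)]

def word_vector(word):
    # Character-based recurrence: the final hidden state is the word vector.
    h = [0.0] * DIM
    for ch in word:
        h = step(h, char_vec(ch))
    return h

def context_vectors(words):
    # Word-based bidirectional pass: concatenate forward and backward states.
    vecs = [word_vector(w) for w in words]
    fwd, h = [], [0.0] * DIM
    for v in vecs:
        h = step(h, v)
        fwd.append(h)
    bwd, h = [], [0.0] * DIM
    for v in reversed(vecs):
        h = step(h, v)
        bwd.append(h)
    bwd.reverse()
    return [f + b for f, b in zip(fwd, bwd)]  # list concat = vector concat

ctx = context_vectors(["Economic", "news", "had", "little", "effect"])
```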


Language Model - Word vectors

Character based LSTM generates word Vectors

Figure Character LSTM from Kırnap et al 2017


Language Model - Context Vectors

Word based BiLSTM generates Context Vectors

Figure Word BiLSTM from Kırnap et al 2017


b MLP Parser (CoNLL17)


MLP Parser

MLP Parser consists of 4 components

Character Based LSTM extracts word vectors

Word Based BiLSTM extracts context vectors

Feature extractor describes current state

Decision module (MLP) decides the next transition


MLP Parser - Feature Extraction

Feature extractor describes current state

Figure: Kırnap et al. 2017

MLP Parser - Decision Module

Decision module (MLP) decides the next transition


Experiments amp Dataset (MLP) CoNLL17

CoNLL17 Dataset

Dependency parsing of 81 treebanks in 49 languages

All treebanks use standardized annotation:
- 17 universal part-of-speech tags
- 37 universal dependency relations


Experiments - Evaluation Metric

Labeled Attachment Score (LAS): the percentage of words correctly assigned both the correct syntactic head and the correct dependency label

[Example: for the gold tree of "Economic news had" (arcs SBJ, ATT), a prediction with both arcs wrong (PRED, OBJ) gets LAS 0, and a prediction with one of the two arcs correct (OBJ, ATT) gets LAS (1/2) x 100 = 50]
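The metric can be sketched directly (head index and label per scored word; a hypothetical minimal scorer, not the official CoNLL evaluation script):

```python
def las(gold, pred):
    """Labeled Attachment Score: percentage of words whose predicted head
    AND dependency label both match the gold annotation."""
    assert len(gold) == len(pred)
    correct = sum(1 for g, p in zip(gold, pred) if g == p)
    return 100.0 * correct / len(gold)

# (head, label) for the two dependents in "Economic news had"
# (word 1=Economic, 2=news, 3=had; Economic attaches to news, news to had):
gold  = [(2, "ATT"), (3, "SBJ")]
pred1 = [(2, "OBJ"), (3, "PRED")]  # both labels wrong -> LAS 0
pred2 = [(2, "ATT"), (3, "OBJ")]   # one of two words fully correct -> LAS 50
```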


Experiments (MLP)

CoNLL 2017 Results (all treebanks LAS)

Ranked 1st among transition based parsers 5

5. Source: CoNLL17 official results page

Contributions in CoNLL17


Context and Word Embeddings

Relative contributions of part-of-speech (p), word vector (v), context vector (c):

Feats | Hungarian | En-ParTUT | Latvian
p | 63.6 | 76.6 | 55.9
v | 73.5 | 75.9 | 63
c | 72.2 | 76 | 63.5
v-c | 76 | 79 | 67.6
p-c | 78 | 82.5 | 70.6
p-v | 76.6 | 80.8 | 67.7
p-fb | 74.7 | 79.7 | 66.3
p-v-c | 79.3 | 83.2 | 74.2


Context and Word Embeddings

Relative contributions of part-of-speech (p), word vector (v), context vector (c):

Feats | Hungarian | En-ParTUT | Latvian
p | 63.6 | 76.6 | 55.9
v | 73.5 | 75.9 | 63
c | 72.2 | 76 | 63.5
v-c | 76 | 79 | 67.6
p-c | 78 | 82.5 | 70.6
p-v | 76.6 | 80.8 | 67.7
p-fb | 74.7 | 79.7 | 66.3
p-v-c | 79.3 | 83.2 | 74.2

Context vectors provide an independent contribution on top of POS tags


Context and Word embeddings

Feats | Hungarian | En-ParTUT | Latvian
p | 63.6 | 76.6 | 55.9
v | 73.5 | 75.9 | 63
c | 72.2 | 76 | 63.5
v-c | 76 | 79 | 67.6
p-c | 78 | 82.5 | 70.6
p-v | 76.6 | 80.8 | 67.7
p-fb | 74.7 | 79.7 | 66.3
p-v-c | 79.3 | 83.2 | 74.2

Our BiLSTM language model word vectors perform better than FB (Facebook) vectors


Context and Word embeddings

Feats | Hungarian | En-ParTUT | Latvian
p | 63.6 | 76.6 | 55.9
v | 73.5 | 75.9 | 63
c | 72.2 | 76 | 63.5
v-c | 76 | 79 | 67.6
p-c | 78 | 82.5 | 70.6
p-v | 76.6 | 80.8 | 67.7
p-fb | 74.7 | 79.7 | 66.3
p-v-c | 79.3 | 83.2 | 74.2

Both POS tags and context vectors have significant contributions on top of word vectors


Issues with MLP

However

Choosing the correct parser state representation still remains critical

We are unable to represent the whole parsing history with feature extraction


Solution

Find a recurrent architecture that can summarize the parsing history as well as the word sequences in the buffer and stack


Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to Universal Dependencies:

CoNLL17:
- Koc-University team with MLP Parser using Context Embeddings

CoNLL18:
- KParse team with Tree-stack LSTM Parser using Context and Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 53 123

c Tree-stack LSTM Parser (CoNLL18)


Related Work - Stack LSTM

Figure Stack LSTM [Dyer et al 2015]

Represent each component (σ, β, A) with an LSTM
Modify the head word's embedding with the dependent's embedding


Problems with Stack LSTM

They only modify the stack's word embeddings

Hidden states of the LSTMs are not updated unless a reduce occurs

Actions are not explicitly represented

They only used word2vec embeddings [Mikolov et al 2013]


Our solution

We propose

Context embeddings should improve parsing accuracy

Dependency relations should be explicitly represented

Morphological Features of a word may enhance parsing accuracy


Tree-stack LSTM Overview

[Figure: Tree-stack LSTM architecture: σ-LSTM, β-LSTM and Action-LSTM states, combined with the t-RNN output in a concat layer feeding an MLP]

We propose Tree-stack LSTM model with 4 components

- β-LSTM
- σ-LSTM
- Action-LSTM
- Tree-RNN


Tree-stack LSTM

Input Representation


Input Representation

Action and Dependency Relation Embeddings

Every action is represented with continuous vector

Every dependency relation is represented with continuous vector


Input Representation

We do not include an explicit feature extractor. We initialize the word representation by concatenating:

Character Based LSTMrsquos word vectors

Word Based BiLSTMrsquos context vectors

Part-of-speech (POS) vectors

Morph-feat vectors


Input Representation

Morp-feat Vectors

Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs (the FEATS string of the word "It")

Figure Morph-feat Embeddings
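A sketch of how a morphological feature string like the one above could be mapped to a single vector, here by averaging per-feature embeddings (the deterministic embeddings and the averaging scheme are assumptions for illustration, not the thesis implementation):

```python
import random

DIM = 8
_feat_emb = {}

def feat_embedding(feat):
    # One deterministic pseudo-random vector per key=value feature.
    if feat not in _feat_emb:
        rnd = random.Random(feat)
        _feat_emb[feat] = [rnd.uniform(-1, 1) for _ in range(DIM)]
    return _feat_emb[feat]

def morph_feat_vector(feats_string):
    """Map a UD FEATS string, e.g. 'Case=Nom|Number=Sing', to one vector
    by averaging the embeddings of its individual key=value features."""
    if feats_string == "_":          # UD convention for "no features"
        return [0.0] * DIM
    vecs = [feat_embedding(f) for f in feats_string.split("|")]
    return [sum(col) / len(vecs) for col in zip(*vecs)]

v = morph_feat_vector("Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs")
```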


Tree-stack LSTM

Model Components:
1. β-LSTM
2. σ-LSTM
3. Action-LSTM
4. Tree-RNN


β-LSTM

[Figure: Tree-stack LSTM architecture, β-LSTM component highlighted]


β-LSTM

[Figure: Buffer's β-LSTM running over the upcoming words w_i, w_i+1, w_i+2]


σ-LSTM

[Figure: Tree-stack LSTM architecture, σ-LSTM component highlighted]


σ-LSTM

[Figure: Stack's σ-LSTM running over the stack words s_i, s_i+1, s_i+2]


Action-LSTM

[Figure: Tree-stack LSTM architecture, Action-LSTM component highlighted]


Action-LSTM

[Figure: Action-LSTM running over the sequence of previous transitions]


How are the components of tree-stack LSTM connected?


Tree-RNN


Tree-RNN (t-RNN)

[Figure: t-RNN combining the head word, dependency relation and dependent word]

w_head_new = tanh(W_rnn · [w_head_old ; d_l ; w_dep] + b_rnn)   (1)
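Equation (1) can be written out directly; a minimal sketch with toy dimensions and random weights (the sizes and initialization are assumptions for illustration):

```python
import math
import random

random.seed(1)
D = 4  # toy embedding size for words and dependency labels

# W_rnn maps the concatenation [head; label; dependent] (length 3*D) back to D.
W_rnn = [[random.uniform(-0.5, 0.5) for _ in range(3 * D)] for _ in range(D)]
b_rnn = [0.0] * D

def t_rnn(w_head, d_label, w_dep):
    """Eq. (1): new head embedding = tanh(W_rnn * [head; label; dep] + b)."""
    x = w_head + d_label + w_dep          # list concatenation = vector concat
    return [math.tanh(sum(w * xi for w, xi in zip(row, x)) + b)
            for row, b in zip(W_rnn, b_rnn)]

head = [0.1] * D
dep = [0.2] * D
label = [0.3] * D   # embedding of the dependency relation
new_head = t_rnn(head, label, dep)
```

Because the output has the same dimension as a word embedding, the composed head can be fed back into further t-RNN compositions as the tree grows.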


Tree-RNN with

1. Left Transition
2. Right Transition


Left Transition


Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: Each embedding is initialized by concatenating POS, language and morph-feat embeddings


Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: Stack's top LSTM is reduced

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: t-RNN calculates the new head embedding


Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: β-LSTM recalculates its hidden state based on the new input


Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: Tree-stack LSTM is ready for the next transition


Right Transition


Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: Each embedding is initialized by concatenating POS, language and morph-feat embeddings


Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: Stack's top LSTM is reduced


Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: t-RNN calculates the new head embedding


Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: σ-LSTM recalculates its hidden state from the new input


Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: Tree-stack LSTM is ready for the next transition
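The left_d and right_d definitions above can be sketched on a plain stack/buffer/arc-set state, with word IDs standing in for embeddings (the t-RNN head update is elided; a sketch, not the thesis code):

```python
def left_arc(state, d):
    """left_d: top of stack s becomes a dependent of buffer front b;
    s is popped and the arc (b, d, s) is added."""
    stack, buffer, arcs = state
    s, b = stack[-1], buffer[0]
    return stack[:-1], buffer, arcs | {(b, d, s)}

def right_arc(state, d):
    """right_d: top of stack t becomes a dependent of the word s below it;
    t is popped and the arc (s, d, t) is added."""
    stack, buffer, arcs = state
    s, t = stack[-2], stack[-1]
    return stack[:-1], buffer, arcs | {(s, d, t)}

def shift(state):
    stack, buffer, arcs = state
    return stack + (buffer[0],), buffer[1:], arcs

# "Economic news had": 1=Economic, 2=news, 3=had (0 = root)
state = ((0,), (1, 2, 3), frozenset())
state = shift(state)            # push Economic
state = left_arc(state, "ATT")  # Economic <- news
state = shift(state)            # push news
state = left_arc(state, "SBJ")  # news <- had
state = shift(state)            # push had
state = right_arc(state, "ROOT")  # root -> had
```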


Final overview of Tree-stack LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP
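The final decision step can be sketched as: concatenate the component representations and let an MLP-style layer score the transitions (toy dimensions and random weights; a sketch of the idea, not the thesis implementation):

```python
import math
import random

random.seed(3)
D = 4  # toy per-component hidden size
MOVES = ["shift", "left", "right"]

# One weight row per move over the concatenated [sigma; beta; action; t-rnn].
W = [[random.uniform(-0.5, 0.5) for _ in range(4 * D)] for _ in MOVES]

def decide(h_sigma, h_beta, h_action, h_trnn):
    """Score each transition from the concatenated component states
    and return the argmax move with its softmax probabilities."""
    x = h_sigma + h_beta + h_action + h_trnn   # concat of the 4 components
    scores = [sum(w * xi for w, xi in zip(row, x)) for row in W]
    z = sum(math.exp(s) for s in scores)
    probs = {m: math.exp(s) / z for m, s in zip(MOVES, scores)}
    return max(probs, key=probs.get), probs

move, probs = decide([0.1] * D, [0.2] * D, [0.3] * D, [0.4] * D)
```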


Overview

1. Introduction: Overview of Dependency Parsing; Transition Based Dependency Parsing
2. Related Work: Linear Models and their Drawbacks; Neural Network Models
3. Model: Language Model; MLP Parser; Tree-stack LSTM Parser
4. Results: MLP vs Tree-stack LSTM; Morphological Feature Embeddings; Static vs Dynamic Oracle Training; Transfer Learning
5. Conclusion
6. Future Work & Discussions


4 Results & Comparisons


Results & Comparisons

Dataset

CoNLL17:
- Dependency parsing of 81 treebanks in 49 languages
- All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations
- Koc-University ranked 7th out of 33 participants (1st among transition based parsers)

CoNLL18:
- Dependency parsing of 82 treebanks in 57 languages
- All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations
- Koc-University ranked 16th out of 30 participants (2nd among transition based parsers)

Differences between CoNLL17 and CoNLL18: 1. Train/test split change, 2. Annotation


MLP vs Tree-stack LSTM

The CoNLL 2018 committee released comparison results of CoNLL17 and CoNLL18 systems tested under the same test sets


MLP vs Tree-stack LSTM

2 possible problems of the official comparison:

1. If the annotation of the treebank has improved, the older parser is handicapped

2. If the training-test split has changed and old training data are now in the test data, the old parser is favored undeservedly


MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code | MLP | Tree-stack
ru taiga (10k) | 58.89 | 60.55
hu szeged (20k) | 66.21 | 68.18
tr imst (50k) | 56.78 | 58.75
ar padt (120k) | 67.83 | 68.14
en ewt (205k) | 74.87 | 75.77
cs cac (473k) | 83.39 | 83.57

Tree-stack LSTM outperforms MLP


Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 93 123

MLP Parser

MLP

Figure Initial model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 94 123

Only Action LSTM

LSTM LSTM

Figure Only action LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 95 123

Only β-LSTM

LSTM LSTM LSTM

LSTM LSTM MLP

Figure Only β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 96 123

Only σ-LSTM

LSTM LSTM

LSTM LSTM MLP

Figure Only σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang Code MLP Only Action Only-β Only-σhu szeged 6621 6687 6694 6703sv lines 7112 7205 7217 7245tr imst 5712 5687 5702 5712ar padt 6783 6667 6689 6692

cs cac 8389 8223 8313 8317

en ewt 7554 7543 7556 7567

Table Comparison between MLP and rdquoOnlyrdquo models

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 99 123

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang Code without t-RNN with t-RNNno nynorsklia (3k) 5178 5333ru taiga (11k) 5913 6055gl treegal (15k) 6976 7045hu szeged (20k) 6612 6818sv lines (49k) 7404 7546tr imst (50k) 5812 5875

ar padt (120k) 6804 6814

en ewt (204k) 7487 7577

cs cac (473k) 8289 8357

cs pdt (1M) 8117 81164

t-RNN provides comparative advantage for low-resourcelanguages

Omer Kırnap (Koc University) MSc Thesis September 27 2018 100 123

Ablation Analysis

Overall results of ablation analysis

Lang MLP Only A Only-β Only-σ wot-RNN allhu szeged 6621 6687 6694 6703 6612 6818sv lines 7112 7205 7217 7404 7217 7546tr imst 5712 5687 5702 5712 5812 5875ar padt 6783 6667 6689 6692 6804 6814cs cac 8389 8223 8313 8317 8289 8357en ewt 7554 7543 7556 7567 7487 7577

Tree-stack LSTM beats other model variations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of Ablation Experiments

t-RNNrsquos performance contribution increases when the training sizedecreases

σ-LSTM provides more useful information independent from datasetsize

Interconnecting modelrsquos component with t-RNN makes tree-stackLSTM more powerful for low-resource languages (ranked 10th of alland 2nd among transition based parsers)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What does Morphological Feature Embedding provide

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental SettingsWe divide Conll18 UD dataset 22 into 4 parts based on number oftraining tokens for each language to better understand our contributions

Languages having less than 20k tokens

Languages having more than 20k less than 50k tokens

Languages having more than 50k less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having less than 20k training tokens

Lang code Morph-Feats no Morph-Feats of tokensno nynorsklia 5113 5333 3583

ru taiga 5832 6055 10479

sme giella 5278 5339 16385

la perseus 4993 516 18184

ug udt 5278 5339 19262

sl sst 4672 4877 19473

hu szeged 6623 6818 20166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having tokens in between 50k and100k

Lang code Morph-Feats no Morph-Feats of tokenssv lines 7218 7481 48325

fr sequoia 8436 8217 50543

en gum 7644 7534 53686

ko gsd 7374 7254 56687

eu bdt 7455 7332 72974

nl lassymal 767 758 75134

gl ctg 7902 79018 79327

lv lvtb 7233 7224 80666

id gsd 7576 7397 97531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having more than 100k trainingtokens

Lang code Morph-Feats no Morph-Feats of tokensfa seraji 8118 8112 121064

bg btb 8453 8455 124336

en ewt 7577 75682 204585

ar padt 6802 6814 223881

de gsd 7159 7132 263804

ca ancora 8589 85874 417587

es ancora 8499 8478 444617

cs cac 8357 8363 472608

cs pdt 8143 8212 1173282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle transitions using gold movesDynamic oracle transitions using predicted moves

In both cases logp of gold moves maximized

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning1 Using very limited data to train LM for word and context vectors and

use them to train a parser from scratch2 Using Facebookrsquos word vectors to train a parser [Bojanowski et al

2017]3 Using my own word and context vectors trained with different

language but from the same language family4 Applying transfer learning with a pre-trained parser

Language (1) (2) (3) (4)af afribooms not provided 7546 7743 7812kk ktb 2019 2231 2196 2386bxr bdt 764 976 993 898

kmr mg 2012 2257 2278 2339

Table LAS values for strategies (1) (2) (3) and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the mostbeneficial

From scratch LM training does not bring useful word and contextvectors

Our word and context vectors are still more useful than Facebookrsquos[Bojanowski et al 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition Based Parser can only build projective trees 6

6Figure fromhttpstplingfiluuse sarakurser5LN455-2014lectures5LN455-F8pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123

Projective vs Non-projective

We compared our model with the best model for different projectivityratios

Language Projectiviy Best (LAS) Our (LAS)

grc perseus 907 7939 5503 (20)

eu bdt 9513 8422 7413 (17)

hu szeged 978 8266 6818 (14)

da ddt 9826 8628 7640 (17)

en gum 996 8505 7644 (15)

gl treegal 100 7425 7045 (10)

gl ctg 100 8212 7945 (14)

Table Our models performance gap decreases as the projectivity ratio increases

7

7From official results page and our projectivity tableOmer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusion:

We introduced "Context, Word and Morph-feat" embeddings and showed their contribution in transition-based dependency parsing.

Our Tree-stack LSTM outperformed the MLP by removing hand-crafted feature engineering.

The Tree-stack LSTM performed better on low-resource languages.

When the training dataset size increases, the tree-stack LSTM loses its advantage.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization, morphological tagging and dependency parsing performed better. Some also jointly train a language model together with pre-trained embeddings.

Attention Mechanism

Applying attention between the σ-LSTM, β-LSTM, or Action-LSTM states may bring a performance improvement.

Morphological Features

Finding different ways to represent morphological features.

Dynamic Oracle vs Beam Training

Although I tried both of them, I could not obtain a performance improvement. There may be convergence problems with our loss function, and other losses (e.g. CRF) may solve this problem.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap, Erenay Dayanık and Deniz Yuret. 2018. Tree-stack LSTM in Transition Based Dependency Parsing. In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap, Berkay Furkan Önder and Deniz Yuret. 2017. Parsing with Context Embeddings. In Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann, Carlos Gómez-Rodríguez and Giorgio Satta. 2011. Dynamic programming algorithms for transition-based dependency parsers. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies - Volume 1. Association for Computational Linguistics, pages 673-682.

S. Kübler, R. McDonald and J. Nivre. 2009. Dependency Parsing. Morgan & Claypool, US.

Chris Dyer, Miguel Ballesteros, Wang Ling, Austin Matthews and Noah A. Smith. 2015. Transition-based dependency parsing with stack long short-term memory. CoRR abs/1505.08075.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123

• Introduction
  • Overview of Dependency Parsing
  • Transition Based Dependency Parsing
• Related Work
  • Linear Models and their Drawbacks
  • Neural Network Models
• Model
  • Language Model
  • MLP Parser
  • Tree-stack LSTM Parser
• Results
  • MLP vs Tree-stack LSTM
  • Morphological Feature Embeddings
  • Static vs Dynamic Oracle Training
  • Transfer Learning
• Conclusion
• Future Work & Discussions

Problem Definition

Find a model that learns to decide the correct transition from the current state.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 24 123

2 Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 25 123

Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 26 123

Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 27 123

Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 28 123

Related Work

Neural Networks for Feature Conjunctions

Neural networks can handle feature conjunctions and nonlinearity. However, they are impractical for high-dimensional inputs: they scale linearly in the input dimension (in both time and space, assuming a fixed number of hidden units).

Omer Kırnap (Koc University) MSc Thesis September 27 2018 29 123

Related Work

Solution: use dense embeddings for input features.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 30 123

Overview

1. Introduction
   • Overview of Dependency Parsing
   • Transition Based Dependency Parsing

2. Related Work
   • Linear Models and their Drawbacks
   • Neural Network Models

3. Model
   • Language Model
   • MLP Parser
   • Tree-stack LSTM Parser

4. Results
   • MLP vs Tree-stack LSTM
   • Morphological Feature Embeddings
   • Static vs Dynamic Oracle Training
   • Transfer Learning

5. Conclusion

6. Future Work & Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 31 123

3 Model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 32 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to Universal Dependencies:

CoNLL17:
• Koc-University team with MLP Parser using Context Embeddings

CoNLL18:
• KParse team with Tree-stack LSTM Parser using Context and Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 33 123


a Language Model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 35 123

Language Model (LM)

The LM is used to obtain Context and Word embeddings, with two components:

Character Based LSTM extracts word vectors

Word Based BiLSTM extracts context vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 36 123

Language Model - Word vectors

Character based LSTM generates word Vectors

Figure Character LSTM from Kırnap et al 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 37 123

Language Model - Context Vectors

Word based BiLSTM generates Context Vectors

Figure Word BiLSTM from Kırnap et al 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 38 123

b MLP Parser (CoNLL17)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 39 123

MLP Parser

MLP Parser consists of 4 components

Character Based LSTM extracts word vectors

Word Based BiLSTM extracts context vectors

Feature extractor describes current state

Decision module (MLP): decides the next transition
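The decision module can be sketched as a one-hidden-layer network scoring the possible transitions from the extracted state features. The sizes and the three-move inventory below are toy assumptions, not the thesis configuration:

```python
import math
import random

random.seed(2)
FEATS, HIDDEN = 6, 5
MOVES = ["shift", "left", "right"]  # toy transition inventory

# Randomly initialized (in practice, trained) weight matrices.
W1 = [[random.uniform(-0.5, 0.5) for _ in range(FEATS)] for _ in range(HIDDEN)]
W2 = [[random.uniform(-0.5, 0.5) for _ in range(HIDDEN)] for _ in range(len(MOVES))]

def mlp_decide(features):
    """Score each transition from the state features; return the argmax."""
    h = [math.tanh(sum(w * x for w, x in zip(row, features))) for row in W1]
    scores = [sum(w * x for w, x in zip(row, h)) for row in W2]
    return MOVES[max(range(len(MOVES)), key=scores.__getitem__)]

print(mlp_decide([0.1, -0.2, 0.3, 0.0, 0.5, -0.1]) in MOVES)  # True
```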

Omer Kırnap (Koc University) MSc Thesis September 27 2018 40 123

MLP Parser - Feature Extraction

Feature extractor describes current state

Figure: Kırnap et al 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 41 123

MLP Parser - Decision Module

Decision module (MLP) decides the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 42 123

Experiments amp Dataset (MLP) CoNLL17

CoNLL17 Dataset

Dependency parsing of 81 treebanks in 49 languages

All treebanks use standardized annotation:
• 17 universal part-of-speech tags
• 37 universal dependency relations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 43 123

Experiments - Evaluation Metric

Labeled Attachment Score (LAS): the percentage of words correctly assigned both the correct syntactic head and the correct dependency label.

Example with the phrase "Economic news had":

Gold tree (arcs ATT, SBJ): LAS = 1

Pred 1 (arcs PRED, OBJ; both wrong): LAS = 0

Pred 2 (arcs ATT, OBJ; one of two correct): LAS = (1/2) × 100 = 50%
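The metric on this slide can be computed per word from (head, label) pairs; a minimal sketch:

```python
def las(gold, pred):
    """Labeled Attachment Score: the percentage of words whose predicted
    (head, label) pair exactly matches the gold annotation."""
    assert len(gold) == len(pred)
    correct = sum(1 for g, p in zip(gold, pred) if g == p)
    return 100.0 * correct / len(gold)

# Arcs from the "Economic news" example: (head position, label) per word.
gold = [(2, "ATT"), (3, "SBJ")]
pred = [(2, "ATT"), (3, "OBJ")]  # second arc has the right head, wrong label
print(las(gold, pred))  # 50.0
```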

Omer Kırnap (Koc University) MSc Thesis September 27 2018 44 123

Experiments (MLP)

CoNLL 2017 Results (all treebanks LAS)

Ranked 1st among transition-based parsers. 5

5 Source: CoNLL17 official results page

Omer Kırnap (Koc University) MSc Thesis September 27 2018 45 123

Contributions in CoNLL17

Omer Kırnap (Koc University) MSc Thesis September 27 2018 46 123

Context and Word Embeddings

Relative contributions of part-of-speech (p), word vector (v), and context vector (c):

Feats  Hungarian  En-ParTUT  Latvian
p      63.6       76.6       55.9
v      73.5       75.9       63
c      72.2       76         63.5
v-c    76         79         67.6
p-c    78         82.5       70.6
p-v    76.6       80.8       67.7
p-fb   74.7       79.7       66.3
p-v-c  79.3       83.2       74.2

Context and Word Embeddings


Context vectors provide an independent contribution on top of POS tags.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 48 123

Context and Word embeddings


Our BiLSTM language model word vectors perform better than the Facebook (fb) vectors.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 49 123

Context and Word embeddings


Both POS tags and context vectors make significant contributions on top of word vectors.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 50 123

Issues with MLP

However

Choosing the correct parser state features still remains critical.

We are unable to represent the whole parsing history with hand-crafted feature extraction.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 51 123

Solution

Find a recurrent architecture that can summarize the parsing history as well as the word sequences in the buffer and the stack.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 52 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to Universal Dependencies:

CoNLL17:
• Koc-University team with MLP Parser using Context Embeddings

CoNLL18:
• KParse team with Tree-stack LSTM Parser using Context and Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 53 123

c Tree-stack LSTM Parser (CoNLL18)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 54 123

Related Work - Stack LSTM

Figure Stack LSTM [Dyer et al 2015]

Represents each component (σ, β, A) with an LSTM. Modifies the head word's embedding with the dependent's embedding.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 55 123

Problems with Stack LSTM

They only modify the stack's word embeddings.

The hidden states of the LSTMs are not updated unless a reduce occurs.

Actions are not explicitly represented.

They only use word2vec embeddings [Mikolov et al 2013].

Omer Kırnap (Koc University) MSc Thesis September 27 2018 56 123

Our solution

We propose

Context embeddings should improve parsing accuracy

Dependency relations should be explicitly represented

Morphological Features of a word may enhance parsing accuracy

Omer Kırnap (Koc University) MSc Thesis September 27 2018 57 123

Tree-stack LSTM Overview

[Figure: Tree-stack LSTM. The t-RNN combines the head word, dependent word and dependency relation; the outputs of the σ-LSTM, β-LSTM and Action-LSTM are concatenated and fed to an MLP.]

We propose the Tree-stack LSTM model with 4 components:

1. β-LSTM
2. σ-LSTM
3. Action-LSTM
4. Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 58 123

Tree-stack LSTM

Input Representation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 59 123

Input Representation

Action and Dependency Relation Embeddings

Every action is represented with continuous vector

Every dependency relation is represented with continuous vector

Omer Kırnap (Koc University) MSc Thesis September 27 2018 60 123

Input Representation

We do not include an explicit feature extractor. We initialize the word representation by concatenating:

Character Based LSTMrsquos word vectors

Word Based BiLSTMrsquos context vectors

Part-of-speech (POS) vectors

Morph-feat vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 61 123

Input Representation

Morph-feat Vectors

Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs IT It

Figure Morph-feat Embeddings
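A morph-feat string like the one in the figure can be embedded by splitting the UD FEATS field on "|" and pooling one vector per key=value pair. Whether the thesis sums or concatenates the per-feature vectors is not shown on this slide, so treat the pooling choice (and the random initialization) as assumptions:

```python
import random

random.seed(0)
DIM = 8
_table = {}

def feat_vector(feat):
    # One (in practice trainable) vector per key=value pair.
    if feat not in _table:
        _table[feat] = [random.uniform(-0.1, 0.1) for _ in range(DIM)]
    return _table[feat]

def morph_feat_embedding(feats):
    """Embed a UD FEATS string by summing the vectors of its key=value pairs."""
    total = [0.0] * DIM
    for feat in feats.split("|"):
        total = [t + f for t, f in zip(total, feat_vector(feat))]
    return total

v = morph_feat_embedding("Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs")
print(len(v))  # 8
```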

Omer Kırnap (Koc University) MSc Thesis September 27 2018 62 123

Tree-stack LSTM

Model Components:
1. β-LSTM
2. σ-LSTM
3. Action-LSTM
4. Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 63 123

β-LSTM


Omer Kırnap (Koc University) MSc Thesis September 27 2018 64 123

β-LSTM

[Figure: Buffer's β-LSTM running over the buffer words wi, wi+1, wi+2.]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 65 123

σ-LSTM


Omer Kırnap (Koc University) MSc Thesis September 27 2018 66 123

σ-LSTM

[Figure: Stack's σ-LSTM running over the stack words si, si+1, si+2.]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 67 123

Action-LSTM


Omer Kırnap (Koc University) MSc Thesis September 27 2018 68 123

Action-LSTM

[Figure: Action-LSTM running over the transition history.]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 69 123

How are the components of the tree-stack LSTM connected?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 70 123

Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 71 123

Tree-RNN (t-RNN)

[Figure: t-RNN combining the dependent word and the dependency relation into the head word.]

w_head_new = tanh(W_rnn · [w_head_old; d_l; w_dep] + b_rnn)    (1)
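Equation (1) can be sketched directly; the toy dimensions and random weights below are placeholders for the trained parameters:

```python
import math
import random

random.seed(1)
DIM = 4  # toy embedding size; the concatenated input is 3 * DIM

W_rnn = [[random.uniform(-0.5, 0.5) for _ in range(3 * DIM)] for _ in range(DIM)]
b_rnn = [0.0] * DIM

def trnn_compose(w_head, d_label, w_dep):
    """Eq. (1): w_head_new = tanh(W_rnn . [w_head; d_label; w_dep] + b_rnn)."""
    x = w_head + d_label + w_dep  # list concatenation = vector concatenation
    return [math.tanh(sum(w * xi for w, xi in zip(row, x)) + b)
            for row, b in zip(W_rnn, b_rnn)]

new_head = trnn_compose([0.1] * DIM, [0.3] * DIM, [0.2] * DIM)
print(len(new_head))  # 4
```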

Omer Kırnap (Koc University) MSc Thesis September 27 2018 72 123

Tree-RNN with

1. Left Transition
2. Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 73 123

Left Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 74 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

[Figure sequence: each embedding is initiated by concatenating POS, language and morph-feat embeddings; the stack's top LSTM is reduced; the t-RNN calculates the new head embedding; the β-LSTM recalculates its hidden state from the new input; the tree-stack LSTM is ready to give the next transition.]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 79 123

Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 80 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

[Figure sequence: each embedding is initiated by concatenating POS, language and morph-feat embeddings; the stack's top LSTM is reduced; the t-RNN calculates the new head embedding; the σ-LSTM recalculates its hidden state from the new input; the tree-stack LSTM is ready to give the next transition.]
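The two transitions can be sketched as operations on (stack, buffer, arcs) following the equations on these slides; the toy driver parses the word positions of the "Economic news had" example with left arcs only:

```python
def shift(stack, buffer, arcs):
    stack.append(buffer.pop(0))

def left_arc(stack, buffer, arcs, d):
    # left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)}):
    # the stack top s is popped and attached to the buffer front b.
    s = stack.pop()
    arcs.add((buffer[0], d, s))

def right_arc(stack, buffer, arcs, d):
    # right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)}):
    # the stack top t is popped and attached to the new stack top s.
    t = stack.pop()
    arcs.add((stack[-1], d, t))

# Toy run over word positions 1..3 ("Economic news had"):
stack, buffer, arcs = [], [1, 2, 3], set()
shift(stack, buffer, arcs)            # σ=[1], β=[2,3]
left_arc(stack, buffer, arcs, "ATT")  # news -ATT-> Economic
shift(stack, buffer, arcs)            # σ=[2], β=[3]
left_arc(stack, buffer, arcs, "SBJ")  # had -SBJ-> news
print(sorted(arcs))  # [(2, 'ATT', 1), (3, 'SBJ', 2)]
```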

Omer Kırnap (Koc University) MSc Thesis September 27 2018 85 123

Final overview of Tree-stack LSTM

[Figure: Tree-stack LSTM. The t-RNN combines the head word, dependent word and dependency relation; the outputs of the σ-LSTM, β-LSTM and Action-LSTM are concatenated and fed to an MLP.]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 86 123

Overview

1. Introduction
   • Overview of Dependency Parsing
   • Transition Based Dependency Parsing

2. Related Work
   • Linear Models and their Drawbacks
   • Neural Network Models

3. Model
   • Language Model
   • MLP Parser
   • Tree-stack LSTM Parser

4. Results
   • MLP vs Tree-stack LSTM
   • Morphological Feature Embeddings
   • Static vs Dynamic Oracle Training
   • Transfer Learning

5. Conclusion

6. Future Work & Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 87 123

4 Results amp Comparisons

Omer Kırnap (Koc University) MSc Thesis September 27 2018 88 123

Results amp Comparisons

Dataset

CoNLL17:
• Dependency parsing of 81 treebanks in 49 languages
• All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations
• Koc-University ranked 7th out of 33 participants (1st among transition-based parsers)

CoNLL18:
• Dependency parsing of 82 treebanks in 57 languages
• All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations
• Koc-University ranked 16th out of 30 participants (2nd among transition-based parsers)

Changes from CoNLL17 to CoNLL18: 1. Train/test split change 2. Annotation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 89 123

MLP vs Tree-stack LSTM

The CoNLL 2018 committee released comparison results of CoNLL17 and CoNLL18 systems tested on the same test sets.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 90 123

MLP vs Tree-stack LSTM

2 possible problems with the official comparison:

1. If the annotation of the treebank was improved, the older parser is handicapped.
2. If the training-test split has changed and old training data are now in the test data, the old parser is favored undeservedly.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 91 123

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code        MLP    Tree-stack
ru taiga (10k)   58.89  60.55
hu szeged (20k)  66.21  68.18
tr imst (50k)    56.78  58.75
ar padt (120k)   67.83  68.14
en ewt (205k)    74.87  75.77
cs cac (473k)    83.39  83.57

Tree-stack LSTM outperforms MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 92 123

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 93 123

MLP Parser

MLP

Figure Initial model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 94 123

Only Action LSTM


Figure Only action LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 95 123

Only β-LSTM


Figure Only β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 96 123

Only σ-LSTM


Figure Only σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang Code  MLP    Only Action  Only-β  Only-σ
hu szeged  66.21  66.87        66.94   67.03
sv lines   71.12  72.05        72.17   72.45
tr imst    57.12  56.87        57.02   57.12
ar padt    67.83  66.67        66.89   66.92
cs cac     83.89  82.23        83.13   83.17
en ewt     75.54  75.43        75.56   75.67

Table: Comparison between the MLP and the "Only" models

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN


Omer Kırnap (Koc University) MSc Thesis September 27 2018 99 123

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang Code           without t-RNN  with t-RNN
no nynorsklia (3k)  51.78          53.33
ru taiga (11k)      59.13          60.55
gl treegal (15k)    69.76          70.45
hu szeged (20k)     66.12          68.18
sv lines (49k)      74.04          75.46
tr imst (50k)       58.12          58.75
ar padt (120k)      68.04          68.14
en ewt (204k)       74.87          75.77
cs cac (473k)       82.89          83.57
cs pdt (1M)         81.17          81.164

The t-RNN provides a comparative advantage for low-resource languages.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 100 123

Ablation Analysis

Overall results of ablation analysis

Lang       MLP    Only A  Only-β  Only-σ  w/o t-RNN  all
hu szeged  66.21  66.87   66.94   67.03   66.12      68.18
sv lines   71.12  72.05   72.17   74.04   72.17      75.46
tr imst    57.12  56.87   57.02   57.12   58.12      58.75
ar padt    67.83  66.67   66.89   66.92   68.04      68.14
cs cac     83.89  82.23   83.13   83.17   82.89      83.57
en ewt     75.54  75.43   75.56   75.67   74.87      75.77

Tree-stack LSTM beats other model variations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of the Ablation Experiments:

The t-RNN's performance contribution increases as the training size decreases.

The σ-LSTM provides more useful information independent of dataset size.

Interconnecting the model's components with the t-RNN makes the tree-stack LSTM more powerful for low-resource languages (ranked 10th overall and 2nd among transition-based parsers).

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What does Morphological Feature Embedding provide

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental Settings: We divide the CoNLL18 UD dataset into 4 parts based on the number of training tokens for each language, to better understand our contributions:

Languages having less than 20k tokens

Languages having more than 20k less than 50k tokens

Languages having more than 50k less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having less than 20k training tokens:

Lang code      Morph-Feats  no Morph-Feats  # of tokens
no nynorsklia  51.13        53.33           3583
ru taiga       58.32        60.55           10479
sme giella     52.78        53.39           16385
la perseus     49.93        51.6            18184
ug udt         52.78        53.39           19262
sl sst         46.72        48.77           19473
hu szeged      66.23        68.18           20166

Morph-feats are not useful for languages having less than 20k training tokens.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having between 50k and 100k training tokens:

Lang code      Morph-Feats  no Morph-Feats  # of tokens
sv lines       72.18        74.81           48325
fr sequoia     84.36        82.17           50543
en gum         76.44        75.34           53686
ko gsd         73.74        72.54           56687
eu bdt         74.55        73.32           72974
nl lassysmall  76.7         75.8            75134
gl ctg         79.02        79.018          79327
lv lvtb        72.33        72.24           80666
id gsd         75.76        73.97           97531

Morph-feats are beneficial for languages with 50k-100k training tokens.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having more than 100k training tokens:

Lang code  Morph-Feats  no Morph-Feats  # of tokens
fa seraji  81.18        81.12           121064
bg btb     84.53        84.55           124336
en ewt     75.77        75.682          204585
ar padt    68.02        68.14           223881
de gsd     71.59        71.32           263804
ca ancora  85.89        85.874          417587
es ancora  84.99        84.78           444617
cs cac     83.57        83.63           472608
cs pdt     81.43        82.12           1173282

Morph-feats are neutral for languages having more than 100k training tokens.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle: transitions follow the gold moves.
Dynamic oracle: transitions follow the predicted moves.

In both cases, the log-probability of the gold moves is maximized.
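The distinction can be sketched as follows. The random stand-in classifier and the 0/1 surrogate loss are toy assumptions, and a real dynamic oracle would also recompute the best reachable gold move for each visited state rather than reuse the original gold sequence:

```python
import random

random.seed(0)
MOVES = ["shift", "left", "right"]

def model_predict(state):
    # Stand-in for the trained classifier's argmax over transitions.
    return random.choice(MOVES)

def training_losses(gold_sequence, dynamic_oracle):
    """The loss always targets the gold move; the parser state advances
    with the gold move (static) or with the predicted move (dynamic)."""
    state, losses = (), []
    for gold in gold_sequence:
        predicted = model_predict(state)
        losses.append(0.0 if predicted == gold else 1.0)  # surrogate for -log p(gold|state)
        state = state + ((gold if not dynamic_oracle else predicted),)
    return losses

losses = training_losses(["shift", "shift", "left", "right"], dynamic_oracle=True)
print(len(losses))  # 4
```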


Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning1 Using very limited data to train LM for word and context vectors and

use them to train a parser from scratch2 Using Facebookrsquos word vectors to train a parser [Bojanowski et al

2017]3 Using my own word and context vectors trained with different

language but from the same language family4 Applying transfer learning with a pre-trained parser

Language (1) (2) (3) (4)af afribooms not provided 7546 7743 7812kk ktb 2019 2231 2196 2386bxr bdt 764 976 993 898

kmr mg 2012 2257 2278 2339

Table LAS values for strategies (1) (2) (3) and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the mostbeneficial

From scratch LM training does not bring useful word and contextvectors

Our word and context vectors are still more useful than Facebookrsquos[Bojanowski et al 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition Based Parser can only build projective trees 6

6Figure fromhttpstplingfiluuse sarakurser5LN455-2014lectures5LN455-F8pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123

Projective vs Non-projective

We compared our model with the best model for different projectivityratios

Language Projectiviy Best (LAS) Our (LAS)

grc perseus 907 7939 5503 (20)

eu bdt 9513 8422 7413 (17)

hu szeged 978 8266 6818 (14)

da ddt 9826 8628 7640 (17)

en gum 996 8505 7644 (15)

gl treegal 100 7425 7045 (10)

gl ctg 100 8212 7945 (14)

Table Our models performance gap decreases as the projectivity ratio increases

7

7From official results page and our projectivity tableOmer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusionWe introduced ldquoContext Word and Morph-featrdquo embeddings and showedtheir contribution in transition based dependency parsing

Our Tree-stack LSTM outperformed MLP by removing hand-craftedfeature engineering

Tree-stack LSTM performed better with low resource languages

When the training dataset size increases tree-stack LSTM losses itsadvantage

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization morphological taggingand dependency parsing performed better Some are also jointly trained alanguage model together with pre-trained embeddings

Attention Mechanism

Applying attention in between σ-LSTM states or β-LSTM orAction-LSTM may bring performance improvement

Morphological Features

Finding different way to represent morphological features

Dynamic Oracle vs Beam Training

Although I tried both of them I could not obtain performanceimprovement There may be convergence problems with our loss functionand another losses (CRF) may solve this problem

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap Erenay Dayanık and Deniz Yuret 2018 Tree-stackLSTM in Transition Based Dependency Parsing In Proceedings ofthe CoNLL 2018 Shared Task Multilingual Parsing from Raw Text toUniversal Dependencies

Omer Kırnap Berkay Furkan Onder and Deniz Yuret 2017 Parsingwith Context Embeddings In Proceedings of the CoNLL 2017 SharedTask Multilingual Parsing from Raw Text to Universal Dependencies

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann Carlos Gomez-Rodriguez and Giorgio Satta 2011Dynamic programming algorithms for transition-based dependencyparsers In Proceedings of the 49th Annual Meeting of theAssociation for Computational Linguistics Human LanguageTechnologies-Volume 1 Association for Computational Linguisticspages 673682

S Kbler R McDonald and J Nivre 2009 Dependency parsingMorgan amp Claypool US

Chris Dyer Miguel Ballesteros Wang Ling Austin Matthews andNoah A Smith 2015 Transition based dependency parsing withstack long-short term memory CoRR abs150508075

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123

  • Introduction
    • Overview of Dependency Parsing
    • Transition Based Dependency Parsing
      • Related Work
        • Linear Models and their Drawbacks
        • Neural Network Models
          • Model
            • Language Model
            • MLP Parser
            • Tree-stack LSTM Parser
              • Results
                • MLP vs Tree-stack LSTM
                • Morphological Feature Embeddings
                • Static vs Dynamic Oracle Training
                • Transfer Learning
                  • Conclusion
                  • Future Work amp Discussions
Page 17: Transition Based Dependency Parsing with Deep LearningTransition Based Dependency Parsing with Deep Learning Omer K rnap Ko˘c University okirnap@ku.edu.tr September 27, 2018 Omer

Omer Kırnap (Koc University) MSc Thesis September 27 2018 17 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 18 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 19 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 20 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 21 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 22 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 23 123

Problem Definition

Find a model learning to decide correct transition from current state

Omer Kırnap (Koc University) MSc Thesis September 27 2018 24 123

2 Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 25 123

Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 26 123

Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 27 123

Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 28 123

Related Work

Neural Networks for Feature Conjunctions

Neural networks can handle feature conjunctions and nonlinearityHoweverImpractical for high dimensional inputs they scale linearly in inputdimensions (in both time and space assuming fixed number of hiddenunits)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 29 123

Related Work

Solution Using dense embeddings for input features

Omer Kırnap (Koc University) MSc Thesis September 27 2018 30 123

Overview

1 IntroductionOverview of Dependency ParsingTransition Based Dependency Parsing

2 Related WorkLinear Models and their DrawbacksNeural Network Models

3 ModelLanguage ModelMLP ParserTree-stack LSTM Parser

4 ResultsMLP vs Tree-stack LSTMMorphological Feature EmbeddingsStatic vs Dynamic Oracle TrainingTransfer Learning

5 Conclusion6 Future Work amp Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 31 123

3 Model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 32 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to Universal Dependencies

CoNLL17
• Koc-University team with MLP Parser using Context Embeddings

CoNLL18
• KParse team with Tree-stack LSTM Parser using Context and Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 33 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to Universal Dependencies

CoNLL17
• Koc-University team with MLP Parser using Context Embeddings

CoNLL18
• KParse team with Tree-stack LSTM Parser using Context and Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 34 123

a Language Model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 35 123

Language Model (LM)

The LM is used to obtain Context and Word embeddings, with two components:

Character Based LSTM extracts word vectors

Word Based BiLSTM extracts context vectors
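The two components can be sketched with a toy scalar recurrence standing in for the LSTM cells. The function names and the tanh step are illustrative, not the thesis implementation; the point is that each word's context vector pairs a forward and a backward state.

```python
import math

def rnn_states(seq, step):
    """Run a simple recurrence over seq, returning the state at each step."""
    h, states = 0.0, []
    for x in seq:
        h = step(h, x)
        states.append(h)
    return states

def context_vectors(word_vals):
    """Toy BiLSTM-style context extractor: each word's context vector is
    the pair (forward state, backward state) at that position."""
    step = lambda h, x: math.tanh(0.5 * h + x)   # stand-in for an LSTM cell
    fwd = rnn_states(word_vals, step)
    bwd = rnn_states(word_vals[::-1], step)[::-1]
    return list(zip(fwd, bwd))

ctx = context_vectors([0.1, 0.4, -0.2])
```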

Omer Kırnap (Koc University) MSc Thesis September 27 2018 36 123

Language Model - Word vectors

Character based LSTM generates word Vectors

Figure Character LSTM from Kırnap et al 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 37 123

Language Model - Context Vectors

Word based BiLSTM generates Context Vectors

Figure Word BiLSTM from Kırnap et al 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 38 123

b MLP Parser (CoNLL17)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 39 123

MLP Parser

MLP Parser consists of 4 components

Character Based LSTM extracts word vectors

Word Based BiLSTM extracts context vectors

Feature extractor describes current state

Decision module (MLP) decides the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 40 123

MLP Parser - Feature Extraction

Feature extractor describes current state

Figure: Kırnap et al 2017
Omer Kırnap (Koc University) MSc Thesis September 27 2018 41 123

MLP Parser - Decision Module

Decision module (MLP) decides the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 42 123

Experiments amp Dataset (MLP) CoNLL17

CoNLL17 Dataset

Dependency parsing of 81 treebanks in 49 languages

All treebanks use standardized annotation
• 17 universal part-of-speech tags
• 37 universal dependency relations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 43 123

Experiments - Evaluation Metric

Labeled Attachment Score (LAS): the percentage of words assigned both the correct syntactic head and the correct dependency label

Example: "Economic news had …"

Gold tree (arcs ATT, SBJ): LAS = 1
Pred 1 (arcs PRED, OBJ): LAS = 0
Pred 2 (arcs ATT, OBJ): LAS = (1/2) · 100 = 50%
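The metric is straightforward to compute; a minimal sketch, assuming each word is annotated with a (head, label) pair (the tuple encoding is illustrative):

```python
def las(gold, pred):
    """Labeled Attachment Score: fraction of words whose predicted
    (head, label) pair exactly matches the gold annotation."""
    assert len(gold) == len(pred)
    correct = sum(1 for g, p in zip(gold, pred) if g == p)
    return correct / len(gold)

# Toy example: each word annotated with (head_index, relation).
gold  = [(2, "ATT"), (3, "SBJ"), (0, "ROOT")]
pred1 = [(2, "ATT"), (3, "OBJ"), (0, "ROOT")]  # one wrong label
score = las(gold, pred1)                        # 2 of 3 words correct
```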

Omer Kırnap (Koc University) MSc Thesis September 27 2018 44 123

Experiments (MLP)

CoNLL 2017 Results (all treebanks LAS)

Ranked 1st among transition based parsers 5

5Source CoNLL17 official results pageOmer Kırnap (Koc University) MSc Thesis September 27 2018 45 123

Contributions in CoNLL17

Omer Kırnap (Koc University) MSc Thesis September 27 2018 46 123

Context and Word Embeddings

Relative contributions of part-of-speech (p), word vector (v), context vector (c):

Feats   Hungarian  En-ParTUT  Latvian
p       63.6       76.6       55.9
v       73.5       75.9       63.0
c       72.2       76.0       63.5
v-c     76.0       79.0       67.6
p-c     78.0       82.5       70.6
p-v     76.6       80.8       67.7
p-fb    74.7       79.7       66.3
p-v-c   79.3       83.2       74.2

Omer Kırnap (Koc University) MSc Thesis September 27 2018 47 123

Context and Word Embeddings

Relative contributions of part-of-speech (p), word vector (v), context vector (c):

Feats   Hungarian  En-ParTUT  Latvian
p       63.6       76.6       55.9
v       73.5       75.9       63.0
c       72.2       76.0       63.5
v-c     76.0       79.0       67.6
p-c     78.0       82.5       70.6
p-v     76.6       80.8       67.7
p-fb    74.7       79.7       66.3
p-v-c   79.3       83.2       74.2

Context vectors provide an independent contribution on top of POS tags

Omer Kırnap (Koc University) MSc Thesis September 27 2018 48 123

Context and Word embeddings

Feats   Hungarian  En-ParTUT  Latvian
p       63.6       76.6       55.9
v       73.5       75.9       63.0
c       72.2       76.0       63.5
v-c     76.0       79.0       67.6
p-c     78.0       82.5       70.6
p-v     76.6       80.8       67.7
p-fb    74.7       79.7       66.3
p-v-c   79.3       83.2       74.2

Our BiLSTM language model word vectors perform better than FB vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 49 123

Context and Word embeddings

Feats   Hungarian  En-ParTUT  Latvian
p       63.6       76.6       55.9
v       73.5       75.9       63.0
c       72.2       76.0       63.5
v-c     76.0       79.0       67.6
p-c     78.0       82.5       70.6
p-v     76.6       80.8       67.7
p-fb    74.7       79.7       66.3
p-v-c   79.3       83.2       74.2

Both POS tags and context vectors make significant contributions on top of word vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 50 123

Issues with MLP

However

Choosing the correct state representation for the parser still remains critical

We are unable to represent the whole parsing history with feature extraction

Omer Kırnap (Koc University) MSc Thesis September 27 2018 51 123

Solution

Find a recurrent architecture that can summarize the parsing history as well as the word sequences in the buffer and stack

Omer Kırnap (Koc University) MSc Thesis September 27 2018 52 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to Universal Dependencies

CoNLL17
• Koc-University team with MLP Parser using Context Embeddings

CoNLL18
• KParse team with Tree-stack LSTM Parser using Context and Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 53 123

c Tree-stack LSTM Parser (CoNLL18)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 54 123

Related Work - Stack LSTM

Figure Stack LSTM [Dyer et al 2015]

Represent each component (σ, β, A) with an LSTM
Modify the head word's embedding with the dependent's embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 55 123

Problems with Stack LSTM

They only modify the stack's word embeddings

Hidden states of the LSTMs are not updated unless a reduce occurs

Actions are not explicitly represented

They only used word2vec embeddings [Mikolov et al 2013]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 56 123

Our solution

We propose

Context embeddings should improve parsing accuracy

Dependency relations should be explicitly represented

Morphological Features of a word may enhance parsing accuracy

Omer Kırnap (Koc University) MSc Thesis September 27 2018 57 123

Tree-stack LSTM Overview

[Figure: Tree-stack LSTM: the β-LSTM (buffer), σ-LSTM (stack), and Action-LSTM states are concatenated with t-RNN head embeddings and fed to an MLP]

We propose the Tree-stack LSTM model with 4 components:
1 β-LSTM
2 σ-LSTM
3 Action-LSTM
4 Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 58 123

Tree-stack LSTM

Input Representation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 59 123

Input Representation

Action and Dependency Relation Embeddings

Every action is represented with a continuous vector

Every dependency relation is represented with a continuous vector

Omer Kırnap (Koc University) MSc Thesis September 27 2018 60 123

Input Representation

We do not include an explicit feature extractor. We initialize the word representation by concatenating:

Character Based LSTM's word vectors

Word Based BiLSTM's context vectors

Part-of-speech (POS) vectors

Morph-feat vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 61 123

Input Representation

Morph-feat Vectors

Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs IT It

Figure Morph-feat Embeddings
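One way to realize such an embedding is to sum per-feature embeddings over the `|`-separated feature=value pairs of the UD FEATS string. The table of random vectors below is illustrative, and the thesis may compose the features differently; this is a sketch of the idea.

```python
import random

random.seed(0)
DIM = 8
# Hypothetical embedding table for feature=value pairs seen in training.
feat_emb = {f: [random.uniform(-1, 1) for _ in range(DIM)]
            for f in ["Case=Nom", "Gender=Neut", "Number=Sing",
                      "Person=3", "PronType=Prs"]}

def morph_feat_vector(feats):
    """Embed a UD feature string like 'Case=Nom|Gender=Neut|...' by
    summing the embeddings of its individual feature=value pairs."""
    vec = [0.0] * DIM
    for f in feats.split("|"):
        if f in feat_emb:                # unknown features are skipped
            vec = [a + b for a, b in zip(vec, feat_emb[f])]
    return vec

v = morph_feat_vector("Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs")
```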

Omer Kırnap (Koc University) MSc Thesis September 27 2018 62 123

Tree-stack LSTM

Model Components
1 β-LSTM
2 σ-LSTM
3 Action-LSTM
4 Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 63 123

β-LSTM


Omer Kırnap (Koc University) MSc Thesis September 27 2018 64 123

β-LSTM

LSTM LSTM LSTM

w_i, w_{i+1}, w_{i+2}

Figure: Buffer's β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 65 123

σ-LSTM


Omer Kırnap (Koc University) MSc Thesis September 27 2018 66 123

σ-LSTM

LSTM LSTM LSTM

s_i, s_{i+1}, s_{i+2}

Figure: Stack's σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 67 123

Action-LSTM


Omer Kırnap (Koc University) MSc Thesis September 27 2018 68 123

Action-LSTM

LSTM LSTM LSTM

Figure Action-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 69 123

How are the components of the tree-stack LSTM connected?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 70 123

Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 71 123

Tree-RNN (t-RNN)

Figure: t-RNN takes the dependent word, dependency relation, and head word embeddings as inputs

w_head^new = tanh(W_rnn · [w_head^old ; d_l ; w_dep] + b_rnn)    (1)
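Equation (1) can be sketched directly: the new head embedding is a tanh layer applied to the concatenation of the old head, the dependency-relation vector, and the dependent. The dimension and random weights below are illustrative.

```python
import math
import random

random.seed(1)
DIM = 4
# Illustrative t-RNN parameters: W_rnn maps a 3*DIM concatenation to DIM.
W = [[random.uniform(-0.1, 0.1) for _ in range(3 * DIM)] for _ in range(DIM)]
b = [0.0] * DIM

def trnn(head, deprel, dep):
    """t-RNN composition (Eq. 1): update the head embedding from the
    concatenation [head_old; deprel; dep] via a tanh layer."""
    x = head + deprel + dep                       # concatenation
    return [math.tanh(sum(wi * xi for wi, xi in zip(row, x)) + bi)
            for row, bi in zip(W, b)]

new_head = trnn([0.1] * DIM, [0.2] * DIM, [0.3] * DIM)
```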

Omer Kırnap (Koc University) MSc Thesis September 27 2018 72 123

Tree-RNN with

1 Left Transition
2 Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 73 123

Left Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 74 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: Each embedding is initialized by concatenating POS, language, and morph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 75 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: Stack's top LSTM is reduced
Omer Kırnap (Koc University) MSc Thesis September 27 2018 76 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: t-RNN calculates the new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 77 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: β-LSTM recalculates its hidden state from the new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 78 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: Tree-stack LSTM is ready to predict the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 79 123

Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 80 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: Each embedding is initialized by concatenating POS, language, and morph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 81 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: Stack's top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 82 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: t-RNN calculates the new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 83 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: σ-LSTM recalculates its hidden state from the new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 84 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: Tree-stack LSTM is ready to predict the next transition
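The two transitions can be sketched on plain lists, with σ, β, and A as a stack, buffer, and arc set. The function names and the (head, label, dependent) arc encoding are assumptions for illustration; the definitions mirror left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)}) and right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)}).

```python
def left_arc(stack, buffer, arcs, d):
    """Pop s from the stack; attach it as dependent of the buffer front b."""
    s = stack.pop()
    b = buffer[0]
    arcs.add((b, d, s))          # (head, label, dependent)

def right_arc(stack, buffer, arcs, d):
    """Pop t from the stack; attach it as dependent of the new stack top s."""
    t = stack.pop()
    s = stack[-1]
    arcs.add((s, d, t))

stack, buffer, arcs = [0, 1], [2, 3], set()
left_arc(stack, buffer, arcs, "obj")     # word 1 attached to word 2

stack2, arcs2 = [0, 1, 2], set()
right_arc(stack2, [], arcs2, "nmod")     # word 2 attached to word 1
```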

Omer Kırnap (Koc University) MSc Thesis September 27 2018 85 123

Final overview of Tree-stack LSTM

[Figure: Full Tree-stack LSTM: β-LSTM, σ-LSTM, Action-LSTM, and t-RNN outputs are concatenated and fed to an MLP]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 86 123

Overview

1 Introduction
• Overview of Dependency Parsing
• Transition Based Dependency Parsing

2 Related Work
• Linear Models and their Drawbacks
• Neural Network Models

3 Model
• Language Model
• MLP Parser
• Tree-stack LSTM Parser

4 Results
• MLP vs Tree-stack LSTM
• Morphological Feature Embeddings
• Static vs Dynamic Oracle Training
• Transfer Learning

5 Conclusion

6 Future Work & Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 87 123

4 Results amp Comparisons

Omer Kırnap (Koc University) MSc Thesis September 27 2018 88 123

Results amp Comparisons

Dataset

CoNLL17
• Dependency parsing of 81 treebanks in 49 languages
• All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations
• Koc-University ranked 7th out of 33 participants (1st among transition based parsers)

CoNLL18
• Dependency parsing of 82 treebanks in 57 languages
• All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations
• Koc-University ranked 16th out of 30 participants (2nd among transition based parsers)

Changes from CoNLL17 to CoNLL18:
1 Train/test split change
2 Annotation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 89 123

MLP vs Tree-stack LSTM

CoNLL 2018 committee released comparison results of CoNLL17 andCoNLL18 systems tested under the same test sets

Omer Kırnap (Koc University) MSc Thesis September 27 2018 90 123

MLP vs Tree-stack LSTM

2 possible problems of official comparison

1 If the annotation of the treebank is improved, the older parser is handicapped

2 If the training-test split has changed and the old training data are now in the test data, the old parser is favored undeservedly

Omer Kırnap (Koc University) MSc Thesis September 27 2018 91 123

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code        MLP    Tree-stack
ru taiga (10k)   58.89  60.55
hu szeged (20k)  66.21  68.18
tr imst (50k)    56.78  58.75
ar padt (120k)   67.83  68.14
en ewt (205k)    74.87  75.77
cs cac (473k)    83.39  83.57

Tree-stack LSTM outperforms MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 92 123

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 93 123

MLP Parser

MLP

Figure Initial model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 94 123

Only Action LSTM

LSTM LSTM

Figure Only action LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 95 123

Only β-LSTM

LSTM LSTM LSTM

LSTM LSTM MLP

Figure Only β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 96 123

Only σ-LSTM

LSTM LSTM

LSTM LSTM MLP

Figure Only σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang Code  MLP    Only Action  Only-β  Only-σ
hu szeged  66.21  66.87        66.94   67.03
sv lines   71.12  72.05        72.17   72.45
tr imst    57.12  56.87        57.02   57.12
ar padt    67.83  66.67        66.89   66.92
cs cac     83.89  82.23        83.13   83.17
en ewt     75.54  75.43        75.56   75.67

Table: Comparison between MLP and "Only" models

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 99 123

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang Code           without t-RNN  with t-RNN
no nynorsklia (3k)  51.78          53.33
ru taiga (11k)      59.13          60.55
gl treegal (15k)    69.76          70.45
hu szeged (20k)     66.12          68.18
sv lines (49k)      74.04          75.46
tr imst (50k)       58.12          58.75
ar padt (120k)      68.04          68.14
en ewt (204k)       74.87          75.77
cs cac (473k)       82.89          83.57
cs pdt (1M)         81.17          81.16

t-RNN provides a comparative advantage for low-resource languages

Omer Kırnap (Koc University) MSc Thesis September 27 2018 100 123

Ablation Analysis

Overall results of ablation analysis

Lang       MLP    Only A  Only-β  Only-σ  wo t-RNN  all
hu szeged  66.21  66.87   66.94   67.03   66.12     68.18
sv lines   71.12  72.05   72.17   74.04   72.17     75.46
tr imst    57.12  56.87   57.02   57.12   58.12     58.75
ar padt    67.83  66.67   66.89   66.92   68.04     68.14
cs cac     83.89  82.23   83.13   83.17   82.89     83.57
en ewt     75.54  75.43   75.56   75.67   74.87     75.77

Tree-stack LSTM beats other model variations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of Ablation Experiments

t-RNN's performance contribution increases when the training size decreases

σ-LSTM provides more useful information, independent of dataset size

Interconnecting the model's components with t-RNN makes the tree-stack LSTM more powerful for low-resource languages (ranked 10th overall and 2nd among transition based parsers)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What does Morphological Feature Embedding provide

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental Settings: we divide the CoNLL18 UD dataset into 4 parts based on the number of training tokens for each language, to better understand our contributions

Languages having less than 20k tokens

Languages having more than 20k less than 50k tokens

Languages having more than 50k less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having less than 20k training tokens

Lang code      Morph-Feats  no Morph-Feats  # of tokens
no nynorsklia  51.13        53.33           3583
ru taiga       58.32        60.55           10479
sme giella     52.78        53.39           16385
la perseus     49.93        51.60           18184
ug udt         52.78        53.39           19262
sl sst         46.72        48.77           19473
hu szeged      66.23        68.18           20166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having between 50k and 100k training tokens

Lang code     Morph-Feats  no Morph-Feats  # of tokens
sv lines      72.18        74.81           48325
fr sequoia    84.36        82.17           50543
en gum        76.44        75.34           53686
ko gsd        73.74        72.54           56687
eu bdt        74.55        73.32           72974
nl lassysmall 76.70        75.80           75134
gl ctg        79.02        79.02           79327
lv lvtb       72.33        72.24           80666
id gsd        75.76        73.97           97531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having more than 100k training tokens

Lang code  Morph-Feats  no Morph-Feats  # of tokens
fa seraji  81.18        81.12           121064
bg btb     84.53        84.55           124336
en ewt     75.77        75.68           204585
ar padt    68.02        68.14           223881
de gsd     71.59        71.32           263804
ca ancora  85.89        85.87           417587
es ancora  84.99        84.78           444617
cs cac     83.57        83.63           472608
cs pdt     81.43        82.12           1173282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle: transitions follow gold moves
Dynamic oracle: transitions follow predicted moves

In both cases, log p of the gold moves is maximized

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP
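The difference between the two training regimes can be sketched as a single decision rule. The move names and scores below are illustrative; in both regimes the training loss maximizes log p of the gold move, but only the dynamic oracle lets the parser follow its own (possibly wrong) prediction and learn to recover from it.

```python
def choose_training_move(scores, gold_move, dynamic):
    """Return the transition the parser actually executes during training.

    scores: dict mapping move name -> model score.
    Static oracle (dynamic=False): always follow the gold move.
    Dynamic oracle (dynamic=True): follow the model's own prediction.
    """
    if dynamic:
        return max(scores, key=scores.get)   # model's predicted move
    return gold_move                         # oracle's gold move

scores = {"shift": 0.7, "left-arc": 0.2, "right-arc": 0.1}
```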

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning:
1 Using very limited data to train an LM for word and context vectors, and using them to train a parser from scratch
2 Using Facebook's word vectors to train a parser [Bojanowski et al 2017]
3 Using my own word and context vectors trained on a different language from the same language family
4 Applying transfer learning with a pre-trained parser

Language      (1)           (2)    (3)    (4)
af afribooms  not provided  75.46  77.43  78.12
kk ktb        20.19         22.31  21.96  23.86
bxr bdt       7.64          9.76   9.93   8.98
kmr mg        20.12         22.57  22.78  23.39

Table: LAS values for strategies (1), (2), (3), and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the most beneficial

Training an LM from scratch on very limited data does not yield useful word and context vectors

Our word and context vectors are still more useful than Facebook's [Bojanowski et al 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

A transition based parser can only build projective trees 6

6 Figure from http://stp.lingfil.uu.se/~sara/kurser/5LN455-2014/lectures/5LN455-F8.pdf
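Projectivity itself is easy to test: a tree is projective iff no two arcs cross when drawn above the sentence. A simple O(n²) sketch, assuming `heads[i]` gives the head index of 1-based word i+1, with 0 denoting the root (this encoding is an assumption for illustration):

```python
def is_projective(heads):
    """heads[i] = head index of word i+1 (words are 1-based, 0 = root).
    A dependency tree is projective iff no two arcs cross."""
    arcs = [(min(h, d), max(h, d)) for d, h in enumerate(heads, start=1)]
    for (l1, r1) in arcs:
        for (l2, r2) in arcs:
            if l1 < l2 < r1 < r2:        # strictly interleaved spans cross
                return False
    return True

# heads = [2, 0, 2, 3]: w1->w2, w2->root, w3->w2, w4->w3 (projective)
# heads = [3, 0, 2]: the arc w1-w3 crosses the root arc of w2
```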

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123

Projective vs Non-projective

We compared our model with the best model for different projectivity ratios

Language     Projectivity  Best (LAS)  Our (LAS)
grc perseus  90.7          79.39       55.03 (20)
eu bdt       95.13         84.22       74.13 (17)
hu szeged    97.8          82.66       68.18 (14)
da ddt       98.26         86.28       76.40 (17)
en gum       99.6          85.05       76.44 (15)
gl treegal   100           74.25       70.45 (10)
gl ctg       100           82.12       79.45 (14)

Table: Our model's performance gap decreases as the projectivity ratio increases

7 From the official results page and our projectivity table
Omer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusion: we introduced "Context, Word, and Morph-feat" embeddings and showed their contribution in transition based dependency parsing

Our Tree-stack LSTM outperformed MLP by removing hand-craftedfeature engineering

Tree-stack LSTM performed better with low resource languages

When the training dataset size increases, the tree-stack LSTM loses its advantage

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization, morphological tagging, and dependency parsing performed better. Some also jointly train a language model together with pre-trained embeddings

Attention Mechanism

Applying attention between σ-LSTM, β-LSTM, or Action-LSTM states may bring a performance improvement

Morphological Features

Finding a different way to represent morphological features

Dynamic Oracle vs Beam Training

Although I tried both of them, I could not obtain a performance improvement. There may be convergence problems with our loss function, and other losses (e.g., CRF) may solve this problem

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap, Erenay Dayanık, and Deniz Yuret. 2018. Tree-stack LSTM in Transition Based Dependency Parsing. In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap, Berkay Furkan Onder, and Deniz Yuret. 2017. Parsing with Context Embeddings. In Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann, Carlos Gomez-Rodriguez, and Giorgio Satta. 2011. Dynamic programming algorithms for transition-based dependency parsers. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Volume 1. Association for Computational Linguistics, pages 673-682.

S. Kübler, R. McDonald, and J. Nivre. 2009. Dependency Parsing. Morgan & Claypool, US.

Chris Dyer, Miguel Ballesteros, Wang Ling, Austin Matthews, and Noah A. Smith. 2015. Transition-based dependency parsing with stack long short-term memory. CoRR abs/1505.08075.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123

  • Introduction
    • Overview of Dependency Parsing
    • Transition Based Dependency Parsing
      • Related Work
        • Linear Models and their Drawbacks
        • Neural Network Models
          • Model
            • Language Model
            • MLP Parser
            • Tree-stack LSTM Parser
              • Results
                • MLP vs Tree-stack LSTM
                • Morphological Feature Embeddings
                • Static vs Dynamic Oracle Training
                • Transfer Learning
                  • Conclusion
                  • Future Work amp Discussions
Page 18: Transition Based Dependency Parsing with Deep LearningTransition Based Dependency Parsing with Deep Learning Omer K rnap Ko˘c University okirnap@ku.edu.tr September 27, 2018 Omer

Omer Kırnap (Koc University) MSc Thesis September 27 2018 18 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 19 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 20 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 21 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 22 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 23 123

Problem Definition

Find a model learning to decide correct transition from current state

Omer Kırnap (Koc University) MSc Thesis September 27 2018 24 123

2 Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 25 123

Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 26 123

Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 27 123

Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 28 123

Related Work

Neural Networks for Feature Conjunctions

Neural networks can handle feature conjunctions and nonlinearityHoweverImpractical for high dimensional inputs they scale linearly in inputdimensions (in both time and space assuming fixed number of hiddenunits)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 29 123

Related Work

Solution Using dense embeddings for input features

Omer Kırnap (Koc University) MSc Thesis September 27 2018 30 123

Overview

1 IntroductionOverview of Dependency ParsingTransition Based Dependency Parsing

2 Related WorkLinear Models and their DrawbacksNeural Network Models

3 ModelLanguage ModelMLP ParserTree-stack LSTM Parser

4 ResultsMLP vs Tree-stack LSTMMorphological Feature EmbeddingsStatic vs Dynamic Oracle TrainingTransfer Learning

5 Conclusion6 Future Work amp Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 31 123

3 Model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 32 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to UniversalDependencies

CoNLL17bull Koc-University team with MLP Parser using Context Embeddings

CoNLL18bull KParse team with Tree-stack LSTM Parser using Context and

Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 33 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to UniversalDependencies

CoNLL17

bull Koc-University team with MLP Parser using ContextEmbeddings

CoNLL18

bull KParse team with Tree-stack LSTM Parser using Context and Morph-featEmbeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 34 123

a Language Model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 35 123

Language Model (LM)

LM is used to obtain Context and Word embeddings with twocomponents

Character Based LSTM extracts word vectors

Word Based BiLSTM extracts context vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 36 123

Language Model - Word vectors

Character based LSTM generates word Vectors

Figure Character LSTM from Kırnap et al 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 37 123

Language Model - Context Vectors

Word based BiLSTM generates Context Vectors

Figure Word BiLSTM from Kırnap et al 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 38 123

b MLP Parser (CoNLL17)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 39 123

MLP Parser

MLP Parser consists of 4 components

Character Based LSTM extracts word vectors

Word Based BiLSTM extracts context vectors

Feature extractor describes current state

Decision module (MLP) decides the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 40 123

MLP Parser - Feature Extraction

Feature extractor describes current state

Figure Kırnap et al 2017Omer Kırnap (Koc University) MSc Thesis September 27 2018 41 123

MLP Parser - Decision Module

Decision module (MLP) decides the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 42 123

Experiments amp Dataset (MLP) CoNLL17

CoNLL17 Dataset

Dependency parsing of 81 treebanks in 49 languages

All treebanks use standardized annotationbull 17 universal part-of-speech tagsbull 37 universal dependency relations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 43 123

Experiments - Evaluation Metric

Labeled Attachment Score (LAS)The percentage of words correctly assigned both the correct syntactic headand the correct dependency label

Economic news hadGold Tree LAS 1

SBJATT

Economic news had

PREDOBJPred 1 LAS 0

Economic news had

OBJATTPred 2 LAS (frac12)100

Omer Kırnap (Koc University) MSc Thesis September 27 2018 44 123

Experiments (MLP)

CoNLL 2017 Results (all treebanks LAS)

Ranked 1st among transition based parsers 5

5Source CoNLL17 official results pageOmer Kırnap (Koc University) MSc Thesis September 27 2018 45 123

Contributions in CoNLL17

Omer Kırnap (Koc University) MSc Thesis September 27 2018 46 123

Context and Word Embeddings

Relative contributions of part-of-speech (p) word vector (v)context vector (c)

Feats Hungarian En-ParTUT Latvianp 636 766 559

v 735 759 63

c 722 76 635

v-c 76 79 676

p-c 78 825 706

p-v 766 808 677

p-fb 747 797 663

p-v-c 793 832 742

Omer Kırnap (Koc University) MSc Thesis September 27 2018 47 123

Context and Word Embeddings

Relative contributions of part-of-speech (p) word vector (v)context vector (c)

Feats Hungarian En-ParTUT Latvianp 636 766 559

v 735 759 63

c 722 76 635

v-c 76 79 676

p-c 78 825 706

p-v 766 808 677

p-fb 747 797 663

p-v-c 793 832 742

Context vectors provide independent contribution on top ofPOS tags

Omer Kırnap (Koc University) MSc Thesis September 27 2018 48 123

Context and Word embeddings

Feats Hungarian En-ParTUT Latvianp 636 766 559

v 735 759 63

c 722 76 635

v-c 76 79 676

p-c 78 825 706

p-v 766 808 677p-fb 747 797 663

p-v-c 793 832 742

Our BiLSTM language model word vectors perform betterthan FB vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 49 123

Context and Word embeddings

Feats Hungarian En-ParTUT Latvianp 636 766 559

v 735 759 63

c 722 76 635

v-c 76 79 676

p-c 78 825 706

p-v 766 808 677

p-fb 747 797 663

p-v-c 793 832 742

Both POS tags and context vectors have significantcontributions on top of word vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 50 123

Issues with MLP

However

Choosing correct state of parser still remains critical

We are unable to represent whole parsing history with featureextracting

Omer Kırnap (Koc University) MSc Thesis September 27 2018 51 123

Solution

Find a recurrent architecture such that it can summarize the parsinghistory as well as word sequences in a buffer and stack

Omer Kırnap (Koc University) MSc Thesis September 27 2018 52 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to UniversalDependencies

CoNLL17

bull Koc-University team with MLP Parser using Context Embeddings

CoNLL18

bull KParse team with Tree-stack LSTM Parser usingContext and Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 53 123

c Tree-stack LSTM Parser (CoNLL18)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 54 123

Related Work - Stack LSTM

Figure Stack LSTM [Dyer et al 2015]

Represent each component (σ β A) with an LSTMModifying head wordrsquos embedding with dependent embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 55 123

Problems with Stack LSTM

They only modify stackrsquos word embeddings

Hidden states of LSTMS are not updated unless reduce

Actions are not explicitly represented

They only used word2vec embeddings [Mikolov et al 2013]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 56 123

Our solution

We propose

Context embeddings should improve parsing accuracy

Dependency relations should be explicitly represented

Morphological Features of a word may enhance parsing accuracy

Omer Kırnap (Koc University) MSc Thesis September 27 2018 57 123

Tree-stack LSTM Overview

Figure: Tree-stack LSTM overview (t-RNN over head word, dependent word, and dependency relation; buffer, stack, and action LSTMs; outputs concatenated and fed to an MLP)

We propose the Tree-stack LSTM model with 4 components:
1 β-LSTM
2 σ-LSTM
3 Action-LSTM
4 Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 58 123

Tree-stack LSTM

Input Representation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 59 123

Input Representation

Action and Dependency Relation Embeddings

Every action is represented with a continuous vector

Every dependency relation is represented with a continuous vector

Omer Kırnap (Koc University) MSc Thesis September 27 2018 60 123

Input Representation

We do not include an explicit feature extractor. We initialize the word representation by concatenating:

Character-based LSTM's word vectors

Word-based BiLSTM's context vectors

Part-of-speech (POS) vectors

Morph-feat vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 61 123
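This concatenation can be sketched with plain Python lists standing in for the four embedding sources; the dimensions below are illustrative, not the thesis's actual hyperparameters:

```python
def word_representation(word_vec, ctx_vec, pos_vec, feat_vec):
    """Concatenate the four embedding sources into one input vector.

    word_vec: character-based LSTM word vector
    ctx_vec:  word-based BiLSTM context vector
    pos_vec:  POS-tag embedding
    feat_vec: morphological feature embedding
    """
    return word_vec + ctx_vec + pos_vec + feat_vec

# Toy vectors with made-up sizes; real sizes are model hyperparameters.
x = word_representation([0.1] * 4, [0.2] * 6, [0.3] * 2, [0.4] * 2)
```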

Input Representation

Morph-feat Vectors

Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs IT It

Figure: Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 62 123
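One way to read the figure above: the CoNLL-U FEATS string is split into attribute=value pairs, each pair is looked up in a learned embedding table, and the pair vectors are combined into a single morph-feat vector. The sketch below assumes summation and a hand-set toy table; the actual combination and dimensions in the thesis may differ:

```python
def parse_feats(feats):
    """Split a CoNLL-U FEATS string into (attribute, value) pairs."""
    if feats == "_":          # CoNLL-U uses "_" for "no features"
        return []
    return [tuple(kv.split("=", 1)) for kv in feats.split("|")]

# Toy table; real pair embeddings are learned during training.
EMB = {("Case", "Nom"): [1.0, 0.0], ("Number", "Sing"): [0.0, 1.0]}

def morph_feat_vector(feats, dim=2):
    """Sum the vectors of all known pairs; unknown pairs contribute zeros."""
    vec = [0.0] * dim
    for pair in parse_feats(feats):
        for i, v in enumerate(EMB.get(pair, [0.0] * dim)):
            vec[i] += v
    return vec
```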

Tree-stack LSTM

Model Components:
1 β-LSTM
2 σ-LSTM
3 Action-LSTM
4 Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 63 123

β-LSTM

Figure: Tree-stack LSTM overview with the β-LSTM highlighted

Omer Kırnap (Koc University) MSc Thesis September 27 2018 64 123

β-LSTM

Figure: Buffer's β-LSTM, reading word representations w_i, w_i+1, w_i+2

Omer Kırnap (Koc University) MSc Thesis September 27 2018 65 123

σ-LSTM

Figure: Tree-stack LSTM overview with the σ-LSTM highlighted

Omer Kırnap (Koc University) MSc Thesis September 27 2018 66 123

σ-LSTM

Figure: Stack's σ-LSTM, reading stack items s_i, s_i+1, s_i+2

Omer Kırnap (Koc University) MSc Thesis September 27 2018 67 123

Action-LSTM

Figure: Tree-stack LSTM overview with the Action-LSTM highlighted

Omer Kırnap (Koc University) MSc Thesis September 27 2018 68 123

Action-LSTM

Figure: Action-LSTM, reading past transitions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 69 123

How are the components of the tree-stack LSTM connected?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 70 123

Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 71 123

Tree-RNN (t-RNN)

Figure: t-RNN, combining the head word, dependency relation, and dependent word embeddings

w_head_new = tanh(W_rnn · [w_head_old; d_l; w_dep] + b_rnn)    (1)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 72 123
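Equation (1) in plain Python: the old head vector, the relation vector, and the dependent vector are concatenated, passed through a linear map, and squashed with tanh. The weights W and b below are toy values, not learned parameters:

```python
import math

def trnn_update(w_head, d_rel, w_dep, W, b):
    """w_head_new = tanh(W_rnn . [w_head_old; d_l; w_dep] + b_rnn)  -- Eq. (1)."""
    x = w_head + d_rel + w_dep                      # concatenation
    return [math.tanh(sum(W[i][j] * x[j] for j in range(len(x))) + b[i])
            for i in range(len(b))]

# 1-dimensional toy example: W is 1x3, b has length 1.
new_head = trnn_update([0.5], [0.0], [0.5], [[1.0, 1.0, 1.0]], [0.0])
```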

Tree-RNN with:
1 Left Transition
2 Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 73 123

Left Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 74 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: Each embedding is initiated by concatenating POS, language, and morph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 75 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: Stack's top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 76 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: t-RNN calculates the new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 77 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: β-LSTM recalculates its hidden state based on the new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 78 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: Tree-stack LSTM is ready for the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 79 123

Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 80 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: Each embedding is initiated by concatenating POS, language, and morph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 81 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: Stack's top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 82 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: t-RNN calculates the new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 83 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: σ-LSTM recalculates its hidden state from the new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 84 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: Tree-stack LSTM is ready for the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 85 123
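The two transition rules above can be sketched directly on a (stack, buffer, arcs) triple, where an arc (h, d, m) records that word m attaches to head h with label d. This is a sketch of the transition system itself, not the thesis implementation:

```python
def left_arc(stack, buffer, arcs, d):
    """left_d(sigma|s, b|beta, A) = (sigma, b|beta, A + {(b, d, s)}):
    pop stack top s and attach it to buffer front b."""
    s, b = stack[-1], buffer[0]
    return stack[:-1], buffer, arcs | {(b, d, s)}

def right_arc(stack, buffer, arcs, d):
    """right_d(sigma|s|t, beta, A) = (sigma|s, beta, A + {(s, d, t)}):
    pop stack top t and attach it to the element s below it."""
    s, t = stack[-2], stack[-1]
    return stack[:-1], buffer, arcs | {(s, d, t)}
```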

Final overview of Tree-stack LSTM

Figure: Final Tree-stack LSTM (t-RNN, β-, σ-, and Action-LSTMs; outputs concatenated and fed to an MLP)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 86 123

Overview

1 Introduction: Overview of Dependency Parsing; Transition Based Dependency Parsing

2 Related Work: Linear Models and their Drawbacks; Neural Network Models

3 Model: Language Model; MLP Parser; Tree-stack LSTM Parser

4 Results: MLP vs Tree-stack LSTM; Morphological Feature Embeddings; Static vs Dynamic Oracle Training; Transfer Learning

5 Conclusion
6 Future Work & Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 87 123

4 Results & Comparisons

Omer Kırnap (Koc University) MSc Thesis September 27 2018 88 123

Results & Comparisons

Dataset

CoNLL17:
• Dependency parsing of 81 treebanks in 49 languages
• All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations
• Koc-University ranked 7th out of 33 participants (1st among transition based parsers)

CoNLL18:
• Dependency parsing of 82 treebanks in 57 languages
• All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations
• Koc-University ranked 16th out of 30 participants (2nd among transition based parsers)

Changes from CoNLL17 to CoNLL18: 1 Train/test split change 2 Annotation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 89 123

MLP vs Tree-stack LSTM

The CoNLL 2018 committee released comparison results of CoNLL17 and CoNLL18 systems tested under the same test sets

Omer Kırnap (Koc University) MSc Thesis September 27 2018 90 123

MLP vs Tree-stack LSTM

2 possible problems of the official comparison:

1 If the annotation of the treebank is improved, the older parser is handicapped

2 If the training-test split has changed and the old training data are now in the test data, the old parser is favored undeservedly

Omer Kırnap (Koc University) MSc Thesis September 27 2018 91 123

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code        MLP    Tree-stack
ru taiga (10k)   58.89  60.55
hu szeged (20k)  66.21  68.18
tr imst (50k)    56.78  58.75
ar padt (120k)   67.83  68.14
en ewt (205k)    74.87  75.77
cs cac (473k)    83.39  83.57

Tree-stack LSTM outperforms MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 92 123

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 93 123

MLP Parser

Figure: Initial model (MLP only)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 94 123

Only Action LSTM

Figure: Only Action-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 95 123

Only β-LSTM

Figure: Only β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 96 123

Only σ-LSTM

Figure: Only σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang Code  MLP    Only Action  Only-β  Only-σ
hu szeged  66.21  66.87        66.94   67.03
sv lines   71.12  72.05        72.17   72.45
tr imst    57.12  56.87        57.02   57.12
ar padt    67.83  66.67        66.89   66.92
cs cac     83.89  82.23        83.13   83.17
en ewt     75.54  75.43        75.56   75.67

Table: Comparison between MLP and "Only" models

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN

Figure: Tree-stack LSTM overview with the t-RNN highlighted

Omer Kırnap (Koc University) MSc Thesis September 27 2018 99 123

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang Code           without t-RNN  with t-RNN
no nynorsklia (3k)  51.78          53.33
ru taiga (11k)      59.13          60.55
gl treegal (15k)    69.76          70.45
hu szeged (20k)     66.12          68.18
sv lines (49k)      74.04          75.46
tr imst (50k)       58.12          58.75
ar padt (120k)      68.04          68.14
en ewt (204k)       74.87          75.77
cs cac (473k)       82.89          83.57
cs pdt (1M)         81.17          81.164

t-RNN provides a comparative advantage for low-resource languages

Omer Kırnap (Koc University) MSc Thesis September 27 2018 100 123

Ablation Analysis

Overall results of ablation analysis

Lang       MLP    Only A  Only-β  Only-σ  w/o t-RNN  all
hu szeged  66.21  66.87   66.94   67.03   66.12      68.18
sv lines   71.12  72.05   72.17   74.04   72.17      75.46
tr imst    57.12  56.87   57.02   57.12   58.12      58.75
ar padt    67.83  66.67   66.89   66.92   68.04      68.14
cs cac     83.89  82.23   83.13   83.17   82.89      83.57
en ewt     75.54  75.43   75.56   75.67   74.87      75.77

Tree-stack LSTM beats other model variations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of Ablation Experiments

t-RNN's performance contribution increases when the training size decreases

σ-LSTM provides more useful information, independent of dataset size

Interconnecting the model's components with the t-RNN makes the tree-stack LSTM more powerful for low-resource languages (ranked 10th overall and 2nd among transition based parsers)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What does Morphological Feature Embedding provide

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental Settings
We divide the CoNLL18 UD 2.2 dataset into 4 parts based on the number of training tokens for each language, to better understand our contributions:

Languages having less than 20k tokens

Languages having more than 20k, less than 50k tokens

Languages having more than 50k, less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having less than 20k training tokens

Lang code      Morph-Feats  no Morph-Feats  # of tokens
no nynorsklia  51.13        53.33           3583
ru taiga       58.32        60.55           10479
sme giella     52.78        53.39           16385
la perseus     49.93        51.6            18184
ug udt         52.78        53.39           19262
sl sst         46.72        48.77           19473
hu szeged      66.23        68.18           20166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having between 50k and 100k training tokens

Lang code      Morph-Feats  no Morph-Feats  # of tokens
sv lines       72.18        74.81           48325
fr sequoia     84.36        82.17           50543
en gum         76.44        75.34           53686
ko gsd         73.74        72.54           56687
eu bdt         74.55        73.32           72974
nl lassysmall  76.7         75.8            75134
gl ctg         79.02        79.018          79327
lv lvtb        72.33        72.24           80666
id gsd         75.76        73.97           97531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having more than 100k training tokens

Lang code  Morph-Feats  no Morph-Feats  # of tokens
fa seraji  81.18        81.12           121064
bg btb     84.53        84.55           124336
en ewt     75.77        75.682          204585
ar padt    68.02        68.14           223881
de gsd     71.59        71.32           263804
ca ancora  85.89        85.874          417587
es ancora  84.99        84.78           444617
cs cac     83.57        83.63           472608
cs pdt     81.43        82.12           1173282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle: transitions using gold moves
Dynamic oracle: transitions using predicted moves

In both cases, log p of the gold moves is maximized

Figure: Tree-stack LSTM overview

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123
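The difference between the two training regimes fits in a few lines. In this sketch, `scores` is a hypothetical move-to-score mapping produced by the model; only the move used to advance the parser differs, while the loss maximizes log p of the gold move in both cases:

```python
def next_training_move(scores, gold_move, dynamic):
    """Pick the move used to advance the parser during training.

    Static oracle: always follow the gold move.
    Dynamic oracle: follow the model's best-scoring move, exposing
    training to the parser's own mistakes.
    """
    predicted = max(scores, key=scores.get)
    return predicted if dynamic else gold_move
```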

Static vs Dynamic Oracle Training

Figure: Results are very close for languages with less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure: Results are very close for languages with between 20k and 50k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure: Results are very close for languages with more than 50k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning:

1 Using very limited data to train an LM for word and context vectors, and using them to train a parser from scratch
2 Using Facebook's word vectors to train a parser [Bojanowski et al., 2017]
3 Using my own word and context vectors trained on a different language from the same language family
4 Applying transfer learning with a pre-trained parser

Language      (1)           (2)    (3)    (4)
af afribooms  not provided  75.46  77.43  78.12
kk ktb        20.19         22.31  21.96  23.86
bxr bdt       7.64          9.76   9.93   8.98
kmr mg        20.12         22.57  22.78  23.39

Table: LAS values for strategies (1), (2), (3), and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the most beneficial

Training the LM from scratch on very limited data does not yield useful word and context vectors

Our word and context vectors are still more useful than Facebook's [Bojanowski et al., 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

A transition based parser can only build projective trees 6

6 Figure from https://stp.lingfil.uu.se/~sara/kurser/5LN455-2014/lectures/5LN455-F8.pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123
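Projectivity can be checked by looking for crossing arcs. A toy checker, where `heads[i-1]` gives the head of 1-based word i (0 meaning root), written O(n²) for clarity:

```python
def is_projective(heads):
    """Return True iff no two dependency arcs cross."""
    spans = [(min(h, d), max(h, d)) for d, h in enumerate(heads, start=1)]
    for l1, r1 in spans:
        for l2, r2 in spans:
            if l1 < l2 < r1 < r2:   # arcs (l1, r1) and (l2, r2) cross
                return False
    return True
```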

Projective vs Non-projective

We compared our model with the best model for different projectivity ratios

Language     Projectivity  Best (LAS)  Our (LAS)
grc perseus  90.7          79.39       55.03 (20)
eu bdt       95.13         84.22       74.13 (17)
hu szeged    97.8          82.66       68.18 (14)
da ddt       98.26         86.28       76.40 (17)
en gum       99.6          85.05       76.44 (15)
gl treegal   100           74.25       70.45 (10)
gl ctg       100           82.12       79.45 (14)

Table: Our model's performance gap decreases as the projectivity ratio increases

7 From the official results page and our projectivity table
Omer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusion:
We introduced "Context, Word, and Morph-feat" embeddings and showed their contribution in transition based dependency parsing

Our Tree-stack LSTM outperformed the MLP while removing hand-crafted feature engineering

Tree-stack LSTM performed better on low-resource languages

As the training dataset size increases, the tree-stack LSTM loses its advantage

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization, morphological tagging, and dependency parsing performed better. Some also jointly train a language model together with pre-trained embeddings.

Attention Mechanism

Applying attention over the σ-LSTM, β-LSTM, or Action-LSTM states may bring a performance improvement

Morphological Features

Finding a different way to represent morphological features

Dynamic Oracle vs Beam Training

Although I tried both of them, I could not obtain a performance improvement. There may be convergence problems with our loss function, and other losses (e.g., CRF) may solve this problem.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Ömer Kırnap, Erenay Dayanık, and Deniz Yuret. 2018. Tree-stack LSTM in Transition Based Dependency Parsing. In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Ömer Kırnap, Berkay Furkan Önder, and Deniz Yuret. 2017. Parsing with Context Embeddings. In Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann, Carlos Gómez-Rodríguez, and Giorgio Satta. 2011. Dynamic programming algorithms for transition-based dependency parsers. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies - Volume 1, Association for Computational Linguistics, pages 673-682.

S. Kübler, R. McDonald, and J. Nivre. 2009. Dependency Parsing. Morgan & Claypool, US.

Chris Dyer, Miguel Ballesteros, Wang Ling, Austin Matthews, and Noah A. Smith. 2015. Transition-based dependency parsing with stack long short-term memory. CoRR abs/1505.08075.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123

Page 19: Transition Based Dependency Parsing with Deep LearningTransition Based Dependency Parsing with Deep Learning Omer K rnap Ko˘c University okirnap@ku.edu.tr September 27, 2018 Omer

Omer Kırnap (Koc University) MSc Thesis September 27 2018 19 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 20 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 21 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 22 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 23 123

Problem Definition

Find a model that learns to decide the correct transition from the current state

Omer Kırnap (Koc University) MSc Thesis September 27 2018 24 123

2 Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 25 123

Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 26 123

Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 27 123

Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 28 123

Related Work

Neural Networks for Feature Conjunctions

Neural networks can handle feature conjunctions and nonlinearity. However, they are impractical for high-dimensional inputs: they scale linearly in input dimensions (in both time and space, assuming a fixed number of hidden units).

Omer Kırnap (Koc University) MSc Thesis September 27 2018 29 123

Related Work

Solution: Using dense embeddings for input features

Omer Kırnap (Koc University) MSc Thesis September 27 2018 30 123

Overview

1 Introduction: Overview of Dependency Parsing; Transition Based Dependency Parsing

2 Related Work: Linear Models and their Drawbacks; Neural Network Models

3 Model: Language Model; MLP Parser; Tree-stack LSTM Parser

4 Results: MLP vs Tree-stack LSTM; Morphological Feature Embeddings; Static vs Dynamic Oracle Training; Transfer Learning

5 Conclusion
6 Future Work & Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 31 123

3 Model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 32 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to Universal Dependencies

CoNLL17
• Koc-University team with MLP Parser using Context Embeddings

CoNLL18
• KParse team with Tree-stack LSTM Parser using Context and Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 33 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to Universal Dependencies

CoNLL17

• Koc-University team with MLP Parser using Context Embeddings

CoNLL18

• KParse team with Tree-stack LSTM Parser using Context and Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 34 123

a Language Model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 35 123

Language Model (LM)

The LM is used to obtain Context and Word embeddings, with two components:

Character Based LSTM extracts word vectors

Word Based BiLSTM extracts context vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 36 123

Language Model - Word vectors

Character-based LSTM generates word vectors

Figure: Character LSTM, from Kırnap et al., 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 37 123

Language Model - Context Vectors

Word-based BiLSTM generates context vectors

Figure: Word BiLSTM, from Kırnap et al., 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 38 123
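The BiLSTM idea in miniature: run a recurrence over the word vectors left-to-right and right-to-left, then pair the two hidden states at each position as that word's context vector. Here a stand-in `step` function replaces the LSTM cell and scalar "vectors" keep the sketch short; the real model uses multi-dimensional states:

```python
def context_vectors(word_vecs, step, h0=0.0):
    """Toy bidirectional recurrence: (forward state, backward state) per word."""
    n = len(word_vecs)
    fwd, h = [], h0
    for v in word_vecs:                 # left-to-right pass
        h = step(h, v)
        fwd.append(h)
    bwd, h = [h0] * n, h0
    for i in range(n - 1, -1, -1):      # right-to-left pass
        h = step(h, word_vecs[i])
        bwd[i] = h
    return list(zip(fwd, bwd))
```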

b MLP Parser (CoNLL17)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 39 123

MLP Parser

MLP Parser consists of 4 components

Character Based LSTM extracts word vectors

Word Based BiLSTM extracts context vectors

Feature extractor describes current state

Decision module (MLP) decides the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 40 123

MLP Parser - Feature Extraction

Feature extractor describes the current state

Figure: Kırnap et al., 2017
Omer Kırnap (Koc University) MSc Thesis September 27 2018 41 123

MLP Parser - Decision Module

Decision module (MLP) decides the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 42 123

Experiments & Dataset (MLP) CoNLL17

CoNLL17 Dataset

Dependency parsing of 81 treebanks in 49 languages

All treebanks use standardized annotation:
• 17 universal part-of-speech tags
• 37 universal dependency relations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 43 123

Experiments - Evaluation Metric

Labeled Attachment Score (LAS)
The percentage of words correctly assigned both the correct syntactic head and the correct dependency label

Example: "Economic news had"
Gold tree (arcs ATT, SBJ): LAS = 100
Pred 1 (arcs OBJ, PRED): LAS = 0
Pred 2 (arcs ATT, OBJ): LAS = (1/2) · 100 = 50

Omer Kırnap (Koc University) MSc Thesis September 27 2018 44 123
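The metric as code, scoring (head, label) pairs per word; the gold pairs below encode the slide's "Economic news had" example:

```python
def las(gold, pred):
    """Labeled Attachment Score: % of words whose predicted head AND
    dependency label both match the gold tree."""
    assert len(gold) == len(pred) and gold
    correct = sum(g == p for g, p in zip(gold, pred))
    return 100.0 * correct / len(gold)

# (head index, label) for the two dependents of "Economic news had".
gold = [(2, "ATT"), (3, "SBJ")]
```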

Experiments (MLP)

CoNLL 2017 Results (all treebanks LAS)

Ranked 1st among transition based parsers 5

5Source CoNLL17 official results pageOmer Kırnap (Koc University) MSc Thesis September 27 2018 45 123

Contributions in CoNLL17

Omer Kırnap (Koc University) MSc Thesis September 27 2018 46 123

Context and Word Embeddings

Relative contributions of part-of-speech (p), word vector (v), and context vector (c)

Feats  Hungarian  En-ParTUT  Latvian
p      63.6       76.6       55.9
v      73.5       75.9       63
c      72.2       76         63.5
v-c    76         79         67.6
p-c    78         82.5       70.6
p-v    76.6       80.8       67.7
p-fb   74.7       79.7       66.3
p-v-c  79.3       83.2       74.2

Omer Kırnap (Koc University) MSc Thesis September 27 2018 47 123

Context and Word Embeddings

Relative contributions of part-of-speech (p), word vector (v), and context vector (c)

Feats  Hungarian  En-ParTUT  Latvian
p      63.6       76.6       55.9
v      73.5       75.9       63
c      72.2       76         63.5
v-c    76         79         67.6
p-c    78         82.5       70.6
p-v    76.6       80.8       67.7
p-fb   74.7       79.7       66.3
p-v-c  79.3       83.2       74.2

Context vectors provide an independent contribution on top of POS tags

Omer Kırnap (Koc University) MSc Thesis September 27 2018 48 123

Context and Word embeddings

Feats Hungarian En-ParTUT Latvianp 636 766 559

v 735 759 63

c 722 76 635

v-c 76 79 676

p-c 78 825 706

p-v 766 808 677p-fb 747 797 663

p-v-c 793 832 742

Our BiLSTM language model word vectors perform betterthan FB vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 49 123

Context and Word embeddings

Feats Hungarian En-ParTUT Latvianp 636 766 559

v 735 759 63

c 722 76 635

v-c 76 79 676

p-c 78 825 706

p-v 766 808 677

p-fb 747 797 663

p-v-c 793 832 742

Both POS tags and context vectors have significantcontributions on top of word vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 50 123

Issues with MLP

However

Choosing correct state of parser still remains critical

We are unable to represent whole parsing history with featureextracting

Omer Kırnap (Koc University) MSc Thesis September 27 2018 51 123

Solution

Find a recurrent architecture such that it can summarize the parsinghistory as well as word sequences in a buffer and stack

Omer Kırnap (Koc University) MSc Thesis September 27 2018 52 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to UniversalDependencies

CoNLL17

bull Koc-University team with MLP Parser using Context Embeddings

CoNLL18

bull KParse team with Tree-stack LSTM Parser usingContext and Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 53 123

c Tree-stack LSTM Parser (CoNLL18)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 54 123

Related Work - Stack LSTM

Figure Stack LSTM [Dyer et al 2015]

Represent each component (σ β A) with an LSTMModifying head wordrsquos embedding with dependent embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 55 123

Problems with Stack LSTM

They only modify stackrsquos word embeddings

Hidden states of LSTMS are not updated unless reduce

Actions are not explicitly represented

They only used word2vec embeddings [Mikolov et al 2013]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 56 123

Our solution

We propose

Context embeddings should improve parsing accuracy

Dependency relations should be explicitly represented

Morphological Features of a word may enhance parsing accuracy

Omer Kırnap (Koc University) MSc Thesis September 27 2018 57 123

Tree-stack LSTM Overview

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

We propose Tree-stack LSTM model with 4 components

β-LSTMσ-LSTMAction-LSTMTree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 58 123

Tree-stack LSTM

Input Representation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 59 123

Input Representation

Action and Dependency Relation Embeddings

Every action is represented with continuous vector

Every dependency relation is represented with continuous vector

Omer Kırnap (Koc University) MSc Thesis September 27 2018 60 123

Input Representation

We do not include explicit feature extractor We initiated wordrepresentation by concatenating

Character Based LSTMrsquos word vectors

Word Based BiLSTMrsquos context vectors

Part-of-speech (POS) vectors

Morph-feat vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 61 123

Input Representation

Morp-feat Vectors

Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs IT It

Figure Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 62 123

Tree-stack LSTM

Model Components1 β-LSTM2 σ-LSTM3 Action-LSTM4 Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 63 123

β-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 64 123

β-LSTM

LSTM LSTM LSTM

wi+2wi+1wi

Figure Bufferrsquos β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 65 123

σ-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 66 123

σ-LSTM

LSTM LSTM LSTM

si si+1 si+2

Figure Stackrsquos σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 67 123

Action-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 68 123

Action-LSTM

LSTM LSTM LSTM

Figure Action-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 69 123

How do components of tree-stack LSTM are connected

Omer Kırnap (Koc University) MSc Thesis September 27 2018 70 123

Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 71 123

Tree-RNN (t-RNN)

t-RNN

Dependent word

Dependency Relation

Head word

Figure t-RNN

whead new = tanh(Wrnn lowast [whead old dl wdep] + brnn) (1)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 72 123

Tree-RNN with

1 Left Transition2 Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 73 123

Left Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 74 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM LSTM

Left transition

t-RNN

Dependency Relation

HeadDependent

Figure Each embedding initiated by concatenating POS language andmorph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 75 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

Figure Stackrsquos top LSTM is reducedOmer Kırnap (Koc University) MSc Thesis September 27 2018 76 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 77 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure β-LSTM recalculates its hidden based on new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 78 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure Tree-stack LSTM is ready to give new transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 79 123

Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 80 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure Each embedding initiated by concatenating POS language andmorph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 81 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure Stackrsquos top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 82 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 83 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure σ-LSTM recalculates its hidden from new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 84 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: Tree-stack LSTM is ready to predict the next transition.
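The left and right transitions above can be sketched as operations on a parser state. This is an illustrative sketch, not the thesis code: the function names and the (stack, buffer, arcs) representation are assumptions.

```python
# Illustrative sketch of the transitions above (not the thesis code).
# State: stack (list, top at the end), buffer (list, front at index 0),
# arcs (set of (head, deprel, dependent) triples).

def left_arc(stack, buffer, arcs, d):
    # left_d(sigma|s, b|beta, A) => (sigma, b|beta, A ∪ {(b, d, s)}):
    # pop stack top s, attach it as a d-dependent of buffer front b.
    s = stack.pop()
    b = buffer[0]
    arcs.add((b, d, s))

def right_arc(stack, arcs, d):
    # right_d(sigma|s|t, beta, A) => (sigma|s, beta, A ∪ {(s, d, t)}):
    # pop stack top t, attach it as a d-dependent of the new top s.
    t = stack.pop()
    s = stack[-1]
    arcs.add((s, d, t))

def shift(stack, buffer):
    # Move the buffer front onto the stack.
    stack.append(buffer.pop(0))
```

For example, with stack [1] and buffer [2, 3], a left arc attaches token 1 to token 2 and pops the stack.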

Final overview of Tree-stack LSTM

Figure: Full model. The σ-LSTM, β-LSTM, and Action-LSTM outputs are concatenated and fed to an MLP; the t-RNN composes the head word, dependent word, and dependency relation embeddings.


4 Results & Comparisons

Results & Comparisons

Dataset

CoNLL17: Dependency parsing of 81 treebanks in 49 languages. All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations. Koc-University ranked 7th out of 33 participants (1st among transition based parsers).

CoNLL18: Dependency parsing of 82 treebanks in 57 languages. All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations. Koc-University ranked 16th out of 30 participants (2nd among transition based parsers).

Differences between the two editions: 1. Train/test split changes 2. Annotation changes

MLP vs Tree-stack LSTM

The CoNLL 2018 committee released comparison results of CoNLL17 and CoNLL18 systems tested on the same test sets.

MLP vs Tree-stack LSTM

2 possible problems with the official comparison:

1 If the annotation of the treebank has been improved, the older parser is handicapped.

2 If the training-test split has changed and the old training data are now in the test data, the old parser is favored undeservedly.

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models:

Lang Code        MLP    Tree-stack
ru taiga (10k)   5889   6055
hu szeged (20k)  6621   6818
tr imst (50k)    5678   5875
ar padt (120k)   6783   6814
en ewt (205k)    7487   7577
cs cac (473k)    8339   8357

Tree-stack LSTM outperforms MLP.

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM

MLP Parser

Figure: Initial model.

Only Action LSTM

Figure: Only action LSTM.

Only β-LSTM

Figure: Only β-LSTM.

Only σ-LSTM

Figure: Only σ-LSTM.

Ablation Analysis Results

Lang Code   MLP    Only Action  Only-β  Only-σ
hu szeged   6621   6687         6694    6703
sv lines    7112   7205         7217    7245
tr imst     5712   5687         5702    5712
ar padt     6783   6667         6689    6692
cs cac      8389   8223         8313    8317
en ewt      7554   7543         7556    7567

Table: Comparison between MLP and "Only" models.

Ablation of t-RNN

Figure: Tree-stack LSTM architecture (the t-RNN component under ablation).

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN:

Lang Code            without t-RNN  with t-RNN
no nynorsklia (3k)   5178           5333
ru taiga (11k)       5913           6055
gl treegal (15k)     6976           7045
hu szeged (20k)      6612           6818
sv lines (49k)       7404           7546
tr imst (50k)        5812           5875
ar padt (120k)       6804           6814
en ewt (204k)        7487           7577
cs cac (473k)        8289           8357
cs pdt (1M)          8117           81164

t-RNN provides a comparative advantage for low-resource languages.

Ablation Analysis

Overall results of ablation analysis:

Lang       MLP    Only A  Only-β  Only-σ  w/o t-RNN  all
hu szeged  6621   6687    6694    6703    6612       6818
sv lines   7112   7205    7217    7404    7217       7546
tr imst    5712   5687    5702    5712    5812       5875
ar padt    6783   6667    6689    6692    6804       6814
cs cac     8389   8223    8313    8317    8289       8357
en ewt     7554   7543    7556    7567    7487       7577

Tree-stack LSTM beats the other model variations.

Ablation Analysis

Conclusions of Ablation Experiments

t-RNN's performance contribution increases as the training size decreases.

σ-LSTM provides more useful information independent of dataset size.

Interconnecting the model's components with t-RNN makes tree-stack LSTM more powerful for low-resource languages (ranked 10th overall and 2nd among transition based parsers).

What does Morphological Feature Embedding provide?

Contribution of Morph-feat Embeddings

Experimental Settings
We divide the CoNLL18 UD dataset into 4 parts based on the number of training tokens for each language, to better understand our contributions:

Languages having less than 20k tokens

Languages having more than 20k, less than 50k tokens

Languages having more than 50k, less than 100k tokens

Languages having 100k tokens or more

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having less than 20k training tokens:

Lang code       Morph-Feats  no Morph-Feats  # of tokens
no nynorsklia   5113         5333            3583
ru taiga        5832         6055            10479
sme giella      5278         5339            16385
la perseus      4993         516             18184
ug udt          5278         5339            19262
sl sst          4672         4877            19473
hu szeged       6623         6818            20166

Not useful for languages having less than 20k training tokens.

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having between 50k and 100k training tokens:

Lang code      Morph-Feats  no Morph-Feats  # of tokens
sv lines       7218         7481            48325
fr sequoia     8436         8217            50543
en gum         7644         7534            53686
ko gsd         7374         7254            56687
eu bdt         7455         7332            72974
nl lassysmall  767          758             75134
gl ctg         7902         79018           79327
lv lvtb        7233         7224            80666
id gsd         7576         7397            97531

Beneficial for languages with 50k-100k training tokens.

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having more than 100k training tokens:

Lang code   Morph-Feats  no Morph-Feats  # of tokens
fa seraji   8118         8112            121064
bg btb      8453         8455            124336
en ewt      7577         75682           204585
ar padt     6802         6814            223881
de gsd      7159         7132            263804
ca ancora   8589         85874           417587
es ancora   8499         8478            444617
cs cac      8357         8363            472608
cs pdt      8143         8212            1173282

Neutral for languages having more than 100k training tokens.

Static vs Dynamic Oracle Training

Static oracle: transitions follow the gold moves.
Dynamic oracle: transitions follow the predicted moves.

In both cases, the log probability of the gold moves is maximized.
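The difference between the two regimes can be sketched as a single training step. This is a toy sketch under the assumption that the model exposes a probability per move; the loss is -log p(gold) in both cases, and only the executed move differs.

```python
import math

def oracle_step(probs, gold_move, oracle="static"):
    """One training step (illustrative, not the thesis code).
    probs: dict move -> model probability at the current state.
    Returns the loss (-log p of the gold move) and the move actually
    executed to reach the next state."""
    loss = -math.log(probs[gold_move])
    if oracle == "static":
        executed = gold_move                   # follow the gold move
    else:                                      # "dynamic"
        executed = max(probs, key=probs.get)   # follow the model's prediction
    return loss, executed
```

With probs = {"shift": 0.7, "left": 0.2, "right": 0.1} and gold move "left", both oracles incur the same loss, but the static oracle executes "left" while the dynamic oracle executes the model's preferred "shift".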

Static vs Dynamic Oracle Training

Figure: Results are very close for training tokens less than 20k.

Static vs Dynamic Oracle Training

Figure: Results are very close for training tokens in between 20k and 50k.

Static vs Dynamic Oracle Training

Figure: Results are very close for training tokens more than 50k.

How about languages with less than 20k training tokens?

Transfer Learning

There are 4 possible types of transfer learning:

1 Using very limited data to train an LM for word and context vectors, and using them to train a parser from scratch

2 Using Facebook's word vectors to train a parser [Bojanowski et al. 2017]

3 Using my own word and context vectors, trained on a different language from the same language family

4 Applying transfer learning with a pre-trained parser

Language       (1)           (2)    (3)    (4)
af afribooms   not provided  7546   7743   7812
kk ktb         2019          2231   2196   2386
bxr bdt        764           976    993    898
kmr mg         2012          2257   2278   2339

Table: LAS values for strategies (1), (2), (3), and (4).

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the most beneficial.

Training an LM from scratch does not yield useful word and context vectors.

Our word and context vectors are still more useful than Facebook's [Bojanowski et al. 2017].

Projectivity

Transition based parsers can only build projective trees.

Figure from https://stp.lingfil.uu.se/~sara/kurser/5LN455-2014/lectures/5LN455-F8.pdf
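A tree is projective exactly when no two of its arcs cross, which gives a quick check. This is a sketch; the head-array convention (1-based tokens, head 0 for the root) is an assumption.

```python
def is_projective(heads):
    """heads[i-1] is the head of token i (tokens 1..n, head 0 = root).
    A dependency tree is projective iff no two arcs cross."""
    # Represent each arc as an interval (min, max) over token positions.
    arcs = [(min(h, d), max(h, d)) for d, h in enumerate(heads, start=1)]
    for a1, b1 in arcs:
        for a2, b2 in arcs:
            if a1 < a2 < b1 < b2:   # arcs (a1,b1) and (a2,b2) cross
                return False
    return True
```

For example, heads = [2, 0, 1, 3] (token 3 attaches to token 1 across token 2, whose head lies outside that span) is non-projective.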

Projective vs Non-projective

We compared our model with the best model for different projectivity ratios:

Language      Projectivity  Best (LAS)  Our (LAS)
grc perseus   907           7939        5503 (20)
eu bdt        9513          8422        7413 (17)
hu szeged     978           8266        6818 (14)
da ddt        9826          8628        7640 (17)
en gum        996           8505        7644 (15)
gl treegal    100           7425        7045 (10)
gl ctg        100           8212        7945 (14)

Table: Our model's performance gap decreases as the projectivity ratio increases.

From the official results page and our projectivity table.

Conclusions

Conclusion

In conclusion:
We introduced "Context, Word, and Morph-feat" embeddings and showed their contribution in transition based dependency parsing.

Our Tree-stack LSTM outperformed MLP by removing hand-crafted feature engineering.

Tree-stack LSTM performed better on low-resource languages.

As the training dataset size increases, tree-stack LSTM loses its advantage.

Future Research Direction

End-to-End Training

Systems that were jointly trained for tokenization, morphological tagging, and dependency parsing performed better. Some also jointly trained a language model together with pre-trained embeddings.

Attention Mechanism

Applying attention between σ-LSTM states, or within the β-LSTM or Action-LSTM, may bring a performance improvement.

Morphological Features

Finding a different way to represent morphological features.

Dynamic Oracle vs Beam Training

Although I tried both of them, I could not obtain a performance improvement. There may be convergence problems with our loss function, and other losses (CRF) may solve this problem.

Publications

Omer Kırnap, Erenay Dayanık, and Deniz Yuret. 2018. Tree-stack LSTM in Transition Based Dependency Parsing. In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap, Berkay Furkan Onder, and Deniz Yuret. 2017. Parsing with Context Embeddings. In Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

References

Marco Kuhlmann, Carlos Gómez-Rodríguez, and Giorgio Satta. 2011. Dynamic programming algorithms for transition-based dependency parsers. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Volume 1. Association for Computational Linguistics, pages 673-682.

S. Kübler, R. McDonald, and J. Nivre. 2009. Dependency Parsing. Morgan & Claypool, US.

Chris Dyer, Miguel Ballesteros, Wang Ling, Austin Matthews, and Noah A. Smith. 2015. Transition-based dependency parsing with stack long short-term memory. CoRR abs/1505.08075.

Thank you for your attention

Questions



Problem Definition

Find a model that learns to decide the correct transition from the current state.

2 Related Work


Related Work

Neural Networks for Feature Conjunctions

Neural networks can handle feature conjunctions and nonlinearity. However, they are impractical for high dimensional inputs: they scale linearly in input dimensions (in both time and space, assuming a fixed number of hidden units).

Related Work

Solution: use dense embeddings for input features.
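The idea above can be sketched as a lookup table of small dense vectors per discrete feature; the MLP's hidden layer can then learn conjunctions instead of enumerating them. The feature names and dimensions below are made up for illustration.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy table of discrete features; each feature gets a small dense vector
# instead of a high-dimensional one-hot (or conjoined one-hot) encoding.
vocab = {"POS=NOUN": 0, "POS=VERB": 1, "w=news": 2, "w=had": 3}
E = rng.normal(size=(len(vocab), 8))        # embedding table, d = 8

def embed(features):
    # Concatenate the features' dense vectors; a downstream hidden layer
    # can learn feature conjunctions from this low-dimensional input.
    return np.concatenate([E[vocab[f]] for f in features])

x = embed(["POS=NOUN", "w=news"])           # 2 features -> 16-dim input
```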


3 Model

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to Universal Dependencies:

CoNLL17: Koc-University team with MLP Parser using Context Embeddings

CoNLL18: KParse team with Tree-stack LSTM Parser using Context and Morph-feat Embeddings


a Language Model

Language Model (LM)

The LM is used to obtain Context and Word embeddings, with two components:

Character Based LSTM extracts word vectors

Word Based BiLSTM extracts context vectors
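The first component can be sketched in plain numpy: a character-level LSTM whose final hidden state serves as the word vector. Weight shapes, sizes, and the random initialization below are assumptions, and the word-level BiLSTM context component is omitted.

```python
import numpy as np

def lstm_step(x, h, c, W):
    # One LSTM step; W packs the input/forget/cell/output gate weights.
    z = W @ np.concatenate([x, h])
    i, f, g, o = np.split(z, 4)
    sig = lambda a: 1.0 / (1.0 + np.exp(-a))
    c = sig(f) * c + sig(i) * np.tanh(g)
    return sig(o) * np.tanh(c), c

def word_vector(word, char_emb, W, d):
    # Run the character LSTM left to right over the word's characters;
    # the final hidden state is the word's vector.
    h = c = np.zeros(d)
    for ch in word:
        h, c = lstm_step(char_emb[ch], h, c, W)
    return h

rng = np.random.default_rng(1)
d = 4                                               # toy hidden size
char_emb = {ch: rng.normal(size=d) for ch in "abcdefghijklmnopqrstuvwxyz"}
W = 0.1 * rng.normal(size=(4 * d, 2 * d))           # 4 gates x [x; h]
v = word_vector("news", char_emb, W, d)
```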

Language Model - Word vectors

The character based LSTM generates word vectors.

Figure: Character LSTM, from Kırnap et al. 2017.

Language Model - Context Vectors

The word based BiLSTM generates context vectors.

Figure: Word BiLSTM, from Kırnap et al. 2017.

b MLP Parser (CoNLL17)

MLP Parser

MLP Parser consists of 4 components:

Character Based LSTM extracts word vectors

Word Based BiLSTM extracts context vectors

Feature extractor describes the current state

Decision module (MLP) decides the next transition

MLP Parser - Feature Extraction

The feature extractor describes the current state.

Figure: Kırnap et al. 2017.

MLP Parser - Decision Module

Decision module (MLP) decides the next transition

Experiments & Dataset (MLP) CoNLL17

CoNLL17 Dataset

Dependency parsing of 81 treebanks in 49 languages

All treebanks use standardized annotation:
17 universal part-of-speech tags
37 universal dependency relations

Experiments - Evaluation Metric

Labeled Attachment Score (LAS)
The percentage of words correctly assigned both the correct syntactic head and the correct dependency label.

Example: "Economic news had"
Gold tree (arcs labeled ATT and SBJ): LAS = 1
Pred 1 (arcs labeled PRED and OBJ, both wrong): LAS = 0
Pred 2 (arcs labeled OBJ and ATT, one of the two correct): LAS = (1/2) · 100 = 50
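The metric above is a straightforward per-word count. This is a sketch; encoding each word's analysis as a (head, label) pair is an assumption.

```python
def las(gold, pred):
    """Labeled Attachment Score: percentage of words assigned both the
    correct head and the correct dependency label.
    gold, pred: lists of (head, label) pairs, one per word."""
    correct = sum(g == p for g, p in zip(gold, pred))
    return 100.0 * correct / len(gold)
```

On the slide's example, a prediction with both arcs wrong scores 0 and one with a single correct arc out of two scores 50.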

Experiments (MLP)

CoNLL 2017 Results (all treebanks, LAS)

Ranked 1st among transition based parsers.

Source: CoNLL17 official results page.

Contributions in CoNLL17

Context and Word Embeddings

Relative contributions of part-of-speech (p), word vector (v), and context vector (c):

Feats   Hungarian  En-ParTUT  Latvian
p       636        766        559
v       735        759        63
c       722        76         635
v-c     76         79         676
p-c     78         825        706
p-v     766        808        677
p-fb    747        797        663
p-v-c   793        832        742

Context and Word Embeddings

Context vectors provide an independent contribution on top of POS tags.

Context and Word embeddings

Our BiLSTM language model word vectors perform better than FB vectors.

Context and Word embeddings

Both POS tags and context vectors have significant contributions on top of word vectors.

Issues with MLP

However:

Choosing the correct parser state features still remains critical.

We are unable to represent the whole parsing history with feature extraction.

Solution

Find a recurrent architecture that can summarize the parsing history as well as the word sequences in the buffer and stack.


c Tree-stack LSTM Parser (CoNLL18)

Related Work - Stack LSTM

Figure: Stack LSTM [Dyer et al. 2015]

Represents each component (σ, β, A) with an LSTM.

Modifies the head word's embedding with the dependent's embedding.

Problems with Stack LSTM

They only modify the stack's word embeddings.

Hidden states of the LSTMs are not updated unless there is a reduce.

Actions are not explicitly represented.

They only used word2vec embeddings [Mikolov et al. 2013].

Our solution

We propose:

Context embeddings should improve parsing accuracy

Dependency relations should be explicitly represented

Morphological features of a word may enhance parsing accuracy

Tree-stack LSTM Overview

We propose the Tree-stack LSTM model with 4 components:

β-LSTM
σ-LSTM
Action-LSTM
Tree-RNN
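How the components' outputs feed the transition decision can be sketched as concatenation, one hidden layer, and a softmax over transitions. The dimensions and the exact set of inputs are assumptions for illustration, not the thesis configuration.

```python
import numpy as np

def softmax(z):
    e = np.exp(z - z.max())
    return e / e.sum()

def decide(h_sigma, h_beta, h_action, W1, b1, W2, b2):
    # Concatenate the final hidden states of the σ-LSTM, β-LSTM, and
    # Action-LSTM; one tanh hidden layer, then softmax over transitions.
    x = np.concatenate([h_sigma, h_beta, h_action])
    h = np.tanh(W1 @ x + b1)
    return softmax(W2 @ h + b2)

rng = np.random.default_rng(0)
d, hidden, n_moves = 5, 8, 3            # toy sizes
W1, b1 = rng.normal(size=(hidden, 3 * d)), np.zeros(hidden)
W2, b2 = rng.normal(size=(n_moves, hidden)), np.zeros(n_moves)
p = decide(rng.normal(size=d), rng.normal(size=d), rng.normal(size=d),
           W1, b1, W2, b2)              # distribution over 3 transitions
```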

Tree-stack LSTM

Input Representation

Input Representation

Action and Dependency Relation Embeddings

Every action is represented with a continuous vector

Every dependency relation is represented with a continuous vector

Input Representation

We do not include an explicit feature extractor. We initialize each word representation by concatenating:

Character Based LSTM's word vector

Word Based BiLSTM's context vector

Part-of-speech (POS) vector

Morph-feat vector

Input Representation

Morph-feat Vectors

Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs  (FEATS of the word "It")

Figure: Morph-feat Embeddings
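One way to turn a FEATS string like the one above into a single vector is per-feature embedding lookup plus pooling. This is a sketch: the summation, the dimension, and the unknown-feature handling are assumptions, and the thesis may compose the feature embeddings differently.

```python
import numpy as np

rng = np.random.default_rng(0)
DIM = 6
# Toy embedding table for individual Feature=Value pairs.
table = {f: rng.normal(size=DIM) for f in
         ["Case=Nom", "Gender=Neut", "Number=Sing", "Person=3", "PronType=Prs"]}

def morph_feat_vector(feats_field):
    # Split a UD FEATS string ("Case=Nom|Number=Sing|...", "_" if empty)
    # and pool the per-feature embeddings (here: sum; unknown feats -> 0).
    feats = feats_field.split("|") if feats_field != "_" else []
    v = np.zeros(DIM)
    for f in feats:
        v += table.get(f, np.zeros(DIM))
    return v

v_it = morph_feat_vector("Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs")
```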

Tree-stack LSTM

Model Components:
1 β-LSTM
2 σ-LSTM
3 Action-LSTM
4 Tree-RNN

β-LSTM

Figure: Tree-stack LSTM architecture with the β-LSTM highlighted.

β-LSTM

Figure: Buffer's β-LSTM, reading w_i, w_i+1, w_i+2.

σ-LSTM

Figure: Tree-stack LSTM architecture with the σ-LSTM highlighted.

σ-LSTM

Figure: Stack's σ-LSTM, reading s_i, s_i+1, s_i+2.

Action-LSTM

Figure: Tree-stack LSTM architecture with the Action-LSTM highlighted.

Action-LSTM

Figure: Action-LSTM.

How are the components of tree-stack LSTM connected?

Tree-RNN

Tree-RNN (t-RNN)

Figure: t-RNN composes the dependent word, dependency relation, and head word embeddings.

w_head_new = tanh(W_rnn · [w_head_old; d_l; w_dep] + b_rnn)    (1)
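Equation (1) can be written directly in numpy. This is a sketch with toy dimensions; the shapes of W_rnn and b_rnn and the random initialization are assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)
d, dl = 6, 3                                  # toy word / relation dims
W_rnn = 0.1 * rng.normal(size=(d, 2 * d + dl))
b_rnn = np.zeros(d)

def t_rnn(w_head_old, d_l, w_dep):
    # Eq. (1): w_head_new = tanh(W_rnn · [w_head_old; d_l; w_dep] + b_rnn)
    return np.tanh(W_rnn @ np.concatenate([w_head_old, d_l, w_dep]) + b_rnn)

w_new = t_rnn(rng.normal(size=d), rng.normal(size=dl), rng.normal(size=d))
```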

Tree-RNN with:

1 Left Transition
2 Right Transition

Left Transition

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: Each embedding is initiated by concatenating POS, language, and morph-feat embeddings.

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: Stack's top LSTM is reduced.

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 77 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure β-LSTM recalculates its hidden based on new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 78 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure Tree-stack LSTM is ready to give new transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 79 123

Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 80 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure Each embedding initiated by concatenating POS language andmorph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 81 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure Stackrsquos top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 82 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 83 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure σ-LSTM recalculates its hidden from new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 84 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure Tree-stack LSTM is ready to give new transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 85 123

Final overview of Tree-stack LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 86 123

Overview

1 IntroductionOverview of Dependency ParsingTransition Based Dependency Parsing

2 Related WorkLinear Models and their DrawbacksNeural Network Models

3 ModelLanguage ModelMLP ParserTree-stack LSTM Parser

4 ResultsMLP vs Tree-stack LSTMMorphological Feature EmbeddingsStatic vs Dynamic Oracle TrainingTransfer Learning

5 Conclusion6 Future Work amp Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 87 123

4 Results amp Comparisons

Omer Kırnap (Koc University) MSc Thesis September 27 2018 88 123

Results amp Comparisons

Dataset

Dependency parsing of 81

treebanks in 49 languages

All treebanks use standardized

annotation

17 universal

part-of-speech tags

37 universal dependency

relations

Koc-University ranked 7th out

of 33 participants (1st among

transition based parsers)

Dependency parsing of 82

treebanks in 57 languages

All treebanks use

standardized annotation

17 universal

part-of-speech tags

37 universal dependency

relations

Koc-University ranked 16th

out of 30 participants (2nd

among transition based

parsers)

CoNLL17 CoNLL181 Traintest split change 2 Annotation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 89 123

MLP vs Tree-stack LSTM

CoNLL 2018 committee released comparison results of CoNLL17 andCoNLL18 systems tested under the same test sets

Omer Kırnap (Koc University) MSc Thesis September 27 2018 90 123

MLP vs Tree-stack LSTM

2 possible problems of official comparison

1 If the annotation of the tree bank is improved the older parser ishandicapped

2 If the training-test split has changed and the old training data arenow in test data the old parser is favored undeservedly

Omer Kırnap (Koc University) MSc Thesis September 27 2018 91 123

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code MLP Tree-stackru taiga (10k) 5889 6055hu szeged (20k) 6621 6818tr imst (50k) 5678 5875ar padt (120k) 6783 6814en ewt (205k) 7487 7577cs cac (473k) 8339 8357

Tree-stack LSTM outperforms MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 92 123

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 93 123

MLP Parser

MLP

Figure Initial model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 94 123

Only Action LSTM

LSTM LSTM

Figure Only action LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 95 123

Only β-LSTM

LSTM LSTM LSTM

LSTM LSTM MLP

Figure Only β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 96 123

Only σ-LSTM

LSTM LSTM

LSTM LSTM MLP

Figure Only σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang Code MLP Only Action Only-β Only-σhu szeged 6621 6687 6694 6703sv lines 7112 7205 7217 7245tr imst 5712 5687 5702 5712ar padt 6783 6667 6689 6692

cs cac 8389 8223 8313 8317

en ewt 7554 7543 7556 7567

Table Comparison between MLP and rdquoOnlyrdquo models

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 99 123

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang Code without t-RNN with t-RNNno nynorsklia (3k) 5178 5333ru taiga (11k) 5913 6055gl treegal (15k) 6976 7045hu szeged (20k) 6612 6818sv lines (49k) 7404 7546tr imst (50k) 5812 5875

ar padt (120k) 6804 6814

en ewt (204k) 7487 7577

cs cac (473k) 8289 8357

cs pdt (1M) 8117 81164

t-RNN provides comparative advantage for low-resourcelanguages

Omer Kırnap (Koc University) MSc Thesis September 27 2018 100 123

Ablation Analysis

Overall results of ablation analysis

Lang MLP Only A Only-β Only-σ wot-RNN allhu szeged 6621 6687 6694 6703 6612 6818sv lines 7112 7205 7217 7404 7217 7546tr imst 5712 5687 5702 5712 5812 5875ar padt 6783 6667 6689 6692 6804 6814cs cac 8389 8223 8313 8317 8289 8357en ewt 7554 7543 7556 7567 7487 7577

Tree-stack LSTM beats other model variations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of Ablation Experiments

t-RNNrsquos performance contribution increases when the training sizedecreases

σ-LSTM provides more useful information independent from datasetsize

Interconnecting modelrsquos component with t-RNN makes tree-stackLSTM more powerful for low-resource languages (ranked 10th of alland 2nd among transition based parsers)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What does Morphological Feature Embedding provide

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental SettingsWe divide Conll18 UD dataset 22 into 4 parts based on number oftraining tokens for each language to better understand our contributions

Languages having less than 20k tokens

Languages having more than 20k less than 50k tokens

Languages having more than 50k less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having less than 20k training tokens

Lang code        Morph-Feats  no Morph-Feats  # of tokens
no nynorsklia       51.13        53.33            3,583
ru taiga            58.32        60.55           10,479
sme giella          52.78        53.39           16,385
la perseus          49.93        51.6            18,184
ug udt              52.78        53.39           19,262
sl sst              46.72        48.77           19,473
hu szeged           66.23        68.18           20,166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having between 50k and 100k training tokens

Lang code      Morph-Feats  no Morph-Feats  # of tokens
sv lines          72.18        74.81          48,325
fr sequoia        84.36        82.17          50,543
en gum            76.44        75.34          53,686
ko gsd            73.74        72.54          56,687
eu bdt            74.55        73.32          72,974
nl lassysmall     76.7         75.8           75,134
gl ctg            79.02        79.018         79,327
lv lvtb           72.33        72.24          80,666
id gsd            75.76        73.97          97,531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having more than 100k training tokens

Lang code     Morph-Feats  no Morph-Feats  # of tokens
fa seraji        81.18        81.12          121,064
bg btb           84.53        84.55          124,336
en ewt           75.77        75.682         204,585
ar padt          68.02        68.14          223,881
de gsd           71.59        71.32          263,804
ca ancora        85.89        85.874         417,587
es ancora        84.99        84.78          444,617
cs cac           83.57        83.63          472,608
cs pdt           81.43        82.12        1,173,282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle: transitions follow gold moves
Dynamic oracle: transitions follow predicted moves

In both cases, the log-probability of gold moves is maximized

[Figure: Tree-stack LSTM architecture: σ-, β-, and Action-LSTM states are concatenated and fed to an MLP; the t-RNN composes head word, dependent word, and dependency relation.]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123
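The difference between the two regimes can be sketched with a toy integer state and a stand-in predictor; the state-transition rule and all names here are illustrative only, not the thesis implementation.

```python
def collect_training_pairs(gold_actions, predict, dynamic):
    """Collect (state, gold_action) pairs for one sentence.

    Static oracle: the parser always follows the gold action, so training
    only ever sees states on the gold derivation.
    Dynamic oracle: the parser follows the *predicted* action, so training
    also sees (and learns to recover from) off-gold states.
    In both cases the loss maximizes log p(gold_action | state).
    """
    state, pairs = 0, []
    for gold in gold_actions:
        pairs.append((state, gold))
        action = predict(state) if dynamic else gold
        state = state * 2 + action  # toy deterministic state-transition rule
    return pairs

gold = [1, 0, 1]
always_zero = lambda state: 0  # a stand-in model that always predicts action 0
static_pairs = collect_training_pairs(gold, always_zero, dynamic=False)
dynamic_pairs = collect_training_pairs(gold, always_zero, dynamic=True)
```

With a poor model, the dynamic oracle visits states the static oracle never sees, which is exactly the motivation for training on predicted moves.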

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning:
1. Using very limited data to train an LM for word and context vectors, and using them to train a parser from scratch
2. Using Facebook's word vectors to train a parser [Bojanowski et al. 2017]
3. Using my own word and context vectors trained on a different language from the same language family
4. Applying transfer learning with a pre-trained parser

Language       (1)           (2)    (3)    (4)
af afribooms   not provided  75.46  77.43  78.12
kk ktb         20.19         22.31  21.96  23.86
bxr bdt         7.64          9.76   9.93   8.98
kmr mg         20.12         22.57  22.78  23.39

Table LAS values for strategies (1) (2) (3) and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123
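Strategy (4) can be sketched as copying the shared components of a trained parser into a fresh target-language model before fine-tuning. The parameter names below are hypothetical, chosen only to mirror the model's components.

```python
def init_from_pretrained(pretrained, target, shared=("char_lstm", "word_bilstm")):
    """Strategy (4), sketched: copy the shared components' weights from a
    pre-trained parser into a fresh target-language parser; everything else
    keeps its fresh initialization and is fine-tuned on the target treebank."""
    params = dict(target)
    for name in shared:
        if name in pretrained:
            params[name] = pretrained[name]
    return params

source = {"char_lstm": "W_src", "word_bilstm": "U_src", "mlp": "M_src"}
fresh = {"char_lstm": "W_new", "word_bilstm": "U_new", "mlp": "M_new"}
params = init_from_pretrained(source, fresh)
```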

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the most beneficial

From-scratch LM training does not produce useful word and context vectors

Our word and context vectors are still more useful than Facebook's [Bojanowski et al. 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition-based parsers can only build projective trees 6

6 Figure from https://stp.lingfil.uu.se/~sara/kurser/5LN455-2014/lectures/5LN455-F8.pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123
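Projectivity can be checked by testing for crossing arcs; a small sketch with 1-based token indices and 0 for the root, written for clarity rather than speed.

```python
def is_projective(heads):
    """heads[i-1] is the head of token i (tokens 1-based, 0 = root).
    A tree is projective iff no two arcs cross; O(n^2) check for clarity."""
    arcs = [(min(h, d), max(h, d)) for d, h in enumerate(heads, start=1)]
    for l1, r1 in arcs:
        for l2, r2 in arcs:
            if l1 < l2 < r1 < r2:  # arcs (l1, r1) and (l2, r2) cross
                return False
    return True
```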

Projective vs Non-projective

We compared our model with the best model for different projectivity ratios

Language      Projectivity  Best (LAS)  Our (LAS)
grc perseus      90.7          79.39     55.03 (20)
eu bdt           95.13         84.22     74.13 (17)
hu szeged        97.8          82.66     68.18 (14)
da ddt           98.26         86.28     76.40 (17)
en gum           99.6          85.05     76.44 (15)
gl treegal      100            74.25     70.45 (10)
gl ctg          100            82.12     79.45 (14)

Table: Our model's performance gap decreases as the projectivity ratio increases 7

7 From the official results page and our projectivity table
Omer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusion:
We introduced "Context, Word, and Morph-feat" embeddings and showed their contribution to transition-based dependency parsing

Our Tree-stack LSTM outperformed the MLP by removing hand-crafted feature engineering

Tree-stack LSTM performed better on low-resource languages

When the training dataset size increases, tree-stack LSTM loses its advantage

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization, morphological tagging, and dependency parsing performed better. Some also jointly train a language model together with pre-trained embeddings.

Attention Mechanism

Applying attention between σ-LSTM, β-LSTM, or Action-LSTM states may bring a performance improvement.

Morphological Features

Finding different ways to represent morphological features.

Dynamic Oracle vs Beam Training

Although I tried both of them, I could not obtain a performance improvement. There may be convergence problems with our loss function, and other losses (e.g., CRF) may solve this problem.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap, Erenay Dayanık, and Deniz Yuret. 2018. Tree-stack LSTM in Transition Based Dependency Parsing. In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap, Berkay Furkan Önder, and Deniz Yuret. 2017. Parsing with Context Embeddings. In Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann, Carlos Gómez-Rodríguez, and Giorgio Satta. 2011. Dynamic programming algorithms for transition-based dependency parsers. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies - Volume 1, Association for Computational Linguistics, pages 673-682.

S. Kübler, R. McDonald, and J. Nivre. 2009. Dependency Parsing. Morgan & Claypool, US.

Chris Dyer, Miguel Ballesteros, Wang Ling, Austin Matthews, and Noah A. Smith. 2015. Transition-based dependency parsing with stack long short-term memory. CoRR abs/1505.08075.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123

• Introduction
  • Overview of Dependency Parsing
  • Transition Based Dependency Parsing
• Related Work
  • Linear Models and their Drawbacks
  • Neural Network Models
• Model
  • Language Model
  • MLP Parser
  • Tree-stack LSTM Parser
• Results
  • MLP vs Tree-stack LSTM
  • Morphological Feature Embeddings
  • Static vs Dynamic Oracle Training
  • Transfer Learning
• Conclusion
• Future Work & Discussions
Page 21: Transition Based Dependency Parsing with Deep Learning

Omer Kırnap (Koc University) MSc Thesis September 27 2018 21 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 22 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 23 123

Problem Definition

Find a model that learns to decide the correct transition from the current state

Omer Kırnap (Koc University) MSc Thesis September 27 2018 24 123

2 Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 25 123

Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 26 123

Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 27 123

Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 28 123

Related Work

Neural Networks for Feature Conjunctions

Neural networks can handle feature conjunctions and nonlinearity. However, they are impractical for high-dimensional inputs: they scale linearly in the input dimension (in both time and space, assuming a fixed number of hidden units).

Omer Kırnap (Koc University) MSc Thesis September 27 2018 29 123

Related Work

Solution Using dense embeddings for input features

Omer Kırnap (Koc University) MSc Thesis September 27 2018 30 123

Overview

1. Introduction: Overview of Dependency Parsing; Transition Based Dependency Parsing

2. Related Work: Linear Models and their Drawbacks; Neural Network Models

3. Model: Language Model; MLP Parser; Tree-stack LSTM Parser

4. Results: MLP vs Tree-stack LSTM; Morphological Feature Embeddings; Static vs Dynamic Oracle Training; Transfer Learning

5. Conclusion

6. Future Work & Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 31 123

3 Model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 32 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to Universal Dependencies

CoNLL17
• Koc-University team with MLP Parser using Context Embeddings

CoNLL18
• KParse team with Tree-stack LSTM Parser using Context and Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 33 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to Universal Dependencies

CoNLL17
• Koc-University team with MLP Parser using Context Embeddings

CoNLL18
• KParse team with Tree-stack LSTM Parser using Context and Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 34 123

a Language Model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 35 123

Language Model (LM)

The LM is used to obtain Context and Word embeddings with two components:

Character Based LSTM extracts word vectors

Word Based BiLSTM extracts context vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 36 123
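The two components can be sketched with a toy one-dimensional recurrent cell standing in for the LSTMs; the weights and dimensions below are illustrative only.

```python
import math

def encode(seq, w=0.5, u=0.3):
    """Toy 1-dimensional recurrent cell standing in for an LSTM:
    h_t = tanh(w * x_t + u * h_{t-1})."""
    h = 0.0
    for x in seq:
        h = math.tanh(w * x + u * h)
    return h

def word_vector(word):
    """Character-based encoder: run the recurrent cell over character codes."""
    return encode(ord(ch) / 128 for ch in word)

def context_vectors(words):
    """BiLSTM-style context: pair each word's forward (left-to-right) state
    with its backward (right-to-left) state."""
    vecs = [word_vector(w) for w in words]
    fwd, h = [], 0.0
    for v in vecs:
        h = math.tanh(0.5 * v + 0.3 * h)
        fwd.append(h)
    bwd, h = [], 0.0
    for v in reversed(vecs):
        h = math.tanh(0.5 * v + 0.3 * h)
        bwd.append(h)
    bwd.reverse()
    return list(zip(fwd, bwd))
```

Note that the same word receives different context vectors in different positions, which is the point of the word-level BiLSTM.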

Language Model - Word vectors

Character based LSTM generates word Vectors

Figure Character LSTM from Kırnap et al 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 37 123

Language Model - Context Vectors

Word based BiLSTM generates Context Vectors

Figure Word BiLSTM from Kırnap et al 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 38 123

b MLP Parser (CoNLL17)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 39 123

MLP Parser

MLP Parser consists of 4 components

Character Based LSTM extracts word vectors

Word Based BiLSTM extracts context vectors

Feature extractor describes current state

Decision module (MLP) decides the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 40 123

MLP Parser - Feature Extraction

Feature extractor describes current state

Figure Kırnap et al 2017Omer Kırnap (Koc University) MSc Thesis September 27 2018 41 123

MLP Parser - Decision Module

Decision module (MLP) decides the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 42 123
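A minimal sketch of such a decision module: a one-hidden-layer network scoring each transition from the extracted feature vector, with the argmax taken as the next move. The weights in the example are toy values, not trained parameters.

```python
import math

def mlp_decide(features, W1, b1, W2, b2):
    """One-hidden-layer scorer: hidden = tanh(W1 x + b1), scores = W2 hidden + b2.
    Returns the index of the highest-scoring transition."""
    hidden = [math.tanh(sum(w * x for w, x in zip(row, features)) + b)
              for row, b in zip(W1, b1)]
    scores = [sum(w * h for w, h in zip(row, hidden)) + b
              for row, b in zip(W2, b2)]
    return max(range(len(scores)), key=scores.__getitem__)
```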

Experiments amp Dataset (MLP) CoNLL17

CoNLL17 Dataset

Dependency parsing of 81 treebanks in 49 languages

All treebanks use standardized annotation
• 17 universal part-of-speech tags
• 37 universal dependency relations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 43 123

Experiments - Evaluation Metric

Labeled Attachment Score (LAS): the percentage of words correctly assigned both the correct syntactic head and the correct dependency label.

[Example over the gold tree of "Economic news had" (Economic -ATT-> news, news -SBJ-> had): a prediction that gets both arcs wrong has LAS 0; a prediction that gets one of the two arcs right has LAS (1/2) × 100 = 50.]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 44 123
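The metric on this slide can be written directly. For illustration the words are keyed by surface form, which assumes no repeated words in the sentence.

```python
def las(gold, pred):
    """Labeled Attachment Score: percentage of words assigned both the
    correct head and the correct dependency label.
    gold and pred map each word to a (head, label) pair."""
    correct = sum(1 for word, arc in gold.items() if pred.get(word) == arc)
    return 100.0 * correct / len(gold)

# The slide's example, keyed by surface form
gold = {"Economic": ("news", "ATT"), "news": ("had", "SBJ")}
pred1 = {"Economic": ("news", "OBJ"), "news": ("had", "PRED")}  # both arcs wrong
pred2 = {"Economic": ("news", "ATT"), "news": ("had", "OBJ")}   # one arc fully right
```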

Experiments (MLP)

CoNLL 2017 Results (all treebanks LAS)

Ranked 1st among transition-based parsers 5

5 Source: CoNLL17 official results page
Omer Kırnap (Koc University) MSc Thesis September 27 2018 45 123

Contributions in CoNLL17

Omer Kırnap (Koc University) MSc Thesis September 27 2018 46 123

Context and Word Embeddings

Relative contributions of part-of-speech (p) word vector (v)context vector (c)

Feats   Hungarian  En-ParTUT  Latvian
p         63.6       76.6       55.9
v         73.5       75.9       63
c         72.2       76         63.5
v-c       76         79         67.6
p-c       78         82.5       70.6
p-v       76.6       80.8       67.7
p-fb      74.7       79.7       66.3
p-v-c     79.3       83.2       74.2

Omer Kırnap (Koc University) MSc Thesis September 27 2018 47 123

Context and Word Embeddings

Relative contributions of part-of-speech (p) word vector (v)context vector (c)

Feats   Hungarian  En-ParTUT  Latvian
p         63.6       76.6       55.9
v         73.5       75.9       63
c         72.2       76         63.5
v-c       76         79         67.6
p-c       78         82.5       70.6
p-v       76.6       80.8       67.7
p-fb      74.7       79.7       66.3
p-v-c     79.3       83.2       74.2

Context vectors provide independent contribution on top ofPOS tags

Omer Kırnap (Koc University) MSc Thesis September 27 2018 48 123

Context and Word embeddings

Feats   Hungarian  En-ParTUT  Latvian
p         63.6       76.6       55.9
v         73.5       75.9       63
c         72.2       76         63.5
v-c       76         79         67.6
p-c       78         82.5       70.6
p-v       76.6       80.8       67.7
p-fb      74.7       79.7       66.3
p-v-c     79.3       83.2       74.2

Our BiLSTM language model word vectors perform betterthan FB vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 49 123

Context and Word embeddings

Feats   Hungarian  En-ParTUT  Latvian
p         63.6       76.6       55.9
v         73.5       75.9       63
c         72.2       76         63.5
v-c       76         79         67.6
p-c       78         82.5       70.6
p-v       76.6       80.8       67.7
p-fb      74.7       79.7       66.3
p-v-c     79.3       83.2       74.2

Both POS tags and context vectors have significantcontributions on top of word vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 50 123

Issues with MLP

However

Choosing correct state of parser still remains critical

We are unable to represent whole parsing history with featureextracting

Omer Kırnap (Koc University) MSc Thesis September 27 2018 51 123

Solution

Find a recurrent architecture that can summarize the parsing history as well as the word sequences in the buffer and the stack.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 52 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to Universal Dependencies

CoNLL17
• Koc-University team with MLP Parser using Context Embeddings

CoNLL18
• KParse team with Tree-stack LSTM Parser using Context and Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 53 123

c Tree-stack LSTM Parser (CoNLL18)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 54 123

Related Work - Stack LSTM

Figure Stack LSTM [Dyer et al 2015]

Represent each component (σ, β, A) with an LSTM; modify the head word's embedding with the dependent's embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 55 123

Problems with Stack LSTM

They only modify the stack's word embeddings

Hidden states of the LSTMs are not updated except on a reduce

Actions are not explicitly represented

They only used word2vec embeddings [Mikolov et al. 2013]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 56 123

Our solution

We propose

Context embeddings should improve parsing accuracy

Dependency relations should be explicitly represented

Morphological Features of a word may enhance parsing accuracy

Omer Kırnap (Koc University) MSc Thesis September 27 2018 57 123

Tree-stack LSTM Overview

[Figure: Tree-stack LSTM architecture: σ-, β-, and Action-LSTM states are concatenated and fed to an MLP; the t-RNN composes head word, dependent word, and dependency relation.]

We propose the Tree-stack LSTM model with 4 components:
• β-LSTM
• σ-LSTM
• Action-LSTM
• Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 58 123

Tree-stack LSTM

Input Representation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 59 123

Input Representation

Action and Dependency Relation Embeddings

Every action is represented with continuous vector

Every dependency relation is represented with continuous vector

Omer Kırnap (Koc University) MSc Thesis September 27 2018 60 123

Input Representation

We do not include an explicit feature extractor. We initialize each word representation by concatenating:

Character-based LSTM's word vectors

Word-based BiLSTM's context vectors

Part-of-speech (POS) vectors

Morph-feat vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 61 123

Input Representation

Morp-feat Vectors

Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs IT It

Figure Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 62 123
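A UD FEATS string like the one above splits into key=value pairs, each of which indexes a learned embedding; a sketch of the parsing step, with the embedding lookup itself omitted:

```python
def parse_feats(feats):
    """Split a UD FEATS string into key -> value pairs; the parser looks up
    one learned embedding per pair and combines them into the morph-feat vector."""
    if feats == "_":  # CoNLL-U uses "_" for 'no features'
        return {}
    return dict(kv.split("=", 1) for kv in feats.split("|"))

f = parse_feats("Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs")
```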

Tree-stack LSTM

Model Components1 β-LSTM2 σ-LSTM3 Action-LSTM4 Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 63 123

β-LSTM

[Figure: Tree-stack LSTM architecture with the β-LSTM highlighted.]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 64 123

β-LSTM

Figure: The buffer's β-LSTM over the upcoming words w_i, w_i+1, w_i+2.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 65 123

σ-LSTM

[Figure: Tree-stack LSTM architecture with the σ-LSTM highlighted.]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 66 123

σ-LSTM

Figure: The stack's σ-LSTM over the stack items s_i, s_i+1, s_i+2.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 67 123

Action-LSTM

[Figure: Tree-stack LSTM architecture with the Action-LSTM highlighted.]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 68 123

Action-LSTM

Figure: The Action-LSTM over the sequence of past transitions.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 69 123

How are the components of tree-stack LSTM connected?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 70 123

Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 71 123

Tree-RNN (t-RNN)

Figure: t-RNN combining the head word, dependent word, and dependency relation.

w_head_new = tanh(W_rnn * [w_head_old ; d_l ; w_dep] + b_rnn)   (1)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 72 123
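Equation (1) is a single tanh layer over the concatenation of the three vectors; a dependency-free sketch with plain Python lists, where the shapes are illustrative:

```python
import math

def t_rnn(w_head_old, d_label, w_dep, W_rnn, b_rnn):
    """Eq. (1): w_head_new = tanh(W_rnn * [w_head_old ; d_label ; w_dep] + b_rnn)."""
    x = w_head_old + d_label + w_dep  # vector concatenation
    return [math.tanh(sum(W_rnn[i][j] * x[j] for j in range(len(x))) + b_rnn[i])
            for i in range(len(b_rnn))]

# 1-dimensional head/label/dependent vectors, 1x3 weight matrix
new_head = t_rnn([1.0], [0.0], [0.0], [[1.0, 0.0, 0.0]], [0.0])
```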

Tree-RNN with

1 Left Transition2 Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 73 123

Left Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 74 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: Each embedding is initialized by concatenating POS, language, and morph-feat embeddings.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 75 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: The stack's top LSTM is reduced.
Omer Kırnap (Koc University) MSc Thesis September 27 2018 76 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: t-RNN calculates the new head embedding.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 77 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: β-LSTM recalculates its hidden state based on the new input.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 78 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: Tree-stack LSTM is ready to produce the next transition.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 79 123

Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 80 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: Each embedding is initialized by concatenating POS, language, and morph-feat embeddings.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 81 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: The stack's top LSTM is reduced.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 82 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: t-RNN calculates the new head embedding.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 83 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: σ-LSTM recalculates its hidden state from the new input.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 84 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: Tree-stack LSTM is ready to produce the next transition.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 85 123

Final overview of Tree-stack LSTM

[Figure: Tree-stack LSTM architecture: σ-, β-, and Action-LSTM states are concatenated and fed to an MLP; the t-RNN composes head word, dependent word, and dependency relation.]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 86 123

Overview

1. Introduction: Overview of Dependency Parsing; Transition Based Dependency Parsing

2. Related Work: Linear Models and their Drawbacks; Neural Network Models

3. Model: Language Model; MLP Parser; Tree-stack LSTM Parser

4. Results: MLP vs Tree-stack LSTM; Morphological Feature Embeddings; Static vs Dynamic Oracle Training; Transfer Learning

5. Conclusion

6. Future Work & Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 87 123

4 Results amp Comparisons

Omer Kırnap (Koc University) MSc Thesis September 27 2018 88 123

Results amp Comparisons

Dataset

CoNLL17
• Dependency parsing of 81 treebanks in 49 languages
• All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations
• Koc-University ranked 7th out of 33 participants (1st among transition-based parsers)

CoNLL18
• Dependency parsing of 82 treebanks in 57 languages
• All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations
• Koc-University ranked 16th out of 30 participants (2nd among transition-based parsers)

Changes from CoNLL17 to CoNLL18: 1. Train/test split change 2. Annotation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 89 123

MLP vs Tree-stack LSTM

The CoNLL 2018 committee released comparison results of CoNLL17 and CoNLL18 systems tested on the same test sets.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 90 123

MLP vs Tree-stack LSTM

2 possible problems with the official comparison:

1. If the annotation of the treebank has improved, the older parser is handicapped

2. If the training-test split has changed and the old training data are now in the test data, the old parser is favored undeservedly

Omer Kırnap (Koc University) MSc Thesis September 27 2018 91 123

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code         MLP    Tree-stack
ru taiga (10k)    58.89    60.55
hu szeged (20k)   66.21    68.18
tr imst (50k)     56.78    58.75
ar padt (120k)    67.83    68.14
en ewt (205k)     74.87    75.77
cs cac (473k)     83.39    83.57

Tree-stack LSTM outperforms MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 92 123

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 93 123

MLP Parser

Figure: The initial MLP model.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 94 123

Only Action LSTM

Figure: Only the Action-LSTM.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 95 123

Only β-LSTM

Figure: Only the β-LSTM.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 96 123

Only σ-LSTM

Figure: Only the σ-LSTM.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang Code   MLP    Only-Action  Only-β  Only-σ
hu szeged   66.21     66.87      66.94   67.03
sv lines    71.12     72.05      72.17   72.45
tr imst     57.12     56.87      57.02   57.12
ar padt     67.83     66.67      66.89   66.92
cs cac      83.89     82.23      83.13   83.17
en ewt      75.54     75.43      75.56   75.67

Table: Comparison between the MLP and the "Only" models

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN

[Figure: Tree-stack LSTM architecture with the t-RNN highlighted.]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 99 123

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang Code without t-RNN with t-RNNno nynorsklia (3k) 5178 5333ru taiga (11k) 5913 6055gl treegal (15k) 6976 7045hu szeged (20k) 6612 6818sv lines (49k) 7404 7546tr imst (50k) 5812 5875

ar padt (120k) 6804 6814

en ewt (204k) 7487 7577

cs cac (473k) 8289 8357

cs pdt (1M) 8117 81164

t-RNN provides comparative advantage for low-resourcelanguages

Omer Kırnap (Koc University) MSc Thesis September 27 2018 100 123

Ablation Analysis

Overall results of ablation analysis

Lang MLP Only A Only-β Only-σ wot-RNN allhu szeged 6621 6687 6694 6703 6612 6818sv lines 7112 7205 7217 7404 7217 7546tr imst 5712 5687 5702 5712 5812 5875ar padt 6783 6667 6689 6692 6804 6814cs cac 8389 8223 8313 8317 8289 8357en ewt 7554 7543 7556 7567 7487 7577

Tree-stack LSTM beats other model variations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of Ablation Experiments

t-RNNrsquos performance contribution increases when the training sizedecreases

σ-LSTM provides more useful information independent from datasetsize

Interconnecting modelrsquos component with t-RNN makes tree-stackLSTM more powerful for low-resource languages (ranked 10th of alland 2nd among transition based parsers)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What does Morphological Feature Embedding provide

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental SettingsWe divide Conll18 UD dataset 22 into 4 parts based on number oftraining tokens for each language to better understand our contributions

Languages having less than 20k tokens

Languages having more than 20k less than 50k tokens

Languages having more than 50k less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having less than 20k training tokens

Lang code Morph-Feats no Morph-Feats of tokensno nynorsklia 5113 5333 3583

ru taiga 5832 6055 10479

sme giella 5278 5339 16385

la perseus 4993 516 18184

ug udt 5278 5339 19262

sl sst 4672 4877 19473

hu szeged 6623 6818 20166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having tokens in between 50k and100k

Lang code Morph-Feats no Morph-Feats of tokenssv lines 7218 7481 48325

fr sequoia 8436 8217 50543

en gum 7644 7534 53686

ko gsd 7374 7254 56687

eu bdt 7455 7332 72974

nl lassymal 767 758 75134

gl ctg 7902 79018 79327

lv lvtb 7233 7224 80666

id gsd 7576 7397 97531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having more than 100k trainingtokens

Lang code Morph-Feats no Morph-Feats of tokensfa seraji 8118 8112 121064

bg btb 8453 8455 124336

en ewt 7577 75682 204585

ar padt 6802 6814 223881

de gsd 7159 7132 263804

ca ancora 8589 85874 417587

es ancora 8499 8478 444617

cs cac 8357 8363 472608

cs pdt 8143 8212 1173282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle transitions using gold movesDynamic oracle transitions using predicted moves

In both cases logp of gold moves maximized

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning1 Using very limited data to train LM for word and context vectors and

use them to train a parser from scratch2 Using Facebookrsquos word vectors to train a parser [Bojanowski et al

2017]3 Using my own word and context vectors trained with different

language but from the same language family4 Applying transfer learning with a pre-trained parser

Language (1) (2) (3) (4)af afribooms not provided 7546 7743 7812kk ktb 2019 2231 2196 2386bxr bdt 764 976 993 898

kmr mg 2012 2257 2278 2339

Table LAS values for strategies (1) (2) (3) and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the mostbeneficial

From scratch LM training does not bring useful word and contextvectors

Our word and context vectors are still more useful than Facebookrsquos[Bojanowski et al 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition Based Parser can only build projective trees 6

6Figure fromhttpstplingfiluuse sarakurser5LN455-2014lectures5LN455-F8pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123

Projective vs Non-projective

We compared our model with the best model for different projectivityratios

Language Projectiviy Best (LAS) Our (LAS)

grc perseus 907 7939 5503 (20)

eu bdt 9513 8422 7413 (17)

hu szeged 978 8266 6818 (14)

da ddt 9826 8628 7640 (17)

en gum 996 8505 7644 (15)

gl treegal 100 7425 7045 (10)

gl ctg 100 8212 7945 (14)

Table Our models performance gap decreases as the projectivity ratio increases

7

7From official results page and our projectivity tableOmer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusionWe introduced ldquoContext Word and Morph-featrdquo embeddings and showedtheir contribution in transition based dependency parsing

Our Tree-stack LSTM outperformed MLP by removing hand-craftedfeature engineering

Tree-stack LSTM performed better with low resource languages

When the training dataset size increases tree-stack LSTM losses itsadvantage

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization morphological taggingand dependency parsing performed better Some are also jointly trained alanguage model together with pre-trained embeddings

Attention Mechanism

Applying attention in between σ-LSTM states or β-LSTM orAction-LSTM may bring performance improvement

Morphological Features

Finding different way to represent morphological features

Dynamic Oracle vs Beam Training

Although I tried both of them I could not obtain performanceimprovement There may be convergence problems with our loss functionand another losses (CRF) may solve this problem

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap Erenay Dayanık and Deniz Yuret 2018 Tree-stackLSTM in Transition Based Dependency Parsing In Proceedings ofthe CoNLL 2018 Shared Task Multilingual Parsing from Raw Text toUniversal Dependencies

Omer Kırnap Berkay Furkan Onder and Deniz Yuret 2017 Parsingwith Context Embeddings In Proceedings of the CoNLL 2017 SharedTask Multilingual Parsing from Raw Text to Universal Dependencies

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann, Carlos Gomez-Rodriguez and Giorgio Satta. 2011. Dynamic programming algorithms for transition-based dependency parsers. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Volume 1. Association for Computational Linguistics, pages 673–682.

S. Kübler, R. McDonald and J. Nivre. 2009. Dependency Parsing. Morgan & Claypool, US.

Chris Dyer, Miguel Ballesteros, Wang Ling, Austin Matthews and Noah A. Smith. 2015. Transition-based dependency parsing with stack long short-term memory. CoRR abs/1505.08075.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123

Page 22: Transition Based Dependency Parsing with Deep LearningTransition Based Dependency Parsing with Deep Learning Omer K rnap Ko˘c University okirnap@ku.edu.tr September 27, 2018 Omer

Omer Kırnap (Koc University) MSc Thesis September 27 2018 22 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 23 123

Problem Definition

Find a model learning to decide correct transition from current state

Omer Kırnap (Koc University) MSc Thesis September 27 2018 24 123

2 Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 25 123

Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 26 123

Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 27 123

Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 28 123

Related Work

Neural Networks for Feature Conjunctions

Neural networks can handle feature conjunctions and nonlinearityHoweverImpractical for high dimensional inputs they scale linearly in inputdimensions (in both time and space assuming fixed number of hiddenunits)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 29 123

Related Work

Solution Using dense embeddings for input features

Omer Kırnap (Koc University) MSc Thesis September 27 2018 30 123

Overview

1 IntroductionOverview of Dependency ParsingTransition Based Dependency Parsing

2 Related WorkLinear Models and their DrawbacksNeural Network Models

3 ModelLanguage ModelMLP ParserTree-stack LSTM Parser

4 ResultsMLP vs Tree-stack LSTMMorphological Feature EmbeddingsStatic vs Dynamic Oracle TrainingTransfer Learning

5 Conclusion6 Future Work amp Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 31 123

3 Model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 32 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to UniversalDependencies

CoNLL17bull Koc-University team with MLP Parser using Context Embeddings

CoNLL18bull KParse team with Tree-stack LSTM Parser using Context and

Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 33 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to UniversalDependencies

CoNLL17

bull Koc-University team with MLP Parser using ContextEmbeddings

CoNLL18

bull KParse team with Tree-stack LSTM Parser using Context and Morph-featEmbeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 34 123

a Language Model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 35 123

Language Model (LM)

LM is used to obtain Context and Word embeddings with twocomponents

Character Based LSTM extracts word vectors

Word Based BiLSTM extracts context vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 36 123

Language Model - Word vectors

Character based LSTM generates word Vectors

Figure Character LSTM from Kırnap et al 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 37 123

Language Model - Context Vectors

Word based BiLSTM generates Context Vectors

Figure Word BiLSTM from Kırnap et al 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 38 123

b MLP Parser (CoNLL17)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 39 123

MLP Parser

MLP Parser consists of 4 components

Character Based LSTM extracts word vectors

Word Based BiLSTM extracts context vectors

Feature extractor describes current state

Decision module (MLP) decides the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 40 123

MLP Parser - Feature Extraction

Feature extractor describes current state

Figure Kırnap et al 2017Omer Kırnap (Koc University) MSc Thesis September 27 2018 41 123

MLP Parser - Decision Module

Decision module (MLP) decides the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 42 123

Experiments amp Dataset (MLP) CoNLL17

CoNLL17 Dataset

Dependency parsing of 81 treebanks in 49 languages

All treebanks use standardized annotationbull 17 universal part-of-speech tagsbull 37 universal dependency relations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 43 123

Experiments - Evaluation Metric

Labeled Attachment Score (LAS)The percentage of words correctly assigned both the correct syntactic headand the correct dependency label

Economic news hadGold Tree LAS 1

SBJATT

Economic news had

PREDOBJPred 1 LAS 0

Economic news had

OBJATTPred 2 LAS (frac12)100

Omer Kırnap (Koc University) MSc Thesis September 27 2018 44 123

Experiments (MLP)

CoNLL 2017 Results (all treebanks LAS)

Ranked 1st among transition based parsers 5

5Source CoNLL17 official results pageOmer Kırnap (Koc University) MSc Thesis September 27 2018 45 123

Contributions in CoNLL17

Omer Kırnap (Koc University) MSc Thesis September 27 2018 46 123

Context and Word Embeddings

Relative contributions of part-of-speech (p) word vector (v)context vector (c)

Feats Hungarian En-ParTUT Latvianp 636 766 559

v 735 759 63

c 722 76 635

v-c 76 79 676

p-c 78 825 706

p-v 766 808 677

p-fb 747 797 663

p-v-c 793 832 742

Omer Kırnap (Koc University) MSc Thesis September 27 2018 47 123

Context and Word Embeddings

Relative contributions of part-of-speech (p) word vector (v)context vector (c)

Feats Hungarian En-ParTUT Latvianp 636 766 559

v 735 759 63

c 722 76 635

v-c 76 79 676

p-c 78 825 706

p-v 766 808 677

p-fb 747 797 663

p-v-c 793 832 742

Context vectors provide independent contribution on top ofPOS tags

Omer Kırnap (Koc University) MSc Thesis September 27 2018 48 123

Context and Word embeddings

Feats Hungarian En-ParTUT Latvianp 636 766 559

v 735 759 63

c 722 76 635

v-c 76 79 676

p-c 78 825 706

p-v 766 808 677p-fb 747 797 663

p-v-c 793 832 742

Our BiLSTM language model word vectors perform betterthan FB vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 49 123

Context and Word embeddings

Feats Hungarian En-ParTUT Latvianp 636 766 559

v 735 759 63

c 722 76 635

v-c 76 79 676

p-c 78 825 706

p-v 766 808 677

p-fb 747 797 663

p-v-c 793 832 742

Both POS tags and context vectors have significantcontributions on top of word vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 50 123

Issues with MLP

However

Choosing correct state of parser still remains critical

We are unable to represent whole parsing history with featureextracting

Omer Kırnap (Koc University) MSc Thesis September 27 2018 51 123

Solution

Find a recurrent architecture such that it can summarize the parsinghistory as well as word sequences in a buffer and stack

Omer Kırnap (Koc University) MSc Thesis September 27 2018 52 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to UniversalDependencies

CoNLL17

bull Koc-University team with MLP Parser using Context Embeddings

CoNLL18

bull KParse team with Tree-stack LSTM Parser usingContext and Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 53 123

c Tree-stack LSTM Parser (CoNLL18)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 54 123

Related Work - Stack LSTM

Figure Stack LSTM [Dyer et al 2015]

Represent each component (σ β A) with an LSTMModifying head wordrsquos embedding with dependent embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 55 123

Problems with Stack LSTM

They only modify stackrsquos word embeddings

Hidden states of LSTMS are not updated unless reduce

Actions are not explicitly represented

They only used word2vec embeddings [Mikolov et al 2013]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 56 123

Our solution

We propose

Context embeddings should improve parsing accuracy

Dependency relations should be explicitly represented

Morphological Features of a word may enhance parsing accuracy

Omer Kırnap (Koc University) MSc Thesis September 27 2018 57 123

Tree-stack LSTM Overview

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

We propose Tree-stack LSTM model with 4 components

β-LSTMσ-LSTMAction-LSTMTree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 58 123

Tree-stack LSTM

Input Representation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 59 123

Input Representation

Action and Dependency Relation Embeddings

Every action is represented with continuous vector

Every dependency relation is represented with continuous vector

Omer Kırnap (Koc University) MSc Thesis September 27 2018 60 123

Input Representation

We do not include explicit feature extractor We initiated wordrepresentation by concatenating

Character Based LSTMrsquos word vectors

Word Based BiLSTMrsquos context vectors

Part-of-speech (POS) vectors

Morph-feat vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 61 123

Input Representation

Morp-feat Vectors

Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs IT It

Figure Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 62 123

Tree-stack LSTM

Model Components1 β-LSTM2 σ-LSTM3 Action-LSTM4 Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 63 123

β-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 64 123

β-LSTM

LSTM LSTM LSTM

wi+2wi+1wi

Figure Bufferrsquos β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 65 123

σ-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 66 123

σ-LSTM

LSTM LSTM LSTM

si si+1 si+2

Figure Stackrsquos σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 67 123

Action-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 68 123

Action-LSTM

LSTM LSTM LSTM

Figure Action-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 69 123

How do components of tree-stack LSTM are connected

Omer Kırnap (Koc University) MSc Thesis September 27 2018 70 123

Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 71 123

Tree-RNN (t-RNN)

t-RNN

Dependent word

Dependency Relation

Head word

Figure t-RNN

whead new = tanh(Wrnn lowast [whead old dl wdep] + brnn) (1)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 72 123

Tree-RNN with

1 Left Transition2 Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 73 123

Left Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 74 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM LSTM

Left transition

t-RNN

Dependency Relation

HeadDependent

Figure Each embedding initiated by concatenating POS language andmorph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 75 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

Figure Stackrsquos top LSTM is reducedOmer Kırnap (Koc University) MSc Thesis September 27 2018 76 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 77 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure β-LSTM recalculates its hidden based on new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 78 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure Tree-stack LSTM is ready to give new transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 79 123

Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 80 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure Each embedding initiated by concatenating POS language andmorph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 81 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure Stackrsquos top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 82 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 83 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure σ-LSTM recalculates its hidden from new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 84 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure Tree-stack LSTM is ready to give new transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 85 123

Final overview of Tree-stack LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 86 123

Overview

1 IntroductionOverview of Dependency ParsingTransition Based Dependency Parsing

2 Related WorkLinear Models and their DrawbacksNeural Network Models

3 ModelLanguage ModelMLP ParserTree-stack LSTM Parser

4 ResultsMLP vs Tree-stack LSTMMorphological Feature EmbeddingsStatic vs Dynamic Oracle TrainingTransfer Learning

5 Conclusion6 Future Work amp Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 87 123

4 Results amp Comparisons

Omer Kırnap (Koc University) MSc Thesis September 27 2018 88 123

Results amp Comparisons

Dataset

Dependency parsing of 81

treebanks in 49 languages

All treebanks use standardized

annotation

17 universal

part-of-speech tags

37 universal dependency

relations

Koc-University ranked 7th out

of 33 participants (1st among

transition based parsers)

Dependency parsing of 82

treebanks in 57 languages

All treebanks use

standardized annotation

17 universal

part-of-speech tags

37 universal dependency

relations

Koc-University ranked 16th

out of 30 participants (2nd

among transition based

parsers)

CoNLL17 CoNLL181 Traintest split change 2 Annotation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 89 123

MLP vs Tree-stack LSTM

CoNLL 2018 committee released comparison results of CoNLL17 andCoNLL18 systems tested under the same test sets

Omer Kırnap (Koc University) MSc Thesis September 27 2018 90 123

MLP vs Tree-stack LSTM

2 possible problems of official comparison

1 If the annotation of the tree bank is improved the older parser ishandicapped

2 If the training-test split has changed and the old training data arenow in test data the old parser is favored undeservedly

Omer Kırnap (Koc University) MSc Thesis September 27 2018 91 123

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code MLP Tree-stackru taiga (10k) 5889 6055hu szeged (20k) 6621 6818tr imst (50k) 5678 5875ar padt (120k) 6783 6814en ewt (205k) 7487 7577cs cac (473k) 8339 8357

Tree-stack LSTM outperforms MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 92 123

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 93 123

MLP Parser

MLP

Figure Initial model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 94 123

Only Action LSTM

LSTM LSTM

Figure Only action LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 95 123

Only β-LSTM

LSTM LSTM LSTM

LSTM LSTM MLP

Figure Only β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 96 123

Only σ-LSTM

LSTM LSTM

LSTM LSTM MLP

Figure Only σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang Code MLP Only Action Only-β Only-σhu szeged 6621 6687 6694 6703sv lines 7112 7205 7217 7245tr imst 5712 5687 5702 5712ar padt 6783 6667 6689 6692

cs cac 8389 8223 8313 8317

en ewt 7554 7543 7556 7567

Table Comparison between MLP and rdquoOnlyrdquo models

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 99 123

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang Code without t-RNN with t-RNNno nynorsklia (3k) 5178 5333ru taiga (11k) 5913 6055gl treegal (15k) 6976 7045hu szeged (20k) 6612 6818sv lines (49k) 7404 7546tr imst (50k) 5812 5875

ar padt (120k) 6804 6814

en ewt (204k) 7487 7577

cs cac (473k) 8289 8357

cs pdt (1M) 8117 81164

t-RNN provides comparative advantage for low-resourcelanguages

Omer Kırnap (Koc University) MSc Thesis September 27 2018 100 123

Ablation Analysis

Overall results of ablation analysis

Lang MLP Only A Only-β Only-σ wot-RNN allhu szeged 6621 6687 6694 6703 6612 6818sv lines 7112 7205 7217 7404 7217 7546tr imst 5712 5687 5702 5712 5812 5875ar padt 6783 6667 6689 6692 6804 6814cs cac 8389 8223 8313 8317 8289 8357en ewt 7554 7543 7556 7567 7487 7577

Tree-stack LSTM beats other model variations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of Ablation Experiments

t-RNNrsquos performance contribution increases when the training sizedecreases

σ-LSTM provides more useful information independent from datasetsize

Interconnecting modelrsquos component with t-RNN makes tree-stackLSTM more powerful for low-resource languages (ranked 10th of alland 2nd among transition based parsers)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What does Morphological Feature Embedding provide

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental SettingsWe divide Conll18 UD dataset 22 into 4 parts based on number oftraining tokens for each language to better understand our contributions

Languages having less than 20k tokens

Languages having more than 20k less than 50k tokens

Languages having more than 50k less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having less than 20k training tokens

Lang code Morph-Feats no Morph-Feats of tokensno nynorsklia 5113 5333 3583

ru taiga 5832 6055 10479

sme giella 5278 5339 16385

la perseus 4993 516 18184

ug udt 5278 5339 19262

sl sst 4672 4877 19473

hu szeged 6623 6818 20166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having tokens in between 50k and100k

Lang code Morph-Feats no Morph-Feats of tokenssv lines 7218 7481 48325

fr sequoia 8436 8217 50543

en gum 7644 7534 53686

ko gsd 7374 7254 56687

eu bdt 7455 7332 72974

nl lassymal 767 758 75134

gl ctg 7902 79018 79327

lv lvtb 7233 7224 80666

id gsd 7576 7397 97531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having more than 100k trainingtokens

Lang code Morph-Feats no Morph-Feats of tokensfa seraji 8118 8112 121064

bg btb 8453 8455 124336

en ewt 7577 75682 204585

ar padt 6802 6814 223881

de gsd 7159 7132 263804

ca ancora 8589 85874 417587

es ancora 8499 8478 444617

cs cac 8357 8363 472608

cs pdt 8143 8212 1173282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle transitions using gold movesDynamic oracle transitions using predicted moves

In both cases logp of gold moves maximized

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning1 Using very limited data to train LM for word and context vectors and

use them to train a parser from scratch2 Using Facebookrsquos word vectors to train a parser [Bojanowski et al

2017]3 Using my own word and context vectors trained with different

language but from the same language family4 Applying transfer learning with a pre-trained parser

Language (1) (2) (3) (4)af afribooms not provided 7546 7743 7812kk ktb 2019 2231 2196 2386bxr bdt 764 976 993 898

kmr mg 2012 2257 2278 2339

Table LAS values for strategies (1) (2) (3) and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the mostbeneficial

From scratch LM training does not bring useful word and contextvectors

Our word and context vectors are still more useful than Facebookrsquos[Bojanowski et al 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition Based Parser can only build projective trees 6

6Figure fromhttpstplingfiluuse sarakurser5LN455-2014lectures5LN455-F8pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123

Projective vs Non-projective

We compared our model with the best model for different projectivityratios

Language Projectiviy Best (LAS) Our (LAS)

grc perseus 907 7939 5503 (20)

eu bdt 9513 8422 7413 (17)

hu szeged 978 8266 6818 (14)

da ddt 9826 8628 7640 (17)

en gum 996 8505 7644 (15)

gl treegal 100 7425 7045 (10)

gl ctg 100 8212 7945 (14)

Table Our models performance gap decreases as the projectivity ratio increases

7

7From official results page and our projectivity tableOmer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusionWe introduced ldquoContext Word and Morph-featrdquo embeddings and showedtheir contribution in transition based dependency parsing

Our Tree-stack LSTM outperformed MLP by removing hand-craftedfeature engineering

Tree-stack LSTM performed better with low resource languages

When the training dataset size increases tree-stack LSTM losses itsadvantage

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization morphological taggingand dependency parsing performed better Some are also jointly trained alanguage model together with pre-trained embeddings

Attention Mechanism

Applying attention in between σ-LSTM states or β-LSTM orAction-LSTM may bring performance improvement

Morphological Features

Finding different way to represent morphological features

Dynamic Oracle vs Beam Training

Although I tried both of them I could not obtain performanceimprovement There may be convergence problems with our loss functionand another losses (CRF) may solve this problem

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap Erenay Dayanık and Deniz Yuret 2018 Tree-stackLSTM in Transition Based Dependency Parsing In Proceedings ofthe CoNLL 2018 Shared Task Multilingual Parsing from Raw Text toUniversal Dependencies

Omer Kırnap Berkay Furkan Onder and Deniz Yuret 2017 Parsingwith Context Embeddings In Proceedings of the CoNLL 2017 SharedTask Multilingual Parsing from Raw Text to Universal Dependencies

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann Carlos Gomez-Rodriguez and Giorgio Satta 2011Dynamic programming algorithms for transition-based dependencyparsers In Proceedings of the 49th Annual Meeting of theAssociation for Computational Linguistics Human LanguageTechnologies-Volume 1 Association for Computational Linguisticspages 673682

S Kbler R McDonald and J Nivre 2009 Dependency parsingMorgan amp Claypool US

Chris Dyer Miguel Ballesteros Wang Ling Austin Matthews andNoah A Smith 2015 Transition based dependency parsing withstack long-short term memory CoRR abs150508075

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123

  • Introduction
    • Overview of Dependency Parsing
    • Transition Based Dependency Parsing
      • Related Work
        • Linear Models and their Drawbacks
        • Neural Network Models
          • Model
            • Language Model
            • MLP Parser
            • Tree-stack LSTM Parser
              • Results
                • MLP vs Tree-stack LSTM
                • Morphological Feature Embeddings
                • Static vs Dynamic Oracle Training
                • Transfer Learning
                  • Conclusion
                  • Future Work amp Discussions
Page 23: Transition Based Dependency Parsing with Deep LearningTransition Based Dependency Parsing with Deep Learning Omer K rnap Ko˘c University okirnap@ku.edu.tr September 27, 2018 Omer

Omer Kırnap (Koc University) MSc Thesis September 27 2018 23 123

Problem Definition

Find a model learning to decide correct transition from current state

Omer Kırnap (Koc University) MSc Thesis September 27 2018 24 123

2 Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 25 123

Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 26 123

Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 27 123

Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 28 123

Related Work

Neural Networks for Feature Conjunctions

Neural networks can handle feature conjunctions and nonlinearityHoweverImpractical for high dimensional inputs they scale linearly in inputdimensions (in both time and space assuming fixed number of hiddenunits)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 29 123

Related Work

Solution Using dense embeddings for input features

Omer Kırnap (Koc University) MSc Thesis September 27 2018 30 123

Overview

1 IntroductionOverview of Dependency ParsingTransition Based Dependency Parsing

2 Related WorkLinear Models and their DrawbacksNeural Network Models

3 ModelLanguage ModelMLP ParserTree-stack LSTM Parser

4 ResultsMLP vs Tree-stack LSTMMorphological Feature EmbeddingsStatic vs Dynamic Oracle TrainingTransfer Learning

5 Conclusion6 Future Work amp Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 31 123

3 Model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 32 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to UniversalDependencies

CoNLL17bull Koc-University team with MLP Parser using Context Embeddings

CoNLL18bull KParse team with Tree-stack LSTM Parser using Context and

Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 33 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to UniversalDependencies

CoNLL17

bull Koc-University team with MLP Parser using ContextEmbeddings

CoNLL18

bull KParse team with Tree-stack LSTM Parser using Context and Morph-featEmbeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 34 123

a Language Model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 35 123

Language Model (LM)

LM is used to obtain Context and Word embeddings with twocomponents

Character Based LSTM extracts word vectors

Word Based BiLSTM extracts context vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 36 123

Language Model - Word vectors

Character based LSTM generates word Vectors

Figure Character LSTM from Kırnap et al 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 37 123

Language Model - Context Vectors

Word based BiLSTM generates Context Vectors

Figure Word BiLSTM from Kırnap et al 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 38 123

b MLP Parser (CoNLL17)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 39 123

MLP Parser

MLP Parser consists of 4 components

Character Based LSTM extracts word vectors

Word Based BiLSTM extracts context vectors

Feature extractor describes current state

Decision module (MLP) decides the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 40 123

MLP Parser - Feature Extraction

Feature extractor describes current state

Figure Kırnap et al 2017Omer Kırnap (Koc University) MSc Thesis September 27 2018 41 123

MLP Parser - Decision Module

Decision module (MLP) decides the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 42 123

Experiments amp Dataset (MLP) CoNLL17

CoNLL17 Dataset

Dependency parsing of 81 treebanks in 49 languages

All treebanks use standardized annotationbull 17 universal part-of-speech tagsbull 37 universal dependency relations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 43 123

Experiments - Evaluation Metric

Labeled Attachment Score (LAS)The percentage of words correctly assigned both the correct syntactic headand the correct dependency label

Economic news hadGold Tree LAS 1

SBJATT

Economic news had

PREDOBJPred 1 LAS 0

Economic news had

OBJATTPred 2 LAS (frac12)100

Omer Kırnap (Koc University) MSc Thesis September 27 2018 44 123
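The metric can be sketched directly from per-word (head, label) pairs; the arcs below follow the toy example above.

```python
def las(gold, pred):
    """Labeled Attachment Score: percentage of words whose predicted
    (head, label) pair exactly matches the gold annotation."""
    assert len(gold) == len(pred)
    correct = sum(1 for g, p in zip(gold, pred) if g == p)
    return 100.0 * correct / len(gold)

# "Economic news had": one (head_index, label) pair per dependent word
gold = [(2, "ATT"), (3, "SBJ")]          # Economic <- news, news <- had
pred_wrong = [(2, "OBJ"), (3, "PRED")]   # both labels wrong -> LAS 0
pred_half = [(2, "ATT"), (3, "OBJ")]     # one of two correct -> LAS 50
```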

Experiments (MLP)

CoNLL 2017 Results (all treebanks LAS)

Ranked 1st among transition based parsers 5

5 Source CoNLL17 official results page
Omer Kırnap (Koc University) MSc Thesis September 27 2018 45 123

Contributions in CoNLL17

Omer Kırnap (Koc University) MSc Thesis September 27 2018 46 123

Context and Word Embeddings

Relative contributions of part-of-speech (p) word vector (v)context vector (c)

Feats   Hungarian   En-ParTUT   Latvian
p       63.6        76.6        55.9
v       73.5        75.9        63.0
c       72.2        76.0        63.5
v-c     76.0        79.0        67.6
p-c     78.0        82.5        70.6
p-v     76.6        80.8        67.7
p-fb    74.7        79.7        66.3
p-v-c   79.3        83.2        74.2

Omer Kırnap (Koc University) MSc Thesis September 27 2018 47 123

Context and Word Embeddings

Relative contributions of part-of-speech (p) word vector (v)context vector (c)

Feats   Hungarian   En-ParTUT   Latvian
p       63.6        76.6        55.9
v       73.5        75.9        63.0
c       72.2        76.0        63.5
v-c     76.0        79.0        67.6
p-c     78.0        82.5        70.6
p-v     76.6        80.8        67.7
p-fb    74.7        79.7        66.3
p-v-c   79.3        83.2        74.2

Context vectors provide independent contribution on top ofPOS tags

Omer Kırnap (Koc University) MSc Thesis September 27 2018 48 123

Context and Word embeddings

Feats   Hungarian   En-ParTUT   Latvian
p       63.6        76.6        55.9
v       73.5        75.9        63.0
c       72.2        76.0        63.5
v-c     76.0        79.0        67.6
p-c     78.0        82.5        70.6
p-v     76.6        80.8        67.7
p-fb    74.7        79.7        66.3
p-v-c   79.3        83.2        74.2

Our BiLSTM language model word vectors perform betterthan FB vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 49 123

Context and Word embeddings

Feats   Hungarian   En-ParTUT   Latvian
p       63.6        76.6        55.9
v       73.5        75.9        63.0
c       72.2        76.0        63.5
v-c     76.0        79.0        67.6
p-c     78.0        82.5        70.6
p-v     76.6        80.8        67.7
p-fb    74.7        79.7        66.3
p-v-c   79.3        83.2        74.2

Both POS tags and context vectors have significantcontributions on top of word vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 50 123

Issues with MLP

However

Choosing correct state of parser still remains critical

We are unable to represent the whole parsing history with feature extraction

Omer Kırnap (Koc University) MSc Thesis September 27 2018 51 123

Solution

Find a recurrent architecture such that it can summarize the parsinghistory as well as word sequences in a buffer and stack

Omer Kırnap (Koc University) MSc Thesis September 27 2018 52 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to UniversalDependencies

CoNLL17

bull Koc-University team with MLP Parser using Context Embeddings

CoNLL18

bull KParse team with Tree-stack LSTM Parser usingContext and Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 53 123

c Tree-stack LSTM Parser (CoNLL18)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 54 123

Related Work - Stack LSTM

Figure Stack LSTM [Dyer et al 2015]

Represent each component (σ, β, A) with an LSTM
Modifying the head word's embedding with the dependent embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 55 123

Problems with Stack LSTM

They only modify the stack's word embeddings

Hidden states of LSTMs are not updated unless a reduce occurs

Actions are not explicitly represented

They only used word2vec embeddings [Mikolov et al 2013]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 56 123

Our solution

We propose

Context embeddings should improve parsing accuracy

Dependency relations should be explicitly represented

Morphological Features of a word may enhance parsing accuracy

Omer Kırnap (Koc University) MSc Thesis September 27 2018 57 123

Tree-stack LSTM Overview

Figure Tree-stack LSTM: outputs of the β-LSTM, σ-LSTM, Action-LSTM, and t-RNN are concatenated and fed to an MLP

We propose the Tree-stack LSTM model with 4 components:
1 β-LSTM
2 σ-LSTM
3 Action-LSTM
4 Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 58 123

Tree-stack LSTM

Input Representation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 59 123

Input Representation

Action and Dependency Relation Embeddings

Every action is represented with continuous vector

Every dependency relation is represented with continuous vector

Omer Kırnap (Koc University) MSc Thesis September 27 2018 60 123

Input Representation

We do not include an explicit feature extractor. We initiate the word representation by concatenating:

Character Based LSTM's word vectors

Word Based BiLSTM's context vectors

Part-of-speech (POS) vectors

Morph-feat vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 61 123

Input Representation

Morph-feat Vectors

Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs IT It

Figure Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 62 123
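A minimal sketch of one plausible scheme: split the UD FEATS string into feature=value pairs and sum one embedding per pair. The thesis's exact combination may differ; `DIM` and the random initialization are illustrative.

```python
import random

random.seed(1)
DIM = 4  # toy embedding size

def morph_feat_vector(feats, table):
    """Parse a UD FEATS string like 'Case=Nom|Number=Sing' and sum the
    embedding of each feature=value pair (new pairs get a fresh embedding)."""
    vec = [0.0] * DIM
    if feats == "_":                      # UD uses '_' for no features
        return vec
    for pair in feats.split("|"):
        emb = table.setdefault(pair, [random.uniform(-0.1, 0.1) for _ in range(DIM)])
        vec = [a + b for a, b in zip(vec, emb)]
    return vec

table = {}
v = morph_feat_vector("Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs", table)
```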

Tree-stack LSTM

Model Components
1 β-LSTM
2 σ-LSTM
3 Action-LSTM
4 Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 63 123

β-LSTM

Figure Tree-stack LSTM overview (β-LSTM component)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 64 123

β-LSTM

Figure Buffer's β-LSTM over buffer words w_i, w_i+1, w_i+2

Omer Kırnap (Koc University) MSc Thesis September 27 2018 65 123

σ-LSTM

Figure Tree-stack LSTM overview (σ-LSTM component)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 66 123

σ-LSTM

Figure Stack's σ-LSTM over stack items s_i, s_i+1, s_i+2

Omer Kırnap (Koc University) MSc Thesis September 27 2018 67 123

Action-LSTM

Figure Tree-stack LSTM overview (Action-LSTM component)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 68 123

Action-LSTM

Figure Action-LSTM over the transition history

Omer Kırnap (Koc University) MSc Thesis September 27 2018 69 123

How are the components of tree-stack LSTM connected?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 70 123

Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 71 123

Tree-RNN (t-RNN)

Figure t-RNN with inputs: head word, dependent word, dependency relation

w_head_new = tanh(W_rnn ∗ [w_head_old ; d_l ; w_dep] + b_rnn)   (1)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 72 123
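Equation (1) is a single tanh layer over the concatenated head, relation, and dependent vectors. A pure-Python sketch with toy dimensions and random weights (the real W_rnn and b_rnn are learned):

```python
import math, random

random.seed(2)

def t_rnn(w_head, d_rel, w_dep, W, b):
    """New head embedding: tanh(W_rnn * [w_head; d_rel; w_dep] + b_rnn), eq. (1)."""
    x = w_head + d_rel + w_dep            # concatenation of the three inputs
    return [math.tanh(sum(wi * xi for wi, xi in zip(row, x)) + bi)
            for row, bi in zip(W, b)]

dim, rel_dim = 4, 2                       # toy word and relation dimensions
W = [[random.uniform(-0.1, 0.1) for _ in range(2 * dim + rel_dim)] for _ in range(dim)]
b = [0.0] * dim
new_head = t_rnn([0.1] * dim, [0.2] * rel_dim, [0.3] * dim, W, b)
```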

Tree-RNN with

1 Left Transition
2 Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 73 123

Left Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 74 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure Each embedding is initiated by concatenating POS, language, and morph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 75 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure Stack's top LSTM is reduced
Omer Kırnap (Koc University) MSc Thesis September 27 2018 76 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure t-RNN calculates the new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 77 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure β-LSTM recalculates its hidden state based on the new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 78 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure Tree-stack LSTM is ready for the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 79 123

Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 80 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure Each embedding is initiated by concatenating POS, language, and morph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 81 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure Stack's top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 82 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure t-RNN calculates the new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 83 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure σ-LSTM recalculates its hidden state from the new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 84 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure Tree-stack LSTM is ready for the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 85 123
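The two transition rules above can be sketched as state updates on (stack, buffer, arcs); an arc (h, d, w) records head h, relation d, dependent w. This is a minimal sketch of the transition system only, and the LSTM state recalculation is omitted.

```python
def left_arc(stack, buffer, arcs, d):
    """left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)}):
    pop s; the buffer front b becomes its head with relation d."""
    s, b = stack.pop(), buffer[0]
    arcs.add((b, d, s))

def right_arc(stack, buffer, arcs, d):
    """right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)}):
    pop t; the next stack item s becomes its head with relation d."""
    t = stack.pop()
    s = stack[-1]
    arcs.add((s, d, t))

def shift(stack, buffer):
    stack.append(buffer.pop(0))

# toy run over word indices 1..3 (0 = root)
stack, buffer, arcs = [0], [1, 2, 3], set()
shift(stack, buffer)                   # stack [0, 1], buffer [2, 3]
left_arc(stack, buffer, arcs, "SBJ")   # adds arc (2, SBJ, 1)
shift(stack, buffer)                   # stack [0, 2], buffer [3]
shift(stack, buffer)                   # stack [0, 2, 3], buffer []
right_arc(stack, buffer, arcs, "OBJ")  # adds arc (2, OBJ, 3)
```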

Final overview of Tree-stack LSTM

Figure Tree-stack LSTM: outputs of the β-LSTM, σ-LSTM, Action-LSTM, and t-RNN are concatenated and fed to an MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 86 123

Overview

1 Introduction
  Overview of Dependency Parsing
  Transition Based Dependency Parsing
2 Related Work
  Linear Models and their Drawbacks
  Neural Network Models
3 Model
  Language Model
  MLP Parser
  Tree-stack LSTM Parser
4 Results
  MLP vs Tree-stack LSTM
  Morphological Feature Embeddings
  Static vs Dynamic Oracle Training
  Transfer Learning
5 Conclusion
6 Future Work & Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 87 123

4 Results amp Comparisons

Omer Kırnap (Koc University) MSc Thesis September 27 2018 88 123

Results amp Comparisons

Dataset

CoNLL17:
  Dependency parsing of 81 treebanks in 49 languages
  All treebanks use standardized annotation:
    17 universal part-of-speech tags
    37 universal dependency relations
  Koc-University ranked 7th out of 33 participants (1st among transition based parsers)

CoNLL18:
  Dependency parsing of 82 treebanks in 57 languages
  All treebanks use standardized annotation:
    17 universal part-of-speech tags
    37 universal dependency relations
  Koc-University ranked 16th out of 30 participants (2nd among transition based parsers)

Differences between the two tasks: 1 Train/test split change 2 Annotation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 89 123

MLP vs Tree-stack LSTM

CoNLL 2018 committee released comparison results of CoNLL17 andCoNLL18 systems tested under the same test sets

Omer Kırnap (Koc University) MSc Thesis September 27 2018 90 123

MLP vs Tree-stack LSTM

2 possible problems of official comparison

1 If the annotation of the treebank is improved, the older parser is handicapped

2 If the training-test split has changed and the old training data are now in the test data, the old parser is favored undeservedly

Omer Kırnap (Koc University) MSc Thesis September 27 2018 91 123

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code        MLP    Tree-stack
ru taiga (10k)   58.89  60.55
hu szeged (20k)  66.21  68.18
tr imst (50k)    56.78  58.75
ar padt (120k)   67.83  68.14
en ewt (205k)    74.87  75.77
cs cac (473k)    83.39  83.57

Tree-stack LSTM outperforms MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 92 123

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 93 123

MLP Parser

MLP

Figure Initial model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 94 123

Only Action LSTM

LSTM LSTM

Figure Only action LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 95 123

Only β-LSTM

LSTM LSTM LSTM

LSTM LSTM MLP

Figure Only β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 96 123

Only σ-LSTM

LSTM LSTM

LSTM LSTM MLP

Figure Only σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang Code   MLP    Only Action  Only-β  Only-σ
hu szeged   66.21  66.87        66.94   67.03
sv lines    71.12  72.05        72.17   72.45
tr imst     57.12  56.87        57.02   57.12
ar padt     67.83  66.67        66.89   66.92
cs cac      83.89  82.23        83.13   83.17
en ewt      75.54  75.43        75.56   75.67

Table Comparison between MLP and "Only" models

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN

Figure Tree-stack LSTM: outputs of the β-LSTM, σ-LSTM, Action-LSTM, and t-RNN are concatenated and fed to an MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 99 123

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang Code            without t-RNN  with t-RNN
no nynorsklia (3k)   51.78          53.33
ru taiga (11k)       59.13          60.55
gl treegal (15k)     69.76          70.45
hu szeged (20k)      66.12          68.18
sv lines (49k)       74.04          75.46
tr imst (50k)        58.12          58.75
ar padt (120k)       68.04          68.14
en ewt (204k)        74.87          75.77
cs cac (473k)        82.89          83.57
cs pdt (1M)          81.17          81.164

t-RNN provides comparative advantage for low-resourcelanguages

Omer Kırnap (Koc University) MSc Thesis September 27 2018 100 123

Ablation Analysis

Overall results of ablation analysis

Lang       MLP    Only A  Only-β  Only-σ  w/o t-RNN  all
hu szeged  66.21  66.87   66.94   67.03   66.12      68.18
sv lines   71.12  72.05   72.17   74.04   72.17      75.46
tr imst    57.12  56.87   57.02   57.12   58.12      58.75
ar padt    67.83  66.67   66.89   66.92   68.04      68.14
cs cac     83.89  82.23   83.13   83.17   82.89      83.57
en ewt     75.54  75.43   75.56   75.67   74.87      75.77

Tree-stack LSTM beats other model variations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of Ablation Experiments

t-RNN's performance contribution increases when the training size decreases

σ-LSTM provides more useful information independent of dataset size

Interconnecting the model's components with t-RNN makes tree-stack LSTM more powerful for low-resource languages (ranked 10th overall and 2nd among transition based parsers)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What does Morphological Feature Embedding provide

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental Settings
We divide the CoNLL18 UD dataset into 4 parts based on the number of training tokens for each language to better understand our contributions:

Languages having less than 20k tokens

Languages having more than 20k less than 50k tokens

Languages having more than 50k less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having less than 20k training tokens

Lang code      Morph-Feats  no Morph-Feats  # of tokens
no nynorsklia  51.13        53.33           3583
ru taiga       58.32        60.55           10479
sme giella     52.78        53.39           16385
la perseus     49.93        51.60           18184
ug udt         52.78        53.39           19262
sl sst         46.72        48.77           19473
hu szeged      66.23        68.18           20166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having between 50k and 100k training tokens

Lang code      Morph-Feats  no Morph-Feats  # of tokens
sv lines       72.18        74.81           48325
fr sequoia     84.36        82.17           50543
en gum         76.44        75.34           53686
ko gsd         73.74        72.54           56687
eu bdt         74.55        73.32           72974
nl lassysmall  76.70        75.80           75134
gl ctg         79.02        79.018          79327
lv lvtb        72.33        72.24           80666
id gsd         75.76        73.97           97531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having more than 100k training tokens

Lang code  Morph-Feats  no Morph-Feats  # of tokens
fa seraji  81.18        81.12           121064
bg btb     84.53        84.55           124336
en ewt     75.77        75.682          204585
ar padt    68.02        68.14           223881
de gsd     71.59        71.32           263804
ca ancora  85.89        85.874          417587
es ancora  84.99        84.78           444617
cs cac     83.57        83.63           472608
cs pdt     81.43        82.12           1173282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle: transitions follow gold moves
Dynamic oracle: transitions follow predicted moves

In both cases, log p of gold moves is maximized

Figure Tree-stack LSTM: outputs of the β-LSTM, σ-LSTM, Action-LSTM, and t-RNN are concatenated and fed to an MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123
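A sketch of the difference during training: the static oracle always follows the gold move, while the dynamic oracle sometimes follows the model's own (possibly wrong) prediction, so training visits states the parser actually reaches at test time. The `explore_p` parameter and function names are hypothetical illustrations, not the thesis's exact training procedure.

```python
import random

def choose_transition(gold_move, model_move, oracle, explore_p=0.9, rng=random):
    """Pick the transition actually executed during training.
    log p(gold move) is maximized in both regimes; only the executed
    move (and hence the visited states) differs."""
    if oracle == "static":
        return gold_move
    # dynamic oracle: follow the model's move with probability explore_p
    return model_move if rng.random() < explore_p else gold_move

rng = random.Random(3)
static = [choose_transition("shift", "left-arc", "static", rng=rng) for _ in range(5)]
dynamic = [choose_transition("shift", "left-arc", "dynamic", rng=rng) for _ in range(100)]
```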

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning:

1 Using very limited data to train an LM for word and context vectors, and using them to train a parser from scratch
2 Using Facebook's word vectors to train a parser [Bojanowski et al 2017]
3 Using my own word and context vectors trained on a different language from the same language family
4 Applying transfer learning with a pre-trained parser

Language      (1)           (2)    (3)    (4)
af afribooms  not provided  75.46  77.43  78.12
kk ktb        20.19         22.31  21.96  23.86
bxr bdt       7.64          9.76   9.93   8.98
kmr mg        20.12         22.57  22.78  23.39

Table LAS values for strategies (1), (2), (3) and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the mostbeneficial

From scratch LM training does not bring useful word and contextvectors

Our word and context vectors are still more useful than Facebook's [Bojanowski et al 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

A transition based parser can only build projective trees 6

6 Figure from https://stp.lingfil.uu.se/~sara/kurser/5LN455-2014/lectures/5LN455-F8.pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123
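Projectivity can be checked by testing whether any two dependency arcs cross. A small sketch over 1-based word ids with 0 as the root (the `heads` convention is an assumption for illustration):

```python
def is_projective(heads):
    """heads[i] is the head of word i+1 (1-based word ids; 0 is the root).
    A dependency tree is projective iff no two arcs cross."""
    arcs = [(min(h, d), max(h, d)) for d, h in enumerate(heads, start=1)]
    for i, (a, b) in enumerate(arcs):
        for c, d in arcs[i + 1:]:
            if a < c < b < d or c < a < d < b:   # spans strictly interleave
                return False
    return True

nested = is_projective([2, 0, 2, 2])    # arcs nest: projective
crossed = is_projective([3, 0, 2, 2])   # arcs (1,3) and (0,2) cross: not projective
```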

Projective vs Non-projective

We compared our model with the best model for different projectivity ratios

Language     Projectivity  Best (LAS)  Our (LAS)
grc perseus  90.7          79.39       55.03 (20)
eu bdt       95.13         84.22       74.13 (17)
hu szeged    97.8          82.66       68.18 (14)
da ddt       98.26         86.28       76.40 (17)
en gum       99.6          85.05       76.44 (15)
gl treegal   100           74.25       70.45 (10)
gl ctg       100           82.12       79.45 (14)

Table Our model's performance gap decreases as the projectivity ratio increases

7 From official results page and our projectivity table
Omer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusion
We introduced "Context, Word, and Morph-feat" embeddings and showed their contribution in transition based dependency parsing

Our Tree-stack LSTM outperformed MLP by removing hand-crafted feature engineering

Tree-stack LSTM performed better with low resource languages

When the training dataset size increases, tree-stack LSTM loses its advantage

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization, morphological tagging, and dependency parsing performed better. Some also jointly train a language model together with pre-trained embeddings

Attention Mechanism

Applying attention between σ-LSTM states, β-LSTM states, or Action-LSTM states may bring a performance improvement

Morphological Features

Finding a different way to represent morphological features

Dynamic Oracle vs Beam Training

Although I tried both of them, I could not obtain a performance improvement. There may be convergence problems with our loss function, and other losses (e.g. CRF) may solve this problem

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap Erenay Dayanık and Deniz Yuret 2018 Tree-stackLSTM in Transition Based Dependency Parsing In Proceedings ofthe CoNLL 2018 Shared Task Multilingual Parsing from Raw Text toUniversal Dependencies

Omer Kırnap Berkay Furkan Onder and Deniz Yuret 2017 Parsingwith Context Embeddings In Proceedings of the CoNLL 2017 SharedTask Multilingual Parsing from Raw Text to Universal Dependencies

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Piotr Bojanowski, Edouard Grave, Armand Joulin, and Tomas Mikolov. 2017. Enriching word vectors with subword information. Transactions of the Association for Computational Linguistics 5:135-146.

Chris Dyer, Miguel Ballesteros, Wang Ling, Austin Matthews, and Noah A. Smith. 2015. Transition-based dependency parsing with stack long short-term memory. CoRR abs/1505.08075.

Marco Kuhlmann, Carlos Gómez-Rodríguez, and Giorgio Satta. 2011. Dynamic programming algorithms for transition-based dependency parsers. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies - Volume 1. Association for Computational Linguistics, pages 673-682.

S. Kübler, R. McDonald, and J. Nivre. 2009. Dependency Parsing. Morgan & Claypool, US.

Tomas Mikolov, Ilya Sutskever, Kai Chen, Greg S. Corrado, and Jeff Dean. 2013. Distributed representations of words and phrases and their compositionality. In Advances in Neural Information Processing Systems.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123

Page 24: Transition Based Dependency Parsing with Deep LearningTransition Based Dependency Parsing with Deep Learning Omer K rnap Ko˘c University okirnap@ku.edu.tr September 27, 2018 Omer

Problem Definition

Find a model learning to decide correct transition from current state

Omer Kırnap (Koc University) MSc Thesis September 27 2018 24 123

2 Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 25 123

Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 26 123

Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 27 123

Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 28 123

Related Work

Neural Networks for Feature Conjunctions

Neural networks can handle feature conjunctions and nonlinearityHoweverImpractical for high dimensional inputs they scale linearly in inputdimensions (in both time and space assuming fixed number of hiddenunits)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 29 123

Related Work

Solution Using dense embeddings for input features

Omer Kırnap (Koc University) MSc Thesis September 27 2018 30 123

Overview

1 IntroductionOverview of Dependency ParsingTransition Based Dependency Parsing

2 Related WorkLinear Models and their DrawbacksNeural Network Models

3 ModelLanguage ModelMLP ParserTree-stack LSTM Parser

4 ResultsMLP vs Tree-stack LSTMMorphological Feature EmbeddingsStatic vs Dynamic Oracle TrainingTransfer Learning

5 Conclusion6 Future Work amp Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 31 123

3 Model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 32 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to UniversalDependencies

CoNLL17bull Koc-University team with MLP Parser using Context Embeddings

CoNLL18bull KParse team with Tree-stack LSTM Parser using Context and

Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 33 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to UniversalDependencies

CoNLL17

bull Koc-University team with MLP Parser using ContextEmbeddings

CoNLL18

bull KParse team with Tree-stack LSTM Parser using Context and Morph-featEmbeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 34 123

a Language Model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 35 123

Language Model (LM)

LM is used to obtain Context and Word embeddings with twocomponents

Character Based LSTM extracts word vectors

Word Based BiLSTM extracts context vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 36 123

Language Model - Word vectors

Character based LSTM generates word Vectors

Figure Character LSTM from Kırnap et al 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 37 123

Language Model - Context Vectors

Word based BiLSTM generates Context Vectors

Figure Word BiLSTM from Kırnap et al 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 38 123

b MLP Parser (CoNLL17)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 39 123

MLP Parser

MLP Parser consists of 4 components

Character Based LSTM extracts word vectors

Word Based BiLSTM extracts context vectors

Feature extractor describes current state

Decision module (MLP) decides the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 40 123

MLP Parser - Feature Extraction

Feature extractor describes current state

Figure Kırnap et al 2017Omer Kırnap (Koc University) MSc Thesis September 27 2018 41 123

MLP Parser - Decision Module

Decision module (MLP) decides the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 42 123

Experiments amp Dataset (MLP) CoNLL17

CoNLL17 Dataset

Dependency parsing of 81 treebanks in 49 languages

All treebanks use standardized annotationbull 17 universal part-of-speech tagsbull 37 universal dependency relations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 43 123

Experiments - Evaluation Metric

Labeled Attachment Score (LAS)The percentage of words correctly assigned both the correct syntactic headand the correct dependency label

Economic news hadGold Tree LAS 1

SBJATT

Economic news had

PREDOBJPred 1 LAS 0

Economic news had

OBJATTPred 2 LAS (frac12)100

Omer Kırnap (Koc University) MSc Thesis September 27 2018 44 123

Experiments (MLP)

CoNLL 2017 Results (all treebanks LAS)

Ranked 1st among transition based parsers 5

5Source CoNLL17 official results pageOmer Kırnap (Koc University) MSc Thesis September 27 2018 45 123

Contributions in CoNLL17

Omer Kırnap (Koc University) MSc Thesis September 27 2018 46 123

Context and Word Embeddings

Relative contributions of part-of-speech (p) word vector (v)context vector (c)

Feats Hungarian En-ParTUT Latvianp 636 766 559

v 735 759 63

c 722 76 635

v-c 76 79 676

p-c 78 825 706

p-v 766 808 677

p-fb 747 797 663

p-v-c 793 832 742

Omer Kırnap (Koc University) MSc Thesis September 27 2018 47 123

Context and Word Embeddings

Relative contributions of part-of-speech (p) word vector (v)context vector (c)

Feats Hungarian En-ParTUT Latvianp 636 766 559

v 735 759 63

c 722 76 635

v-c 76 79 676

p-c 78 825 706

p-v 766 808 677

p-fb 747 797 663

p-v-c 793 832 742

Context vectors provide independent contribution on top ofPOS tags

Omer Kırnap (Koc University) MSc Thesis September 27 2018 48 123

Context and Word embeddings

Feats Hungarian En-ParTUT Latvianp 636 766 559

v 735 759 63

c 722 76 635

v-c 76 79 676

p-c 78 825 706

p-v 766 808 677p-fb 747 797 663

p-v-c 793 832 742

Our BiLSTM language model word vectors perform betterthan FB vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 49 123

Context and Word embeddings

Feats Hungarian En-ParTUT Latvianp 636 766 559

v 735 759 63

c 722 76 635

v-c 76 79 676

p-c 78 825 706

p-v 766 808 677

p-fb 747 797 663

p-v-c 793 832 742

Both POS tags and context vectors have significantcontributions on top of word vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 50 123

Issues with MLP

However

Choosing correct state of parser still remains critical

We are unable to represent whole parsing history with featureextracting

Omer Kırnap (Koc University) MSc Thesis September 27 2018 51 123

Solution

Find a recurrent architecture such that it can summarize the parsinghistory as well as word sequences in a buffer and stack

Omer Kırnap (Koc University) MSc Thesis September 27 2018 52 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to UniversalDependencies

CoNLL17

bull Koc-University team with MLP Parser using Context Embeddings

CoNLL18

bull KParse team with Tree-stack LSTM Parser usingContext and Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 53 123

c Tree-stack LSTM Parser (CoNLL18)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 54 123

Related Work - Stack LSTM

Figure Stack LSTM [Dyer et al 2015]

Represent each component (σ β A) with an LSTMModifying head wordrsquos embedding with dependent embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 55 123

Problems with Stack LSTM

They only modify stackrsquos word embeddings

Hidden states of LSTMS are not updated unless reduce

Actions are not explicitly represented

They only used word2vec embeddings [Mikolov et al 2013]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 56 123

Our solution

We propose

Context embeddings should improve parsing accuracy

Dependency relations should be explicitly represented

Morphological Features of a word may enhance parsing accuracy

Omer Kırnap (Koc University) MSc Thesis September 27 2018 57 123

Tree-stack LSTM Overview

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

We propose Tree-stack LSTM model with 4 components

β-LSTMσ-LSTMAction-LSTMTree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 58 123

Tree-stack LSTM

Input Representation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 59 123

Input Representation

Action and Dependency Relation Embeddings

Every action is represented with continuous vector

Every dependency relation is represented with continuous vector

Omer Kırnap (Koc University) MSc Thesis September 27 2018 60 123

Input Representation

We do not include explicit feature extractor We initiated wordrepresentation by concatenating

Character Based LSTMrsquos word vectors

Word Based BiLSTMrsquos context vectors

Part-of-speech (POS) vectors

Morph-feat vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 61 123

Input Representation

Morp-feat Vectors

Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs IT It

Figure Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 62 123

Tree-stack LSTM

Model Components1 β-LSTM2 σ-LSTM3 Action-LSTM4 Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 63 123

β-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 64 123

β-LSTM

LSTM LSTM LSTM

wi+2wi+1wi

Figure Bufferrsquos β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 65 123

σ-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 66 123

σ-LSTM

LSTM LSTM LSTM

si si+1 si+2

Figure Stackrsquos σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 67 123

Action-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 68 123

Action-LSTM

LSTM LSTM LSTM

Figure Action-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 69 123

How do components of tree-stack LSTM are connected

Omer Kırnap (Koc University) MSc Thesis September 27 2018 70 123

Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 71 123

Tree-RNN (t-RNN)

t-RNN

Dependent word

Dependency Relation

Head word

Figure t-RNN

whead new = tanh(Wrnn lowast [whead old dl wdep] + brnn) (1)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 72 123

Tree-RNN with

1. Left Transition
2. Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 73 123
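The left_d and right_d transitions defined below can be sketched on a (stack, buffer, arc-set) state; this is a toy implementation with words as integer indices, not the thesis code:

```python
def left_arc(stack, buffer, arcs, d):
    """left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)}): the stack top s
    becomes a d-dependent of the buffer front b and is popped."""
    s = stack.pop()
    arcs.add((buffer[0], d, s))

def right_arc(stack, buffer, arcs, d):
    """right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)}): the stack top t
    becomes a d-dependent of the word s below it and is popped."""
    t = stack.pop()
    arcs.add((stack[-1], d, t))

# Toy run on word indices (0 = root): stack [0, 1, 2], buffer [3].
stack, buffer, arcs = [0, 1, 2], [3], set()
right_arc(stack, buffer, arcs, "obj")    # adds (1, "obj", 2)
left_arc(stack, buffer, arcs, "nsubj")   # adds (3, "nsubj", 1)
assert arcs == {(1, "obj", 2), (3, "nsubj", 1)} and stack == [0]
```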

Left Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 74 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

LSTM LSTM LSTM LSTM

Left transition

t-RNN

Dependency Relation

Head  Dependent

Figure Each embedding is initialized by concatenating POS, language and morph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 75 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

Figure Stack's top LSTM is reduced
Omer Kırnap (Koc University) MSc Thesis September 27 2018 76 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 77 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure β-LSTM recalculates its hidden state based on the new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 78 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure Tree-stack LSTM is ready to predict the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 79 123

Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 80 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

LSTM LSTM LSTM LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure Each embedding is initialized by concatenating POS, language and morph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 81 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure Stack's top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 82 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 83 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure σ-LSTM recalculates its hidden state from the new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 84 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure Tree-stack LSTM is ready to predict the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 85 123

Final overview of Tree-stack LSTM

Figure Tree-stack LSTM: the β-, σ- and Action-LSTM hidden states and the t-RNN output over head word, dependent word and dependency relation are concatenated and fed to an MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 86 123
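The decision step in the overview above can be sketched as: concatenate the component hidden states and score the candidate transitions with an MLP. The sizes and the single hidden layer are assumptions for illustration, not the exact thesis architecture:

```python
import numpy as np

H, HIDDEN, N_ACTIONS = 4, 8, 3         # illustrative sizes
rng = np.random.default_rng(2)
W1, b1 = rng.standard_normal((HIDDEN, 3 * H)), rng.standard_normal(HIDDEN)
W2, b2 = rng.standard_normal((N_ACTIONS, HIDDEN)), rng.standard_normal(N_ACTIONS)

def next_transition(h_buffer, h_stack, h_action):
    """Concatenate the β-, σ- and Action-LSTM hidden states and score
    the candidate transitions with a one-hidden-layer MLP."""
    h = np.concatenate([h_buffer, h_stack, h_action])
    scores = W2 @ np.tanh(W1 @ h + b1) + b2
    return int(np.argmax(scores))

a = next_transition(np.ones(H), np.ones(H), np.ones(H))
assert 0 <= a < N_ACTIONS
```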

Overview

1 Introduction: Overview of Dependency Parsing; Transition Based Dependency Parsing

2 Related Work: Linear Models and their Drawbacks; Neural Network Models

3 Model: Language Model; MLP Parser; Tree-stack LSTM Parser

4 Results: MLP vs Tree-stack LSTM; Morphological Feature Embeddings; Static vs Dynamic Oracle Training; Transfer Learning

5 Conclusion

6 Future Work & Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 87 123

4 Results & Comparisons

Omer Kırnap (Koc University) MSc Thesis September 27 2018 88 123

Results amp Comparisons

Dataset

CoNLL17
- Dependency parsing of 81 treebanks in 49 languages
- All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations
- Koc-University ranked 7th out of 33 participants (1st among transition based parsers)

CoNLL18
- Dependency parsing of 82 treebanks in 57 languages
- All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations
- Koc-University ranked 16th out of 30 participants (2nd among transition based parsers)

Differences between CoNLL17 and CoNLL18: 1. Train/test split change 2. Annotation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 89 123

MLP vs Tree-stack LSTM

The CoNLL 2018 committee released comparison results of CoNLL17 and CoNLL18 systems tested under the same test sets.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 90 123

MLP vs Tree-stack LSTM

2 possible problems of the official comparison:

1. If the annotation of the treebank is improved, the older parser is handicapped.

2. If the training-test split has changed and the old training data are now in the test data, the old parser is favored undeservedly.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 91 123

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code        MLP    Tree-stack
ru taiga (10k)   58.89  60.55
hu szeged (20k)  66.21  68.18
tr imst (50k)    56.78  58.75
ar padt (120k)   67.83  68.14
en ewt (205k)    74.87  75.77
cs cac (473k)    83.39  83.57

Tree-stack LSTM outperforms MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 92 123

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 93 123

MLP Parser

MLP

Figure Initial model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 94 123

Only Action LSTM

LSTM LSTM

Figure Only action LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 95 123

Only β-LSTM

LSTM LSTM LSTM

LSTM LSTM MLP

Figure Only β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 96 123

Only σ-LSTM

LSTM LSTM

LSTM LSTM MLP

Figure Only σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang Code  MLP    Only Action  Only-β  Only-σ
hu szeged  66.21  66.87        66.94   67.03
sv lines   71.12  72.05        72.17   72.45
tr imst    57.12  56.87        57.02   57.12
ar padt    67.83  66.67        66.89   66.92
cs cac     83.89  82.23        83.13   83.17
en ewt     75.54  75.43        75.56   75.67

Table Comparison between MLP and "Only" models

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN

Figure Tree-stack LSTM with the t-RNN component highlighted: its output over head word, dependent word and dependency relation is concatenated with the LSTM hidden states and fed to an MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 99 123

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang Code           without t-RNN  with t-RNN
no nynorsklia (3k)  51.78          53.33
ru taiga (11k)      59.13          60.55
gl treegal (15k)    69.76          70.45
hu szeged (20k)     66.12          68.18
sv lines (49k)      74.04          75.46
tr imst (50k)       58.12          58.75
ar padt (120k)      68.04          68.14
en ewt (204k)       74.87          75.77
cs cac (473k)       82.89          83.57
cs pdt (1M)         81.17          81.164

t-RNN provides a comparative advantage for low-resource languages

Omer Kırnap (Koc University) MSc Thesis September 27 2018 100 123

Ablation Analysis

Overall results of ablation analysis

Lang       MLP    Only A  Only-β  Only-σ  w/o t-RNN  all
hu szeged  66.21  66.87   66.94   67.03   66.12      68.18
sv lines   71.12  72.05   72.17   74.04   72.17      75.46
tr imst    57.12  56.87   57.02   57.12   58.12      58.75
ar padt    67.83  66.67   66.89   66.92   68.04      68.14
cs cac     83.89  82.23   83.13   83.17   82.89      83.57
en ewt     75.54  75.43   75.56   75.67   74.87      75.77

Tree-stack LSTM beats other model variations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of Ablation Experiments

t-RNN's performance contribution increases when the training size decreases.

σ-LSTM provides more useful information independent of dataset size.

Interconnecting the model's components with t-RNN makes tree-stack LSTM more powerful for low-resource languages (ranked 10th of all and 2nd among transition based parsers).

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What does Morphological Feature Embedding provide

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental Settings
We divide the CoNLL18 UD v2.2 dataset into 4 parts based on the number of training tokens for each language, to better understand our contributions:

Languages having less than 20k tokens

Languages having more than 20k and less than 50k tokens

Languages having more than 50k and less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having less than 20k training tokens

Lang code      Morph-Feats  no Morph-Feats  # of tokens
no nynorsklia  51.13        53.33           3,583
ru taiga       58.32        60.55           10,479
sme giella     52.78        53.39           16,385
la perseus     49.93        51.6            18,184
ug udt         52.78        53.39           19,262
sl sst         46.72        48.77           19,473
hu szeged      66.23        68.18           20,166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having between 50k and 100k training tokens

Lang code      Morph-Feats  no Morph-Feats  # of tokens
sv lines       72.18        74.81           48,325
fr sequoia     84.36        82.17           50,543
en gum         76.44        75.34           53,686
ko gsd         73.74        72.54           56,687
eu bdt         74.55        73.32           72,974
nl lassysmall  76.7         75.8            75,134
gl ctg         79.02        79.018          79,327
lv lvtb        72.33        72.24           80,666
id gsd         75.76        73.97           97,531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having more than 100k training tokens

Lang code  Morph-Feats  no Morph-Feats  # of tokens
fa seraji  81.18        81.12           121,064
bg btb     84.53        84.55           124,336
en ewt     75.77        75.682          204,585
ar padt    68.02        68.14           223,881
de gsd     71.59        71.32           263,804
ca ancora  85.89        85.874          417,587
es ancora  84.99        84.78           444,617
cs cac     83.57        83.63           472,608
cs pdt     81.43        82.12           1,173,282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle: transitions using gold moves
Dynamic oracle: transitions using predicted moves

In both cases, the log-probability of the gold moves is maximized.

Figure Tree-stack LSTM architecture used in both training regimes: the β-, σ- and Action-LSTM hidden states and the t-RNN output are concatenated and fed to an MLP
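The difference between the two regimes can be sketched as follows: the loss targets the gold move in both, and only the move used to advance the parser state differs.

```python
def transitions_followed(steps, dynamic):
    """steps is a list of (gold_move, predicted_move) pairs. Both
    regimes maximize log p(gold_move); they differ only in which move
    is executed to advance the parser state."""
    return [pred if dynamic else gold for gold, pred in steps]

steps = [("SHIFT", "SHIFT"), ("LEFT", "RIGHT"), ("RIGHT", "RIGHT")]
assert transitions_followed(steps, dynamic=False) == ["SHIFT", "LEFT", "RIGHT"]
assert transitions_followed(steps, dynamic=True) == ["SHIFT", "RIGHT", "RIGHT"]
```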

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning:

1. Using very limited data to train a LM for word and context vectors, and using them to train a parser from scratch
2. Using Facebook's word vectors to train a parser [Bojanowski et al 2017]
3. Using my own word and context vectors trained with a different language but from the same language family
4. Applying transfer learning with a pre-trained parser

Language      (1)           (2)    (3)    (4)
af afribooms  not provided  75.46  77.43  78.12
kk ktb        20.19         22.31  21.96  23.86
bxr bdt       7.64          9.76   9.93   8.98
kmr mg        20.12         22.57  22.78  23.39

Table LAS values for strategies (1) (2) (3) and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the most beneficial.

From-scratch LM training does not bring useful word and context vectors.

Our word and context vectors are still more useful than Facebook's [Bojanowski et al 2017].

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition based parsers can only build projective trees. 6

6 Figure from https://stp.lingfil.uu.se/~sara/kurser/5LN455-2014/lectures/5LN455-F8.pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123
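Projectivity can be checked with a simple crossing-arcs test; this is a sketch where heads[i-1] gives the head of word i and 0 denotes the root:

```python
def is_projective(heads):
    """heads[i-1] is the head of word i (words are 1-based, 0 = root).
    A dependency tree is projective iff no two arcs cross."""
    arcs = [(min(h, d), max(h, d)) for d, h in enumerate(heads, start=1)]
    for a, b in arcs:
        for c, e in arcs:
            if a < c < b < e:          # arcs (a, b) and (c, e) cross
                return False
    return True

assert is_projective([2, 0, 2])         # 1 <- 2 -> 3: projective
assert not is_projective([0, 4, 1, 2])  # arcs (1,3) and (2,4) cross
```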

Projective vs Non-projective

We compared our model with the best model for different projectivity ratios.

Language     Projectivity  Best (LAS)  Our (LAS)
grc perseus  90.7          79.39       55.03 (20)
eu bdt       95.13         84.22       74.13 (17)
hu szeged    97.8          82.66       68.18 (14)
da ddt       98.26         86.28       76.40 (17)
en gum       99.6          85.05       76.44 (15)
gl treegal   100           74.25       70.45 (10)
gl ctg       100           82.12       79.45 (14)

Table Our model's performance gap decreases as the projectivity ratio increases

7 From official results page and our projectivity table
Omer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusion:
We introduced "Context, Word and Morph-feat" embeddings and showed their contribution in transition based dependency parsing.

Our Tree-stack LSTM outperformed MLP by removing hand-crafted feature engineering.

Tree-stack LSTM performed better with low-resource languages.

When the training dataset size increases, tree-stack LSTM loses its advantage.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization, morphological tagging and dependency parsing performed better. Some also jointly train a language model together with pre-trained embeddings.

Attention Mechanism

Applying attention in between σ-LSTM states, β-LSTM or Action-LSTM may bring a performance improvement.

Morphological Features

Finding a different way to represent morphological features.

Dynamic Oracle vs Beam Training

Although I tried both of them, I could not obtain a performance improvement. There may be convergence problems with our loss function, and other losses (e.g. CRF) may solve this problem.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Ömer Kırnap, Erenay Dayanık and Deniz Yuret. 2018. Tree-stack LSTM in Transition Based Dependency Parsing. In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Ömer Kırnap, Berkay Furkan Önder and Deniz Yuret. 2017. Parsing with Context Embeddings. In Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann, Carlos Gómez-Rodríguez and Giorgio Satta. 2011. Dynamic programming algorithms for transition-based dependency parsers. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies - Volume 1. Association for Computational Linguistics, pages 673-682.

S. Kübler, R. McDonald and J. Nivre. 2009. Dependency parsing. Morgan & Claypool, US.

Chris Dyer, Miguel Ballesteros, Wang Ling, Austin Matthews and Noah A. Smith. 2015. Transition-based dependency parsing with stack long short-term memory. CoRR abs/1505.08075.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123

  • Introduction
    • Overview of Dependency Parsing
    • Transition Based Dependency Parsing
      • Related Work
        • Linear Models and their Drawbacks
        • Neural Network Models
          • Model
            • Language Model
            • MLP Parser
            • Tree-stack LSTM Parser
              • Results
                • MLP vs Tree-stack LSTM
                • Morphological Feature Embeddings
                • Static vs Dynamic Oracle Training
                • Transfer Learning
                  • Conclusion
                  • Future Work amp Discussions
Page 25: Transition Based Dependency Parsing with Deep LearningTransition Based Dependency Parsing with Deep Learning Omer K rnap Ko˘c University okirnap@ku.edu.tr September 27, 2018 Omer

2 Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 25 123

Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 26 123

Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 27 123

Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 28 123

Related Work

Neural Networks for Feature Conjunctions

Neural networks can handle feature conjunctions and nonlinearityHoweverImpractical for high dimensional inputs they scale linearly in inputdimensions (in both time and space assuming fixed number of hiddenunits)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 29 123

Related Work

Solution Using dense embeddings for input features

Omer Kırnap (Koc University) MSc Thesis September 27 2018 30 123

Overview

1 IntroductionOverview of Dependency ParsingTransition Based Dependency Parsing

2 Related WorkLinear Models and their DrawbacksNeural Network Models

3 ModelLanguage ModelMLP ParserTree-stack LSTM Parser

4 ResultsMLP vs Tree-stack LSTMMorphological Feature EmbeddingsStatic vs Dynamic Oracle TrainingTransfer Learning

5 Conclusion6 Future Work amp Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 31 123

3 Model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 32 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to UniversalDependencies

CoNLL17bull Koc-University team with MLP Parser using Context Embeddings

CoNLL18bull KParse team with Tree-stack LSTM Parser using Context and

Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 33 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to UniversalDependencies

CoNLL17

bull Koc-University team with MLP Parser using ContextEmbeddings

CoNLL18

bull KParse team with Tree-stack LSTM Parser using Context and Morph-featEmbeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 34 123

a Language Model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 35 123

Language Model (LM)

LM is used to obtain Context and Word embeddings with twocomponents

Character Based LSTM extracts word vectors

Word Based BiLSTM extracts context vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 36 123

Language Model - Word vectors

Character based LSTM generates word Vectors

Figure Character LSTM from Kırnap et al 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 37 123

Language Model - Context Vectors

Word based BiLSTM generates Context Vectors

Figure Word BiLSTM from Kırnap et al 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 38 123

b MLP Parser (CoNLL17)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 39 123

MLP Parser

MLP Parser consists of 4 components

Character Based LSTM extracts word vectors

Word Based BiLSTM extracts context vectors

Feature extractor describes current state

Decision module (MLP) decides the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 40 123

MLP Parser - Feature Extraction

Feature extractor describes current state

Figure Kırnap et al 2017Omer Kırnap (Koc University) MSc Thesis September 27 2018 41 123

MLP Parser - Decision Module

Decision module (MLP) decides the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 42 123

Experiments amp Dataset (MLP) CoNLL17

CoNLL17 Dataset

Dependency parsing of 81 treebanks in 49 languages

All treebanks use standardized annotationbull 17 universal part-of-speech tagsbull 37 universal dependency relations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 43 123

Experiments - Evaluation Metric

Labeled Attachment Score (LAS)The percentage of words correctly assigned both the correct syntactic headand the correct dependency label

Economic news hadGold Tree LAS 1

SBJATT

Economic news had

PREDOBJPred 1 LAS 0

Economic news had

OBJATTPred 2 LAS (frac12)100

Omer Kırnap (Koc University) MSc Thesis September 27 2018 44 123

Experiments (MLP)

CoNLL 2017 Results (all treebanks LAS)

Ranked 1st among transition based parsers 5

5Source CoNLL17 official results pageOmer Kırnap (Koc University) MSc Thesis September 27 2018 45 123

Contributions in CoNLL17

Omer Kırnap (Koc University) MSc Thesis September 27 2018 46 123

Context and Word Embeddings

Relative contributions of part-of-speech (p) word vector (v)context vector (c)

Feats Hungarian En-ParTUT Latvianp 636 766 559

v 735 759 63

c 722 76 635

v-c 76 79 676

p-c 78 825 706

p-v 766 808 677

p-fb 747 797 663

p-v-c 793 832 742

Omer Kırnap (Koc University) MSc Thesis September 27 2018 47 123

Context and Word Embeddings

Relative contributions of part-of-speech (p) word vector (v)context vector (c)

Feats Hungarian En-ParTUT Latvianp 636 766 559

v 735 759 63

c 722 76 635

v-c 76 79 676

p-c 78 825 706

p-v 766 808 677

p-fb 747 797 663

p-v-c 793 832 742

Context vectors provide independent contribution on top ofPOS tags

Omer Kırnap (Koc University) MSc Thesis September 27 2018 48 123

Context and Word embeddings

Feats Hungarian En-ParTUT Latvianp 636 766 559

v 735 759 63

c 722 76 635

v-c 76 79 676

p-c 78 825 706

p-v 766 808 677p-fb 747 797 663

p-v-c 793 832 742

Our BiLSTM language model word vectors perform betterthan FB vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 49 123

Context and Word embeddings

Feats Hungarian En-ParTUT Latvianp 636 766 559

v 735 759 63

c 722 76 635

v-c 76 79 676

p-c 78 825 706

p-v 766 808 677

p-fb 747 797 663

p-v-c 793 832 742

Both POS tags and context vectors have significantcontributions on top of word vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 50 123

Issues with MLP

However

Choosing correct state of parser still remains critical

We are unable to represent whole parsing history with featureextracting

Omer Kırnap (Koc University) MSc Thesis September 27 2018 51 123

Solution

Find a recurrent architecture such that it can summarize the parsinghistory as well as word sequences in a buffer and stack

Omer Kırnap (Koc University) MSc Thesis September 27 2018 52 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to UniversalDependencies

CoNLL17

bull Koc-University team with MLP Parser using Context Embeddings

CoNLL18

bull KParse team with Tree-stack LSTM Parser usingContext and Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 53 123

c Tree-stack LSTM Parser (CoNLL18)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 54 123

Related Work - Stack LSTM

Figure Stack LSTM [Dyer et al 2015]

Represent each component (σ β A) with an LSTMModifying head wordrsquos embedding with dependent embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 55 123

Problems with Stack LSTM

They only modify stackrsquos word embeddings

Hidden states of LSTMS are not updated unless reduce

Actions are not explicitly represented

They only used word2vec embeddings [Mikolov et al 2013]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 56 123

Our solution

We propose

Context embeddings should improve parsing accuracy

Dependency relations should be explicitly represented

Morphological Features of a word may enhance parsing accuracy

Omer Kırnap (Koc University) MSc Thesis September 27 2018 57 123

Tree-stack LSTM Overview

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

We propose Tree-stack LSTM model with 4 components

β-LSTMσ-LSTMAction-LSTMTree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 58 123

Tree-stack LSTM

Input Representation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 59 123

Input Representation

Action and Dependency Relation Embeddings

Every action is represented with continuous vector

Every dependency relation is represented with continuous vector

Omer Kırnap (Koc University) MSc Thesis September 27 2018 60 123

Input Representation

We do not include explicit feature extractor We initiated wordrepresentation by concatenating

Character Based LSTMrsquos word vectors

Word Based BiLSTMrsquos context vectors

Part-of-speech (POS) vectors

Morph-feat vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 61 123

Input Representation

Morp-feat Vectors

Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs IT It

Figure Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 62 123

Tree-stack LSTM

Model Components1 β-LSTM2 σ-LSTM3 Action-LSTM4 Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 63 123

β-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 64 123

β-LSTM

LSTM LSTM LSTM

wi+2wi+1wi

Figure Bufferrsquos β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 65 123

σ-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 66 123

σ-LSTM

LSTM LSTM LSTM

si si+1 si+2

Figure Stackrsquos σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 67 123

Action-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 68 123

Action-LSTM

LSTM LSTM LSTM

Figure Action-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 69 123

How do components of tree-stack LSTM are connected

Omer Kırnap (Koc University) MSc Thesis September 27 2018 70 123

Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 71 123

Tree-RNN (t-RNN)

t-RNN

Dependent word

Dependency Relation

Head word

Figure t-RNN

whead new = tanh(Wrnn lowast [whead old dl wdep] + brnn) (1)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 72 123

Tree-RNN with

1 Left Transition2 Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 73 123

Left Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 74 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM LSTM

Left transition

t-RNN

Dependency Relation

HeadDependent

Figure Each embedding initiated by concatenating POS language andmorph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 75 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

Figure Stackrsquos top LSTM is reducedOmer Kırnap (Koc University) MSc Thesis September 27 2018 76 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 77 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure β-LSTM recalculates its hidden based on new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 78 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure Tree-stack LSTM is ready to give new transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 79 123

Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 80 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure Each embedding initiated by concatenating POS language andmorph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 81 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure Stackrsquos top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 82 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 83 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure σ-LSTM recalculates its hidden from new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 84 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure Tree-stack LSTM is ready to give new transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 85 123

Final overview of Tree-stack LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 86 123

Overview

1 IntroductionOverview of Dependency ParsingTransition Based Dependency Parsing

2 Related WorkLinear Models and their DrawbacksNeural Network Models

3 ModelLanguage ModelMLP ParserTree-stack LSTM Parser

4 ResultsMLP vs Tree-stack LSTMMorphological Feature EmbeddingsStatic vs Dynamic Oracle TrainingTransfer Learning

5 Conclusion6 Future Work amp Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 87 123

4 Results amp Comparisons

Omer Kırnap (Koc University) MSc Thesis September 27 2018 88 123

Results amp Comparisons

Dataset

Dependency parsing of 81

treebanks in 49 languages

All treebanks use standardized

annotation

17 universal

part-of-speech tags

37 universal dependency

relations

Koc-University ranked 7th out

of 33 participants (1st among

transition based parsers)

Dependency parsing of 82

treebanks in 57 languages

All treebanks use

standardized annotation

17 universal

part-of-speech tags

37 universal dependency

relations

Koc-University ranked 16th

References

Marco Kuhlmann, Carlos Gómez-Rodríguez, and Giorgio Satta. 2011. Dynamic programming algorithms for transition-based dependency parsers. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies - Volume 1, Association for Computational Linguistics, pages 673-682.

S. Kübler, R. McDonald, and J. Nivre. 2009. Dependency Parsing. Morgan & Claypool, US.

Chris Dyer, Miguel Ballesteros, Wang Ling, Austin Matthews, and Noah A. Smith. 2015. Transition-based dependency parsing with stack long short-term memory. CoRR abs/1505.08075.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123


Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 26 123

Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 27 123

Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 28 123

Related Work

Neural Networks for Feature Conjunctions

Neural networks can handle feature conjunctions and nonlinearity. However, they are impractical for high-dimensional inputs: they scale linearly in input dimensions (in both time and space, assuming a fixed number of hidden units)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 29 123

Related Work

Solution: use dense embeddings for input features
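The scaling argument can be made concrete: with dense embeddings the network reads an H-dimensional table row instead of multiplying a V-dimensional one-hot input. A minimal numpy sketch (sizes are made up, not from the thesis):

```python
import numpy as np

# An embedding lookup is just a row selection: equivalent to multiplying
# a one-hot vector with the embedding matrix, but without ever
# materializing the V-dimensional input.
V, H = 10000, 64                      # vocabulary size, embedding size
rng = np.random.default_rng(0)
E = rng.standard_normal((V, H))       # embedding table

word_id = 42
one_hot = np.zeros(V)
one_hot[word_id] = 1.0

dense = E[word_id]                    # O(H) lookup
via_matmul = one_hot @ E              # O(V*H) multiply, same result

assert np.allclose(dense, via_matmul)
```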

Omer Kırnap (Koc University) MSc Thesis September 27 2018 30 123

Overview

1 Introduction: Overview of Dependency Parsing; Transition Based Dependency Parsing

2 Related Work: Linear Models and their Drawbacks; Neural Network Models

3 Model: Language Model; MLP Parser; Tree-stack LSTM Parser

4 Results: MLP vs Tree-stack LSTM; Morphological Feature Embeddings; Static vs Dynamic Oracle Training; Transfer Learning

5 Conclusion

6 Future Work & Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 31 123

3 Model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 32 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to Universal Dependencies

CoNLL17: Koc-University team with MLP Parser using Context Embeddings

CoNLL18: KParse team with Tree-stack LSTM Parser using Context and Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 33 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to Universal Dependencies

CoNLL17: Koc-University team with MLP Parser using Context Embeddings

CoNLL18: KParse team with Tree-stack LSTM Parser using Context and Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 34 123

a Language Model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 35 123

Language Model (LM)

The LM is used to obtain context and word embeddings with two components:

Character Based LSTM extracts word vectors

Word Based BiLSTM extracts context vectors
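The two components can be sketched with a tiny numpy LSTM (a toy forward pass with random, untrained weights; the thesis model's sizes and training procedure differ):

```python
import numpy as np

# Toy LM sketch: a character LSTM turns a word's characters into a word
# vector (last hidden state), and a word-level BiLSTM over the word
# vectors yields each word's context vector (forward + backward states).
rng = np.random.default_rng(1)

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def lstm_step(x, h, c, W):
    # One LSTM cell step; W packs input/forget/output/candidate gates.
    z = W @ np.concatenate([x, h])
    H = h.size
    i, f, o = sigmoid(z[:H]), sigmoid(z[H:2*H]), sigmoid(z[2*H:3*H])
    g = np.tanh(z[3*H:])
    c = f * c + i * g
    return o * np.tanh(c), c

def run_lstm(xs, W, H):
    h, c, hs = np.zeros(H), np.zeros(H), []
    for x in xs:
        h, c = lstm_step(x, h, c, W)
        hs.append(h)
    return hs

C, H = 8, 16                               # char-embedding / hidden sizes
char_emb = rng.standard_normal((128, C))
W_char = rng.standard_normal((4*H, C + H)) * 0.1

def word_vector(word):
    # Last hidden state of the character LSTM is the word vector.
    return run_lstm([char_emb[ord(ch)] for ch in word], W_char, H)[-1]

W_fwd = rng.standard_normal((4*H, H + H)) * 0.1
W_bwd = rng.standard_normal((4*H, H + H)) * 0.1

def context_vectors(words):
    vs = [word_vector(w) for w in words]
    fwd = run_lstm(vs, W_fwd, H)
    bwd = run_lstm(vs[::-1], W_bwd, H)[::-1]
    return [np.concatenate([f, b]) for f, b in zip(fwd, bwd)]

ctx = context_vectors(["economic", "news", "had", "little", "effect"])
assert len(ctx) == 5 and ctx[0].shape == (2*H,)
```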

Omer Kırnap (Koc University) MSc Thesis September 27 2018 36 123

Language Model - Word vectors

Character based LSTM generates word Vectors

Figure Character LSTM from Kırnap et al 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 37 123

Language Model - Context Vectors

Word based BiLSTM generates Context Vectors

Figure Word BiLSTM from Kırnap et al 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 38 123

b MLP Parser (CoNLL17)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 39 123

MLP Parser

MLP Parser consists of 4 components

Character Based LSTM extracts word vectors

Word Based BiLSTM extracts context vectors

Feature extractor describes current state

Decision module (MLP) decides the next transition
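A hedged sketch of such a decision module (the action set, feature size, and layer sizes here are illustrative, not the thesis configuration):

```python
import numpy as np

# A one-hidden-layer MLP maps the extracted feature vector of the
# current parser state to a distribution over transitions.
rng = np.random.default_rng(2)
ACTIONS = ["shift", "left-arc", "right-arc"]   # illustrative action set
F, H = 50, 32                                  # feature / hidden sizes
W1, b1 = rng.standard_normal((H, F)) * 0.1, np.zeros(H)
W2, b2 = rng.standard_normal((len(ACTIONS), H)) * 0.1, np.zeros(len(ACTIONS))

def decide(features):
    h = np.tanh(W1 @ features + b1)
    logits = W2 @ h + b2
    p = np.exp(logits - logits.max())          # stable softmax
    p /= p.sum()
    return ACTIONS[int(p.argmax())], p

action, probs = decide(rng.standard_normal(F))
assert action in ACTIONS and abs(probs.sum() - 1.0) < 1e-9
```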

Omer Kırnap (Koc University) MSc Thesis September 27 2018 40 123

MLP Parser - Feature Extraction

Feature extractor describes current state

Figure Kırnap et al 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 41 123

MLP Parser - Decision Module

Decision module (MLP) decides the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 42 123

Experiments & Dataset (MLP) CoNLL17

CoNLL17 Dataset

Dependency parsing of 81 treebanks in 49 languages

All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 43 123

Experiments - Evaluation Metric

Labeled Attachment Score (LAS): the percentage of words assigned both the correct syntactic head and the correct dependency label

[Figure: three dependency trees over "Economic news had ..."]
Gold tree: news attaches to had with SBJ; Economic attaches to news with ATT.
Prediction 1: both arcs wrong (PRED, OBJ) — LAS 0.
Prediction 2: one of the two arcs correct — LAS (1/2) × 100 = 50.
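The metric can be sketched in a few lines (a toy version of the slide's example, not the official CoNLL evaluator, which also handles tokenization mismatches):

```python
# LAS as defined on the slide: the percentage of words whose predicted
# head AND dependency label both match the gold tree.
def las(gold, pred):
    # gold/pred: {word: (head, label)} for a toy single-occurrence sentence
    correct = sum(1 for w, arc in gold.items() if pred.get(w) == arc)
    return 100.0 * correct / len(gold)

gold = {"Economic": ("news", "ATT"), "news": ("had", "SBJ")}
pred1 = {"Economic": ("had", "OBJ"), "news": ("had", "PRED")}  # both arcs wrong
pred2 = {"Economic": ("news", "OBJ"), "news": ("had", "SBJ")}  # one arc fully correct

print(las(gold, pred1), las(gold, pred2))  # 0.0 50.0
```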

Omer Kırnap (Koc University) MSc Thesis September 27 2018 44 123

Experiments (MLP)

CoNLL 2017 Results (all treebanks LAS)

Ranked 1st among transition based parsers 5

5 Source: CoNLL17 official results page

Omer Kırnap (Koc University) MSc Thesis September 27 2018 45 123

Contributions in CoNLL17

Omer Kırnap (Koc University) MSc Thesis September 27 2018 46 123

Context and Word Embeddings

Relative contributions of part-of-speech (p), word vector (v), and context vector (c):

Feats   Hungarian  En-ParTUT  Latvian
p       63.6       76.6       55.9
v       73.5       75.9       63
c       72.2       76         63.5
v-c     76         79         67.6
p-c     78         82.5       70.6
p-v     76.6       80.8       67.7
p-fb    74.7       79.7       66.3
p-v-c   79.3       83.2       74.2

Omer Kırnap (Koc University) MSc Thesis September 27 2018 47 123

Context and Word Embeddings

Relative contributions of part-of-speech (p), word vector (v), and context vector (c):

Feats   Hungarian  En-ParTUT  Latvian
p       63.6       76.6       55.9
v       73.5       75.9       63
c       72.2       76         63.5
v-c     76         79         67.6
p-c     78         82.5       70.6
p-v     76.6       80.8       67.7
p-fb    74.7       79.7       66.3
p-v-c   79.3       83.2       74.2

Context vectors provide an independent contribution on top of POS tags

Omer Kırnap (Koc University) MSc Thesis September 27 2018 48 123

Context and Word embeddings

Feats   Hungarian  En-ParTUT  Latvian
p       63.6       76.6       55.9
v       73.5       75.9       63
c       72.2       76         63.5
v-c     76         79         67.6
p-c     78         82.5       70.6
p-v     76.6       80.8       67.7
p-fb    74.7       79.7       66.3
p-v-c   79.3       83.2       74.2

Our BiLSTM language model word vectors perform better than FB vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 49 123

Context and Word embeddings

Feats   Hungarian  En-ParTUT  Latvian
p       63.6       76.6       55.9
v       73.5       75.9       63
c       72.2       76         63.5
v-c     76         79         67.6
p-c     78         82.5       70.6
p-v     76.6       80.8       67.7
p-fb    74.7       79.7       66.3
p-v-c   79.3       83.2       74.2

Both POS tags and context vectors have significant contributions on top of word vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 50 123

Issues with MLP

However

Choosing the correct parser state representation still remains critical

We are unable to represent the whole parsing history with feature extraction

Omer Kırnap (Koc University) MSc Thesis September 27 2018 51 123

Solution

Find a recurrent architecture that can summarize the parsing history as well as the word sequences in the buffer and stack

Omer Kırnap (Koc University) MSc Thesis September 27 2018 52 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to Universal Dependencies

CoNLL17: Koc-University team with MLP Parser using Context Embeddings

CoNLL18: KParse team with Tree-stack LSTM Parser using Context and Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 53 123

c Tree-stack LSTM Parser (CoNLL18)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 54 123

Related Work - Stack LSTM

Figure Stack LSTM [Dyer et al 2015]

Represent each component (σ, β, A) with an LSTM; modify the head word's embedding with the dependent's embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 55 123

Problems with Stack LSTM

They only modify the stack's word embeddings

Hidden states of the LSTMs are not updated unless a reduce occurs

Actions are not explicitly represented

They only used word2vec embeddings [Mikolov et al 2013]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 56 123

Our solution

We propose

Context embeddings should improve parsing accuracy

Dependency relations should be explicitly represented

Morphological Features of a word may enhance parsing accuracy

Omer Kırnap (Koc University) MSc Thesis September 27 2018 57 123

Tree-stack LSTM Overview

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

We propose Tree-stack LSTM model with 4 components

β-LSTM
σ-LSTM
Action-LSTM
Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 58 123

Tree-stack LSTM

Input Representation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 59 123

Input Representation

Action and Dependency Relation Embeddings

Every action is represented with a continuous vector

Every dependency relation is represented with a continuous vector

Omer Kırnap (Koc University) MSc Thesis September 27 2018 60 123

Input Representation

We do not include an explicit feature extractor. We initialize the word representation by concatenating:

Character Based LSTMrsquos word vectors

Word Based BiLSTMrsquos context vectors

Part-of-speech (POS) vectors

Morph-feat vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 61 123

Input Representation

Morp-feat Vectors

Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs IT It

Figure Morph-feat Embeddings
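One plausible way to embed such a feature string (an assumption about the exact scheme: each feature=value pair gets its own vector and the pairs are summed; the thesis may combine them differently):

```python
import numpy as np

# Turn a CoNLL-U FEATS string like "Case=Nom|Gender=Neut|..." into a
# single morph-feat vector by summing per-pair embeddings.
rng = np.random.default_rng(3)
D = 16
table = {}                               # lazily allocated embedding table

def morph_feat_vector(feats):
    vecs = []
    for pair in feats.split("|"):        # e.g. "Case=Nom"
        if pair not in table:
            table[pair] = rng.standard_normal(D) * 0.1
        vecs.append(table[pair])
    return np.sum(vecs, axis=0)

v = morph_feat_vector("Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs")
assert v.shape == (D,) and len(table) == 5
```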

Omer Kırnap (Koc University) MSc Thesis September 27 2018 62 123

Tree-stack LSTM

Model Components:
1 β-LSTM
2 σ-LSTM
3 Action-LSTM
4 Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 63 123

β-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 64 123

β-LSTM

LSTM LSTM LSTM

wi+2wi+1wi

Figure Buffer's β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 65 123

σ-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 66 123

σ-LSTM

LSTM LSTM LSTM

si si+1 si+2

Figure Stack's σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 67 123

Action-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 68 123

Action-LSTM

LSTM LSTM LSTM

Figure Action-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 69 123

How are the components of tree-stack LSTM connected?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 70 123

Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 71 123

Tree-RNN (t-RNN)

t-RNN

Dependent word

Dependency Relation

Head word

Figure t-RNN

w_head_new = tanh(W_rnn * [w_head_old ; d_l ; w_dep] + b_rnn)    (1)
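Equation (1) can be written directly as a function (the sizes of W_rnn, b_rnn, and the embedding dimensions below are illustrative):

```python
import numpy as np

# t-RNN composition: the new head embedding is a tanh layer over the
# concatenation of the old head embedding, the dependency-relation
# embedding, and the dependent embedding.
rng = np.random.default_rng(4)
Dw, Dl = 32, 8                                  # word / relation sizes
W_rnn = rng.standard_normal((Dw, Dw + Dl + Dw)) * 0.1
b_rnn = np.zeros(Dw)

def t_rnn(w_head, d_l, w_dep):
    return np.tanh(W_rnn @ np.concatenate([w_head, d_l, w_dep]) + b_rnn)

new_head = t_rnn(rng.standard_normal(Dw),
                 rng.standard_normal(Dl),
                 rng.standard_normal(Dw))
assert new_head.shape == (Dw,)
```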

Omer Kırnap (Koc University) MSc Thesis September 27 2018 72 123

Tree-RNN with

1 Left Transition
2 Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 73 123

Left Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 74 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

LSTM LSTM LSTM LSTM

Left transition

t-RNN

Dependency Relation

HeadDependent

Figure Each embedding is initialized by concatenating POS, language, and morph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 75 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

Figure Stack's top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 76 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 77 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure β-LSTM recalculates its hidden based on new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 78 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure Tree-stack LSTM is ready to give new transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 79 123

Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 80 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

LSTM LSTM LSTM LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure Each embedding is initialized by concatenating POS, language, and morph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 81 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure Stack's top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 82 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 83 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure σ-LSTM recalculates its hidden from new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 84 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure Tree-stack LSTM is ready to give new transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 85 123
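The two transitions above can be sketched as pure functions on the parser state (σ, β, A), with words as plain strings; the real model additionally updates the LSTM hidden states and runs the t-RNN on the embeddings:

```python
# left_d(σ|s, b|β, A)  = (σ,   b|β, A ∪ {(b, d, s)})  -- buffer front heads stack top
# right_d(σ|s|t, β, A) = (σ|s, β,   A ∪ {(s, d, t)})  -- second-top heads stack top
def left(state, d):
    sigma, beta, arcs = state
    s, b = sigma[-1], beta[0]
    return sigma[:-1], beta, arcs | {(b, d, s)}

def right(state, d):
    sigma, beta, arcs = state
    s, t = sigma[-2], sigma[-1]
    return sigma[:-1], beta, arcs | {(s, d, t)}

state = (["ROOT", "news"], ["had", "little"], set())
state = left(state, "SBJ")            # "had" becomes the head of "news"
assert state == (["ROOT"], ["had", "little"], {("had", "SBJ", "news")})
```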

Final overview of Tree-stack LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 86 123

Overview

1 Introduction: Overview of Dependency Parsing; Transition Based Dependency Parsing

2 Related Work: Linear Models and their Drawbacks; Neural Network Models

3 Model: Language Model; MLP Parser; Tree-stack LSTM Parser

4 Results: MLP vs Tree-stack LSTM; Morphological Feature Embeddings; Static vs Dynamic Oracle Training; Transfer Learning

5 Conclusion

6 Future Work & Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 87 123

4 Results & Comparisons

Omer Kırnap (Koc University) MSc Thesis September 27 2018 88 123

Results & Comparisons

Dataset

CoNLL17:
Dependency parsing of 81 treebanks in 49 languages
All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations
Koc-University ranked 7th out of 33 participants (1st among transition based parsers)

CoNLL18:
Dependency parsing of 82 treebanks in 57 languages
All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations
Koc-University ranked 16th out of 30 participants (2nd among transition based parsers)

Changes from CoNLL17 to CoNLL18: 1. Train/test split change, 2. Annotation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 89 123

MLP vs Tree-stack LSTM

The CoNLL 2018 committee released comparison results of CoNLL17 and CoNLL18 systems tested under the same test sets

Omer Kırnap (Koc University) MSc Thesis September 27 2018 90 123

MLP vs Tree-stack LSTM

2 possible problems of official comparison

1 If the annotation of the treebank is improved, the older parser is handicapped

2 If the training-test split has changed and the old training data are now in the test data, the old parser is favored undeservedly

Omer Kırnap (Koc University) MSc Thesis September 27 2018 91 123

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code         MLP    Tree-stack
ru taiga (10k)    58.89  60.55
hu szeged (20k)   66.21  68.18
tr imst (50k)     56.78  58.75
ar padt (120k)    67.83  68.14
en ewt (205k)     74.87  75.77
cs cac (473k)     83.39  83.57

Tree-stack LSTM outperforms MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 92 123

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 93 123

MLP Parser

MLP

Figure Initial model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 94 123

Only Action LSTM

LSTM LSTM

Figure Only action LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 95 123

Only β-LSTM

LSTM LSTM LSTM

LSTM LSTM MLP

Figure Only β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 96 123

Only σ-LSTM

LSTM LSTM

LSTM LSTM MLP

Figure Only σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang Code   MLP    Only Action  Only-β  Only-σ
hu szeged   66.21  66.87        66.94   67.03
sv lines    71.12  72.05        72.17   72.45
tr imst     57.12  56.87        57.02   57.12
ar padt     67.83  66.67        66.89   66.92
cs cac      83.89  82.23        83.13   83.17
en ewt      75.54  75.43        75.56   75.67

Table Comparison between MLP and "Only" models

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 99 123

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang Code           without t-RNN  with t-RNN
no nynorsklia (3k)  51.78          53.33
ru taiga (11k)      59.13          60.55
gl treegal (15k)    69.76          70.45
hu szeged (20k)     66.12          68.18
sv lines (49k)      74.04          75.46
tr imst (50k)       58.12          58.75
ar padt (120k)      68.04          68.14
en ewt (204k)       74.87          75.77
cs cac (473k)       82.89          83.57
cs pdt (1M)         81.17          81.164

t-RNN provides comparative advantage for low-resourcelanguages

Omer Kırnap (Koc University) MSc Thesis September 27 2018 100 123

Ablation Analysis

Overall results of ablation analysis

Lang        MLP    Only A  Only-β  Only-σ  w/o t-RNN  all
hu szeged   66.21  66.87   66.94   67.03   66.12      68.18
sv lines    71.12  72.05   72.17   74.04   72.17      75.46
tr imst     57.12  56.87   57.02   57.12   58.12      58.75
ar padt     67.83  66.67   66.89   66.92   68.04      68.14
cs cac      83.89  82.23   83.13   83.17   82.89      83.57
en ewt      75.54  75.43   75.56   75.67   74.87      75.77

Tree-stack LSTM beats other model variations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of Ablation Experiments

t-RNN's performance contribution increases as the training size decreases

σ-LSTM provides more useful information, independent of dataset size

Interconnecting the model's components with t-RNN makes tree-stack LSTM more powerful for low-resource languages (ranked 10th overall and 2nd among transition based parsers)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What does Morphological Feature Embedding provide

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental Settings: We divide the CoNLL18 UD dataset (v2.2) into 4 parts based on the number of training tokens for each language, to better understand our contributions

Languages having less than 20k tokens

Languages having more than 20k, less than 50k tokens

Languages having more than 50k, less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having less than 20k training tokens

Lang code      Morph-Feats  no Morph-Feats  # of tokens
no nynorsklia  51.13        53.33           3,583
ru taiga       58.32        60.55           10,479
sme giella     52.78        53.39           16,385
la perseus     49.93        51.6            18,184
ug udt         52.78        53.39           19,262
sl sst         46.72        48.77           19,473
hu szeged      66.23        68.18           20,166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having between 50k and 100k training tokens

Lang code      Morph-Feats  no Morph-Feats  # of tokens
sv lines       72.18        74.81           48,325
fr sequoia     84.36        82.17           50,543
en gum         76.44        75.34           53,686
ko gsd         73.74        72.54           56,687
eu bdt         74.55        73.32           72,974
nl lassysmall  76.7         75.8            75,134
gl ctg         79.02        79.018          79,327
lv lvtb        72.33        72.24           80,666
id gsd         75.76        73.97           97,531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having more than 100k training tokens

Lang code   Morph-Feats  no Morph-Feats  # of tokens
fa seraji   81.18        81.12           121,064
bg btb      84.53        84.55           124,336
en ewt      75.77        75.682          204,585
ar padt     68.02        68.14           223,881
de gsd      71.59        71.32           263,804
ca ancora   85.89        85.874          417,587
es ancora   84.99        84.78           444,617
cs cac      83.57        83.63           472,608
cs pdt      81.43        82.12           1,173,282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle: transitions follow the gold moves. Dynamic oracle: transitions follow the model's predicted moves.

In both cases, the log-probability of the gold moves is maximized
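The difference can be sketched as follows (the model, oracle, and state update below are stand-ins, not the thesis components; both regimes would minimize -log p(gold move | state)):

```python
import random

# Static oracle: always FOLLOW the gold move during training.
# Dynamic oracle: follow the model's PREDICTED move, so the model also
# sees (and learns to recover from) states it reaches by its own errors.
random.seed(0)
MOVES = ["shift", "left", "right"]

def model_predict(state):          # stand-in for the parser network
    return random.choice(MOVES)

def oracle_gold(state):            # stand-in for the gold oracle
    return "shift"

def train_episode(dynamic, steps=5):
    state, followed = 0, []
    for _ in range(steps):
        gold = oracle_gold(state)
        # the loss term would be -log p(gold | state) in both regimes
        move = model_predict(state) if dynamic else gold
        followed.append(move)
        state += 1                 # stand-in state update
    return followed

assert set(train_episode(dynamic=False)) == {"shift"}
assert len(train_episode(dynamic=True)) == 5
```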

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning:
1 Using very limited data to train an LM for word and context vectors, and using them to train a parser from scratch
2 Using Facebook's word vectors to train a parser [Bojanowski et al 2017]
3 Using my own word and context vectors trained on a different language from the same language family
4 Applying transfer learning with a pre-trained parser

Language      (1)           (2)    (3)    (4)
af afribooms  not provided  75.46  77.43  78.12
kk ktb        20.19         22.31  21.96  23.86
bxr bdt       7.64          9.76   9.93   8.98
kmr mg        20.12         22.57  22.78  23.39

Table LAS values for strategies (1) (2) (3) and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the mostbeneficial

From-scratch LM training does not produce useful word and context vectors

Our word and context vectors are still more useful than Facebook's [Bojanowski et al 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition based parsers can only build projective trees 6
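Projectivity can be checked by testing whether any two dependency arcs cross; a minimal sketch with arcs given as (head index, dependent index) pairs:

```python
# A tree is projective iff no two dependency arcs cross, i.e. there is no
# pair of arcs spanning (l1, r1) and (l2, r2) with l1 < l2 < r1 < r2.
def is_projective(arcs):
    spans = [tuple(sorted(a)) for a in arcs]
    for (l1, r1) in spans:
        for (l2, r2) in spans:
            if l1 < l2 < r1 < r2:   # arcs cross
                return False
    return True

assert is_projective([(0, 2), (2, 1), (2, 3)])   # nested arcs: projective
assert not is_projective([(0, 2), (1, 3)])       # crossing arcs
```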

6 Figure from https://stp.lingfil.uu.se/~sara/kurser/5LN455-2014/lectures/5LN455-F8.pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123

Projective vs Non-projective

We compared our model with the best model for different projectivity ratios

Language     Projectivity  Best (LAS)  Our (LAS)
grc perseus  90.7          79.39       55.03 (20)
eu bdt       95.13         84.22       74.13 (17)
hu szeged    97.8          82.66       68.18 (14)
da ddt       98.26         86.28       76.40 (17)
en gum       99.6          85.05       76.44 (15)
gl treegal   100           74.25       70.45 (10)
gl ctg       100           82.12       79.45 (14)

Table Our models performance gap decreases as the projectivity ratio increases

7From the official results page and our projectivity table
Omer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusion: We introduced "Context, Word, and Morph-feat" embeddings and showed their contribution in transition-based dependency parsing.

Our Tree-stack LSTM outperformed the MLP while removing hand-crafted feature engineering.

Tree-stack LSTM performed better with low-resource languages.

When the training dataset size increases, the tree-stack LSTM loses its advantage.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization, morphological tagging, and dependency parsing performed better. Some also jointly train a language model together with pre-trained embeddings.

Attention Mechanism

Applying attention between the states of the σ-LSTM, β-LSTM, or Action-LSTM may bring a performance improvement.

Morphological Features

Finding a different way to represent morphological features.

Dynamic Oracle vs Beam Training

Although I tried both of them, I could not obtain a performance improvement. There may be convergence problems with our loss function, and other losses (e.g., CRF) may solve this problem.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap, Erenay Dayanık, and Deniz Yuret. 2018. Tree-stack LSTM in Transition Based Dependency Parsing. In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap, Berkay Furkan Onder, and Deniz Yuret. 2017. Parsing with Context Embeddings. In Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann, Carlos Gómez-Rodríguez, and Giorgio Satta. 2011. Dynamic programming algorithms for transition-based dependency parsers. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Volume 1. Association for Computational Linguistics, pages 673-682.

S. Kübler, R. McDonald, and J. Nivre. 2009. Dependency Parsing. Morgan & Claypool, US.

Chris Dyer, Miguel Ballesteros, Wang Ling, Austin Matthews, and Noah A. Smith. 2015. Transition-based dependency parsing with stack long short-term memory. CoRR abs/1505.08075.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123

  • Introduction
    • Overview of Dependency Parsing
    • Transition Based Dependency Parsing
  • Related Work
    • Linear Models and their Drawbacks
    • Neural Network Models
  • Model
    • Language Model
    • MLP Parser
    • Tree-stack LSTM Parser
  • Results
    • MLP vs Tree-stack LSTM
    • Morphological Feature Embeddings
    • Static vs Dynamic Oracle Training
    • Transfer Learning
  • Conclusion
  • Future Work & Discussions
Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 27 123

Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 28 123

Related Work

Neural Networks for Feature Conjunctions

Neural networks can handle feature conjunctions and nonlinearity. However, they are impractical for high-dimensional inputs: they scale linearly in input dimensions (in both time and space, assuming a fixed number of hidden units).

Omer Kırnap (Koc University) MSc Thesis September 27 2018 29 123

Related Work

Solution Using dense embeddings for input features

Omer Kırnap (Koc University) MSc Thesis September 27 2018 30 123
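The dense-embedding idea can be sketched in a few lines: each input feature indexes a small learned table, so the first layer no longer scales with the one-hot input dimension. The vocabulary and embedding size below are toy values.

```python
import numpy as np

rng = np.random.default_rng(0)
vocab = {"economic": 0, "news": 1, "had": 2}  # toy vocabulary
E = rng.normal(size=(len(vocab), 4))          # dense table: |V| x dim (dim = 4 here)

def embed(word):
    # An O(dim) table lookup replaces a |V|-dimensional one-hot input.
    return E[vocab[word]]
```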

Overview

1. Introduction
   • Overview of Dependency Parsing
   • Transition Based Dependency Parsing

2. Related Work
   • Linear Models and their Drawbacks
   • Neural Network Models

3. Model
   • Language Model
   • MLP Parser
   • Tree-stack LSTM Parser

4. Results
   • MLP vs Tree-stack LSTM
   • Morphological Feature Embeddings
   • Static vs Dynamic Oracle Training
   • Transfer Learning

5. Conclusion

6. Future Work & Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 31 123

3 Model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 32 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to Universal Dependencies:

• CoNLL17: Koc-University team with MLP Parser using Context Embeddings
• CoNLL18: KParse team with Tree-stack LSTM Parser using Context and Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 33 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to Universal Dependencies:

• CoNLL17: Koc-University team with MLP Parser using Context Embeddings
• CoNLL18: KParse team with Tree-stack LSTM Parser using Context and Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 34 123

a Language Model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 35 123

Language Model (LM)

LM is used to obtain Context and Word embeddings with two components:

Character Based LSTM extracts word vectors

Word Based BiLSTM extracts context vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 36 123

Language Model - Word vectors

Character based LSTM generates word Vectors

Figure Character LSTM from Kırnap et al 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 37 123

Language Model - Context Vectors

Word based BiLSTM generates Context Vectors

Figure Word BiLSTM from Kırnap et al 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 38 123
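The two LM components can be sketched as nested recurrences: a character-level network summarizes each word's spelling into a word vector, and a word-level bidirectional network turns the word vectors into context vectors. A plain tanh RNN stands in for the LSTMs here; sizes and random initialization are illustrative only.

```python
import numpy as np

rng = np.random.default_rng(0)
H = 8  # hidden size (toy)

def rnn(inputs, Wx, Wh):
    """Plain tanh RNN as a stand-in for the LSTMs used in the thesis."""
    h, states = np.zeros(H), []
    for x in inputs:
        h = np.tanh(Wx @ x + Wh @ h)
        states.append(h)
    return states

def word_vector(word, char_emb, Wx, Wh):
    # Character-based model: the final state summarizes the word's spelling.
    return rnn([char_emb[c] for c in word], Wx, Wh)[-1]

def context_vectors(word_vecs, Wx, Wh):
    # Word-based "BiLSTM": concatenate forward and backward states.
    fwd = rnn(word_vecs, Wx, Wh)
    bwd = rnn(word_vecs[::-1], Wx, Wh)[::-1]
    return [np.concatenate([f, b]) for f, b in zip(fwd, bwd)]

char_emb = {c: rng.normal(size=H) for c in "abcdefghijklmnopqrstuvwxyz"}
Wx, Wh = rng.normal(size=(H, H)), rng.normal(size=(H, H))
words = ["economic", "news", "had"]
wvecs = [word_vector(w, char_emb, Wx, Wh) for w in words]
cvecs = context_vectors(wvecs, Wx, Wh)
```

Each context vector is twice the hidden size because it joins the forward and backward summaries of the sentence around that word.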

b MLP Parser (CoNLL17)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 39 123

MLP Parser

MLP Parser consists of 4 components

Character Based LSTM extracts word vectors

Word Based BiLSTM extracts context vectors

Feature extractor describes current state

Decision module (MLP) decides the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 40 123

MLP Parser - Feature Extraction

Feature extractor describes current state

Figure Kırnap et al 2017
Omer Kırnap (Koc University) MSc Thesis September 27 2018 41 123

MLP Parser - Decision Module

Decision module (MLP) decides the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 42 123

Experiments & Dataset (MLP) CoNLL17

CoNLL17 Dataset

Dependency parsing of 81 treebanks in 49 languages

All treebanks use standardized annotation:
• 17 universal part-of-speech tags
• 37 universal dependency relations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 43 123

Experiments - Evaluation Metric

Labeled Attachment Score (LAS): the percentage of words correctly assigned both the correct syntactic head and the correct dependency label.

Example for "Economic news had ...":

Gold tree (arcs SBJ, ATT): LAS = 1

Pred 1 (arcs PRED, OBJ): LAS = 0

Pred 2 (arcs OBJ, ATT): LAS = (1/2) · 100

Omer Kırnap (Koc University) MSc Thesis September 27 2018 44 123
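The metric follows directly from the definition above: a word counts as correct only if both its head and its label match. The head indices in this toy example are illustrative, mirroring the slide's "Pred 2" case where one of two words is fully correct.

```python
def las(gold, pred):
    """Labeled Attachment Score: percentage of words whose predicted
    (head, label) pair matches the gold annotation exactly."""
    correct = sum(g == p for g, p in zip(gold, pred))
    return 100.0 * correct / len(gold)

# Toy arcs for "Economic news had ...": one (head_index, label) per word.
gold  = [(3, "SBJ"), (3, "ATT")]
pred2 = [(3, "OBJ"), (3, "ATT")]   # Pred 2 from the slide: half correct
```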

Experiments (MLP)

CoNLL 2017 Results (all treebanks LAS)

Ranked 1st among transition based parsers 5

5Source: CoNLL17 official results page
Omer Kırnap (Koc University) MSc Thesis September 27 2018 45 123

Contributions in CoNLL17

Omer Kırnap (Koc University) MSc Thesis September 27 2018 46 123

Context and Word Embeddings

Relative contributions of part-of-speech (p) word vector (v)context vector (c)

Feats   Hungarian   En-ParTUT   Latvian
p       63.6        76.6        55.9
v       73.5        75.9        63.0
c       72.2        76.0        63.5
v-c     76.0        79.0        67.6
p-c     78.0        82.5        70.6
p-v     76.6        80.8        67.7
p-fb    74.7        79.7        66.3
p-v-c   79.3        83.2        74.2

Omer Kırnap (Koc University) MSc Thesis September 27 2018 47 123

Context and Word Embeddings

Relative contributions of part-of-speech (p) word vector (v)context vector (c)

Feats   Hungarian   En-ParTUT   Latvian
p       63.6        76.6        55.9
v       73.5        75.9        63.0
c       72.2        76.0        63.5
v-c     76.0        79.0        67.6
p-c     78.0        82.5        70.6
p-v     76.6        80.8        67.7
p-fb    74.7        79.7        66.3
p-v-c   79.3        83.2        74.2

Context vectors provide an independent contribution on top of POS tags.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 48 123

Context and Word embeddings

Feats   Hungarian   En-ParTUT   Latvian
p       63.6        76.6        55.9
v       73.5        75.9        63.0
c       72.2        76.0        63.5
v-c     76.0        79.0        67.6
p-c     78.0        82.5        70.6
p-v     76.6        80.8        67.7
p-fb    74.7        79.7        66.3
p-v-c   79.3        83.2        74.2

Our BiLSTM language model word vectors perform better than FB vectors.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 49 123

Context and Word embeddings

Feats   Hungarian   En-ParTUT   Latvian
p       63.6        76.6        55.9
v       73.5        75.9        63.0
c       72.2        76.0        63.5
v-c     76.0        79.0        67.6
p-c     78.0        82.5        70.6
p-v     76.6        80.8        67.7
p-fb    74.7        79.7        66.3
p-v-c   79.3        83.2        74.2

Both POS tags and context vectors have significant contributions on top of word vectors.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 50 123

Issues with MLP

However

Choosing the correct parser state representation remains critical.

We are unable to represent the whole parsing history with feature extraction.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 51 123

Solution

Find a recurrent architecture that can summarize the parsing history as well as the word sequences in the buffer and stack.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 52 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to Universal Dependencies:

• CoNLL17: Koc-University team with MLP Parser using Context Embeddings
• CoNLL18: KParse team with Tree-stack LSTM Parser using Context and Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 53 123

c Tree-stack LSTM Parser (CoNLL18)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 54 123

Related Work - Stack LSTM

Figure Stack LSTM [Dyer et al 2015]

Represent each component (σ, β, A) with an LSTM. Modify the head word's embedding with the dependent's embedding.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 55 123

Problems with Stack LSTM

They only modify the stack's word embeddings.

Hidden states of the LSTMs are not updated unless a reduce occurs.

Actions are not explicitly represented.

They only used word2vec embeddings [Mikolov et al. 2013].

Omer Kırnap (Koc University) MSc Thesis September 27 2018 56 123

Our solution

We propose

Context embeddings should improve parsing accuracy

Dependency relations should be explicitly represented

Morphological Features of a word may enhance parsing accuracy

Omer Kırnap (Koc University) MSc Thesis September 27 2018 57 123

Tree-stack LSTM Overview

[Figure: Tree-stack LSTM architecture — σ-LSTM, β-LSTM, Action-LSTM, and t-RNN (head word, dependent word, dependency relation) feeding a concat layer and an MLP]

We propose the Tree-stack LSTM model with 4 components:

1. β-LSTM
2. σ-LSTM
3. Action-LSTM
4. Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 58 123

Tree-stack LSTM

Input Representation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 59 123

Input Representation

Action and Dependency Relation Embeddings

Every action is represented with a continuous vector.

Every dependency relation is represented with a continuous vector.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 60 123

Input Representation

We do not include an explicit feature extractor. We initialize the word representation by concatenating:

Character Based LSTM's word vectors

Word Based BiLSTM's context vectors

Part-of-speech (POS) vectors

Morph-feat vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 61 123

Input Representation

Morph-feat Vectors

Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs IT It

Figure Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 62 123
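A morph-feat vector can be built from the UD FEATS string by embedding each `Feature=Value` pair and composing the pieces. Summing is used here purely for illustration; the thesis may compose the per-feature embeddings differently, and the embedding size is a toy value.

```python
import numpy as np

rng = np.random.default_rng(0)
DIM = 6                 # toy embedding size
feat_emb = {}           # one embedding per "Feature=Value" pair, created on demand

def morph_feat_vector(feats):
    """Embed a UD FEATS string such as 'Case=Nom|Gender=Neut|Number=Sing'
    by summing per-feature embeddings ('_' means no features in UD)."""
    vec = np.zeros(DIM)
    if feats == "_":
        return vec
    for fv in feats.split("|"):
        if fv not in feat_emb:
            feat_emb[fv] = rng.normal(size=DIM)
        vec += feat_emb[fv]
    return vec
```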

Tree-stack LSTM

Model Components:
1. β-LSTM
2. σ-LSTM
3. Action-LSTM
4. Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 63 123

β-LSTM

[Figure: Tree-stack LSTM architecture — σ-LSTM, β-LSTM, Action-LSTM, and t-RNN (head word, dependent word, dependency relation) feeding a concat layer and an MLP]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 64 123

β-LSTM

Figure Buffer's β-LSTM (an LSTM over the buffer words w_i, w_i+1, w_i+2)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 65 123

σ-LSTM

[Figure: Tree-stack LSTM architecture — σ-LSTM, β-LSTM, Action-LSTM, and t-RNN (head word, dependent word, dependency relation) feeding a concat layer and an MLP]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 66 123

σ-LSTM

Figure Stack's σ-LSTM (an LSTM over the stack items s_i, s_i+1, s_i+2)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 67 123

Action-LSTM

[Figure: Tree-stack LSTM architecture — σ-LSTM, β-LSTM, Action-LSTM, and t-RNN (head word, dependent word, dependency relation) feeding a concat layer and an MLP]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 68 123

Action-LSTM

Figure Action-LSTM (an LSTM over the sequence of past transitions)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 69 123

How are the components of the tree-stack LSTM connected?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 70 123

Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 71 123

Tree-RNN (t-RNN)

Figure t-RNN (combines the head word, dependent word, and dependency relation embeddings)

w_head_new = tanh(W_rnn · [w_head_old; d_l; w_dep] + b_rnn)    (1)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 72 123
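Equation (1) is a single affine map over the concatenated head, relation, and dependent embeddings, squashed by tanh. A minimal sketch with toy dimensions and random weights:

```python
import numpy as np

rng = np.random.default_rng(0)
D = 4  # embedding size (toy)
W_rnn = rng.normal(size=(D, 3 * D))
b_rnn = np.zeros(D)

def t_rnn(w_head, d_label, w_dep):
    """Eq. (1): update the head embedding from the old head embedding,
    the dependency-relation embedding, and the dependent embedding."""
    x = np.concatenate([w_head, d_label, w_dep])
    return np.tanh(W_rnn @ x + b_rnn)

new_head = t_rnn(np.ones(D), np.ones(D), np.ones(D))
```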

Tree-RNN with

1. Left Transition
2. Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 73 123

Left Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 74 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure Each embedding is initiated by concatenating POS, language, and morph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 75 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure Stack's top LSTM is reduced
Omer Kırnap (Koc University) MSc Thesis September 27 2018 76 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure t-RNN calculates the new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 77 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure β-LSTM recalculates its hidden state based on the new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 78 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure Tree-stack LSTM is ready for the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 79 123

Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 80 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure Each embedding is initiated by concatenating POS, language, and morph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 81 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure Stack's top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 82 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure t-RNN calculates the new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 83 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure σ-LSTM recalculates its hidden state from the new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 84 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure Tree-stack LSTM is ready for the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 85 123
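The two transition rules above translate directly into stack and buffer operations over word positions. A minimal sketch; the labels and positions in the demo are toy values, and shift is omitted.

```python
def left_arc(stack, buffer, arcs, d):
    # left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)}):
    # the stack top s becomes a dependent of the buffer front b.
    s, b = stack.pop(), buffer[0]
    arcs.append((b, d, s))

def right_arc(stack, arcs, d):
    # right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)}):
    # the stack top t becomes a dependent of the element s below it.
    t = stack.pop()
    arcs.append((stack[-1], d, t))

# Words as integer positions; 0 stands for the root.
stack, buffer, arcs = [0, 2], [3], []
left_arc(stack, buffer, arcs, "nsubj")   # arcs gains (3, "nsubj", 2)

stack2, arcs2 = [0, 3, 4], []
right_arc(stack2, arcs2, "obj")          # arcs2 gains (3, "obj", 4)
```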

Final overview of Tree-stack LSTM

[Figure: Tree-stack LSTM architecture — σ-LSTM, β-LSTM, Action-LSTM, and t-RNN (head word, dependent word, dependency relation) feeding a concat layer and an MLP]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 86 123
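The decision step of the overview can be sketched as: concatenate the three LSTM summaries and score the candidate transitions with an MLP. This only shows the concat-and-score step; the full model also feeds t-RNN-updated head and dependent embeddings, and all sizes and weights here are toy values.

```python
import numpy as np

rng = np.random.default_rng(0)
H, N_ACT = 8, 3  # hidden size and number of transitions (toy values)
W1, b1 = rng.normal(size=(16, 3 * H)), np.zeros(16)
W2, b2 = rng.normal(size=(N_ACT, 16)), np.zeros(N_ACT)

def next_transition(sigma_h, beta_h, action_h):
    """Concatenate the σ-, β-, and Action-LSTM states and let an MLP
    score the possible transitions (e.g. left, right, shift)."""
    x = np.concatenate([sigma_h, beta_h, action_h])
    hidden = np.tanh(W1 @ x + b1)
    scores = W2 @ hidden + b2
    return int(np.argmax(scores))

a = next_transition(np.ones(H), np.ones(H), np.ones(H))
```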

Overview

1. Introduction
   • Overview of Dependency Parsing
   • Transition Based Dependency Parsing

2. Related Work
   • Linear Models and their Drawbacks
   • Neural Network Models

3. Model
   • Language Model
   • MLP Parser
   • Tree-stack LSTM Parser

4. Results
   • MLP vs Tree-stack LSTM
   • Morphological Feature Embeddings
   • Static vs Dynamic Oracle Training
   • Transfer Learning

5. Conclusion

6. Future Work & Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 87 123

4 Results & Comparisons

Omer Kırnap (Koc University) MSc Thesis September 27 2018 88 123

Results & Comparisons

Dataset

CoNLL17:
• Dependency parsing of 81 treebanks in 49 languages
• All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations
• Koc-University ranked 7th out of 33 participants (1st among transition based parsers)

CoNLL18:
• Dependency parsing of 82 treebanks in 57 languages
• All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations
• Koc-University ranked 16th out of 30 participants (2nd among transition based parsers)

Changes from CoNLL17 to CoNLL18: 1. Train/test split change  2. Annotation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 89 123

MLP vs Tree-stack LSTM

The CoNLL 2018 committee released comparison results of CoNLL17 and CoNLL18 systems tested under the same test sets.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 90 123

MLP vs Tree-stack LSTM

2 possible problems of the official comparison:

1. If the annotation of the treebank is improved, the older parser is handicapped.

2. If the training-test split has changed and the old training data are now in the test data, the old parser is favored undeservedly.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 91 123

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models:

Lang Code          MLP     Tree-stack
ru_taiga (10k)     58.89   60.55
hu_szeged (20k)    66.21   68.18
tr_imst (50k)      56.78   58.75
ar_padt (120k)     67.83   68.14
en_ewt (205k)      74.87   75.77
cs_cac (473k)      83.39   83.57

Tree-stack LSTM outperforms MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 92 123

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 93 123

MLP Parser

Figure Initial model (MLP)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 94 123

Only Action LSTM

Figure Only action LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 95 123

Only β-LSTM

Figure Only β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 96 123

Only σ-LSTM

Figure Only σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang Code    MLP     Only Action   Only-β   Only-σ
hu_szeged    66.21   66.87         66.94    67.03
sv_lines     71.12   72.05         72.17    72.45
tr_imst      57.12   56.87         57.02    57.12
ar_padt      67.83   66.67         66.89    66.92
cs_cac       83.89   82.23         83.13    83.17
en_ewt       75.54   75.43         75.56    75.67

Table Comparison between MLP and "Only" models

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN

[Figure: Tree-stack LSTM architecture — σ-LSTM, β-LSTM, Action-LSTM, and t-RNN (head word, dependent word, dependency relation) feeding a concat layer and an MLP]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 99 123

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN:

Lang Code            without t-RNN   with t-RNN
no_nynorsklia (3k)   51.78           53.33
ru_taiga (11k)       59.13           60.55
gl_treegal (15k)     69.76           70.45
hu_szeged (20k)      66.12           68.18
sv_lines (49k)       74.04           75.46
tr_imst (50k)        58.12           58.75
ar_padt (120k)       68.04           68.14
en_ewt (204k)        74.87           75.77
cs_cac (473k)        82.89           83.57
cs_pdt (1M)          81.17           81.164

t-RNN provides a comparative advantage for low-resource languages

Omer Kırnap (Koc University) MSc Thesis September 27 2018 100 123

Ablation Analysis

Overall results of the ablation analysis:

Lang         MLP     Only A   Only-β   Only-σ   w/o t-RNN   all
hu_szeged    66.21   66.87    66.94    67.03    66.12       68.18
sv_lines     71.12   72.05    72.17    74.04    72.17       75.46
tr_imst      57.12   56.87    57.02    57.12    58.12       58.75
ar_padt      67.83   66.67    66.89    66.92    68.04       68.14
cs_cac       83.89   82.23    83.13    83.17    82.89       83.57
en_ewt       75.54   75.43    75.56    75.67    74.87       75.77

Tree-stack LSTM beats the other model variations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of the Ablation Experiments

t-RNN's performance contribution increases as the training size decreases.

σ-LSTM provides more useful information independent of dataset size.

Interconnecting the model's components with the t-RNN makes the tree-stack LSTM more powerful for low-resource languages (ranked 10th overall and 2nd among transition based parsers).

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What does Morphological Feature Embedding provide

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental Settings: We divide the CoNLL18 UD 2.2 dataset into 4 parts based on the number of training tokens for each language, to better understand our contributions.

Languages having less than 20k tokens

Languages having more than 20k, less than 50k tokens

Languages having more than 50k, less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having less than 20k training tokens:

Lang code       Morph-Feats   no Morph-Feats   # of tokens
no_nynorsklia   51.13         53.33            3583
ru_taiga        58.32         60.55            10479
sme_giella      52.78         53.39            16385
la_perseus      49.93         51.60            18184
ug_udt          52.78         53.39            19262
sl_sst          46.72         48.77            19473
hu_szeged       66.23         68.18            20166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having between 50k and 100k tokens:

Lang code       Morph-Feats   no Morph-Feats   # of tokens
sv_lines        72.18         74.81            48325
fr_sequoia      84.36         82.17            50543
en_gum          76.44         75.34            53686
ko_gsd          73.74         72.54            56687
eu_bdt          74.55         73.32            72974
nl_lassysmall   76.70         75.80            75134
gl_ctg          79.02         79.018           79327
lv_lvtb         72.33         72.24            80666
id_gsd          75.76         73.97            97531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having more than 100k training tokens:

Lang code   Morph-Feats   no Morph-Feats   # of tokens
fa_seraji   81.18         81.12            121064
bg_btb      84.53         84.55            124336
en_ewt      75.77         75.682           204585
ar_padt     68.02         68.14            223881
de_gsd      71.59         71.32            263804
ca_ancora   85.89         85.874           417587
es_ancora   84.99         84.78            444617
cs_cac      83.57         83.63            472608
cs_pdt      81.43         82.12            1173282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle: transitions follow gold moves.
Dynamic oracle: transitions follow predicted moves.

In both cases, log p of the gold moves is maximized.

[Figure: Tree-stack LSTM architecture — σ-LSTM, β-LSTM, Action-LSTM, and t-RNN (head word, dependent word, dependency relation) feeding a concat layer and an MLP]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123
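The two training regimes differ only in which move is applied to reach the next configuration; the loss always maximizes log p of the gold move in the visited state. A minimal sketch with a toy state and model standing in for the real parser and its APIs (`ToyState`, `ToyModel`, and `p_explore` are illustrative assumptions, not the thesis's implementation).

```python
import math
import random

class ToyState:
    """Counts remaining words; every move consumes one (parser-state stand-in)."""
    def __init__(self, n): self.n = n
    def is_final(self): return self.n == 0
    def apply(self, move): return ToyState(self.n - 1)

class ToyModel:
    def log_prob(self, state, move): return math.log(0.5)  # uniform over 2 moves
    def predict(self, state): return "shift"

def train_sentence(state, gold_oracle, model, dynamic=False, p_explore=0.1):
    # Static oracle: always follow the gold move.
    # Dynamic oracle: sometimes follow the model's own (possibly wrong)
    # prediction, but the loss still targets the gold move in that state.
    loss = 0.0
    while not state.is_final():
        gold = gold_oracle(state)
        loss -= model.log_prob(state, gold)
        move = model.predict(state) if dynamic and random.random() < p_explore else gold
        state = state.apply(move)
    return loss

loss = train_sentence(ToyState(4), lambda s: "shift", ToyModel())
```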

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning1 Using very limited data to train LM for word and context vectors and

use them to train a parser from scratch2 Using Facebookrsquos word vectors to train a parser [Bojanowski et al

2017]3 Using my own word and context vectors trained with different

language but from the same language family4 Applying transfer learning with a pre-trained parser

Language (1) (2) (3) (4)af afribooms not provided 7546 7743 7812kk ktb 2019 2231 2196 2386bxr bdt 764 976 993 898

kmr mg 2012 2257 2278 2339

Table LAS values for strategies (1) (2) (3) and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the mostbeneficial

From scratch LM training does not bring useful word and contextvectors

Our word and context vectors are still more useful than Facebookrsquos[Bojanowski et al 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition Based Parser can only build projective trees 6

6Figure fromhttpstplingfiluuse sarakurser5LN455-2014lectures5LN455-F8pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123

Projective vs Non-projective

We compared our model with the best model for different projectivityratios

Language Projectiviy Best (LAS) Our (LAS)

grc perseus 907 7939 5503 (20)

eu bdt 9513 8422 7413 (17)

hu szeged 978 8266 6818 (14)

da ddt 9826 8628 7640 (17)

en gum 996 8505 7644 (15)

gl treegal 100 7425 7045 (10)

gl ctg 100 8212 7945 (14)

Table Our models performance gap decreases as the projectivity ratio increases

7

7From official results page and our projectivity tableOmer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusionWe introduced ldquoContext Word and Morph-featrdquo embeddings and showedtheir contribution in transition based dependency parsing

Our Tree-stack LSTM outperformed MLP by removing hand-craftedfeature engineering

Tree-stack LSTM performed better with low resource languages

When the training dataset size increases tree-stack LSTM losses itsadvantage

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization morphological taggingand dependency parsing performed better Some are also jointly trained alanguage model together with pre-trained embeddings

Attention Mechanism

Applying attention in between σ-LSTM states or β-LSTM orAction-LSTM may bring performance improvement

Morphological Features

Finding different way to represent morphological features

Dynamic Oracle vs Beam Training

Although I tried both of them I could not obtain performanceimprovement There may be convergence problems with our loss functionand another losses (CRF) may solve this problem

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap Erenay Dayanık and Deniz Yuret 2018 Tree-stackLSTM in Transition Based Dependency Parsing In Proceedings ofthe CoNLL 2018 Shared Task Multilingual Parsing from Raw Text toUniversal Dependencies

Omer Kırnap Berkay Furkan Onder and Deniz Yuret 2017 Parsingwith Context Embeddings In Proceedings of the CoNLL 2017 SharedTask Multilingual Parsing from Raw Text to Universal Dependencies

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann Carlos Gomez-Rodriguez and Giorgio Satta 2011Dynamic programming algorithms for transition-based dependencyparsers In Proceedings of the 49th Annual Meeting of theAssociation for Computational Linguistics Human LanguageTechnologies-Volume 1 Association for Computational Linguisticspages 673682

S Kbler R McDonald and J Nivre 2009 Dependency parsingMorgan amp Claypool US

Chris Dyer Miguel Ballesteros Wang Ling Austin Matthews andNoah A Smith 2015 Transition based dependency parsing withstack long-short term memory CoRR abs150508075

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123

  • Introduction
    • Overview of Dependency Parsing
    • Transition Based Dependency Parsing
      • Related Work
        • Linear Models and their Drawbacks
        • Neural Network Models
          • Model
            • Language Model
            • MLP Parser
            • Tree-stack LSTM Parser
              • Results
                • MLP vs Tree-stack LSTM
                • Morphological Feature Embeddings
                • Static vs Dynamic Oracle Training
                • Transfer Learning
                  • Conclusion
                  • Future Work amp Discussions
Page 28: Transition Based Dependency Parsing with Deep LearningTransition Based Dependency Parsing with Deep Learning Omer K rnap Ko˘c University okirnap@ku.edu.tr September 27, 2018 Omer

Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 28 123

Related Work

Neural Networks for Feature Conjunctions

Neural networks can handle feature conjunctions and nonlinearityHoweverImpractical for high dimensional inputs they scale linearly in inputdimensions (in both time and space assuming fixed number of hiddenunits)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 29 123

Related Work

Solution Using dense embeddings for input features

Omer Kırnap (Koc University) MSc Thesis September 27 2018 30 123

Overview

1 IntroductionOverview of Dependency ParsingTransition Based Dependency Parsing

2 Related WorkLinear Models and their DrawbacksNeural Network Models

3 ModelLanguage ModelMLP ParserTree-stack LSTM Parser

4 ResultsMLP vs Tree-stack LSTMMorphological Feature EmbeddingsStatic vs Dynamic Oracle TrainingTransfer Learning

5 Conclusion6 Future Work amp Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 31 123

3 Model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 32 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to UniversalDependencies

CoNLL17bull Koc-University team with MLP Parser using Context Embeddings

CoNLL18bull KParse team with Tree-stack LSTM Parser using Context and

Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 33 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to UniversalDependencies

CoNLL17

bull Koc-University team with MLP Parser using ContextEmbeddings

CoNLL18

bull KParse team with Tree-stack LSTM Parser using Context and Morph-featEmbeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 34 123

a Language Model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 35 123

Language Model (LM)

LM is used to obtain Context and Word embeddings with twocomponents

Character Based LSTM extracts word vectors

Word Based BiLSTM extracts context vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 36 123

Language Model - Word vectors

Character based LSTM generates word Vectors

Figure Character LSTM from Kırnap et al 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 37 123

Language Model - Context Vectors

Word based BiLSTM generates Context Vectors

Figure Word BiLSTM from Kırnap et al 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 38 123

b MLP Parser (CoNLL17)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 39 123

MLP Parser

MLP Parser consists of 4 components

Character Based LSTM extracts word vectors

Word Based BiLSTM extracts context vectors

Feature extractor describes current state

Decision module (MLP) decides the next transition


MLP Parser - Feature Extraction

Feature extractor describes current state

Figure: Kırnap et al. 2017

MLP Parser - Decision Module

Decision module (MLP) decides the next transition
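For illustration, a minimal sketch of such a decision module in plain Python. The weights, feature dimension, and transition inventory here are illustrative assumptions, not the thesis configuration:

```python
import math

def mlp_next_transition(state_features, W1, b1, W2, b2, transitions):
    # One hidden tanh layer scores each candidate transition for the
    # current parser state; the argmax is the predicted next transition.
    hidden = [math.tanh(sum(w * x for w, x in zip(row, state_features)) + b)
              for row, b in zip(W1, b1)]
    scores = [sum(w * h for w, h in zip(row, hidden)) + b
              for row, b in zip(W2, b2)]
    return transitions[max(range(len(scores)), key=scores.__getitem__)]
```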


Experiments amp Dataset (MLP) CoNLL17

CoNLL17 Dataset

Dependency parsing of 81 treebanks in 49 languages

All treebanks use standardized annotation:
  • 17 universal part-of-speech tags
  • 37 universal dependency relations


Experiments - Evaluation Metric

Labeled Attachment Score (LAS): the percentage of words assigned both the correct syntactic head and the correct dependency label.

Example for the sentence "Economic news had ...":
  Gold tree (arcs SBJ, ATT): LAS = 1
  Pred 1 (arcs PRED, OBJ; both arcs wrong): LAS = 0
  Pred 2 (arcs OBJ, ATT; one of two arcs correct): LAS = (1/2) * 100 = 50%
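The metric can be computed directly from per-word (head, label) pairs; a short sketch, with word indices and labels following the example above:

```python
def las(gold, pred):
    # Labeled Attachment Score: fraction of words whose predicted
    # (head, label) pair exactly matches the gold annotation.
    assert len(gold) == len(pred)
    return sum(g == p for g, p in zip(gold, pred)) / len(gold)

# Arcs as (head index, label) per dependent word, for "Economic news had":
gold  = [(2, "ATT"), (3, "SBJ")]   # Economic <- news, news <- had
pred1 = [(3, "PRED"), (3, "OBJ")]  # both arcs wrong
pred2 = [(2, "ATT"), (3, "OBJ")]   # one of two arcs correct
```

Here `las(gold, pred1)` gives 0.0 and `las(gold, pred2)` gives 0.5, matching the slide.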


Experiments (MLP)

CoNLL 2017 Results (all treebanks LAS)

Ranked 1st among transition-based parsers (source: CoNLL17 official results page)

Contributions in CoNLL17


Context and Word Embeddings

Relative contributions of part-of-speech (p), word vector (v), and context vector (c):

Feats   Hungarian  En-ParTUT  Latvian
p       63.6       76.6       55.9
v       73.5       75.9       63.0
c       72.2       76.0       63.5
v-c     76.0       79.0       67.6
p-c     78.0       82.5       70.6
p-v     76.6       80.8       67.7
p-fb    74.7       79.7       66.3
p-v-c   79.3       83.2       74.2


Context and Word Embeddings

Relative contributions of part-of-speech (p), word vector (v), and context vector (c):

Feats   Hungarian  En-ParTUT  Latvian
p       63.6       76.6       55.9
v       73.5       75.9       63.0
c       72.2       76.0       63.5
v-c     76.0       79.0       67.6
p-c     78.0       82.5       70.6
p-v     76.6       80.8       67.7
p-fb    74.7       79.7       66.3
p-v-c   79.3       83.2       74.2

Context vectors provide an independent contribution on top of POS tags


Context and Word embeddings

Feats   Hungarian  En-ParTUT  Latvian
p       63.6       76.6       55.9
v       73.5       75.9       63.0
c       72.2       76.0       63.5
v-c     76.0       79.0       67.6
p-c     78.0       82.5       70.6
p-v     76.6       80.8       67.7
p-fb    74.7       79.7       66.3
p-v-c   79.3       83.2       74.2

Our BiLSTM language model word vectors perform better than the Facebook (fb) vectors


Context and Word embeddings

Feats   Hungarian  En-ParTUT  Latvian
p       63.6       76.6       55.9
v       73.5       75.9       63.0
c       72.2       76.0       63.5
v-c     76.0       79.0       67.6
p-c     78.0       82.5       70.6
p-v     76.6       80.8       67.7
p-fb    74.7       79.7       66.3
p-v-c   79.3       83.2       74.2

Both POS tags and context vectors have significant contributions on top of word vectors


Issues with MLP

However

Choosing the correct state representation of the parser remains critical

We are unable to represent the whole parsing history with feature extraction


Solution

Find a recurrent architecture that can summarize the parsing history as well as the word sequences in the buffer and stack.


Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to Universal Dependencies

CoNLL17
  • Koç-University team with MLP Parser using Context Embeddings

CoNLL18
  • KParse team with Tree-stack LSTM Parser using Context and Morph-feat Embeddings


c Tree-stack LSTM Parser (CoNLL18)


Related Work - Stack LSTM

Figure Stack LSTM [Dyer et al 2015]

Represents each component (σ, β, A) with an LSTM; modifies the head word's embedding with the dependent's embedding


Problems with Stack LSTM

They only modify the stack's word embeddings

Hidden states of the LSTMs are not updated except on reduce transitions

Actions are not explicitly represented

They only used word2vec embeddings [Mikolov et al 2013]


Our solution

We propose

Context embeddings should improve parsing accuracy

Dependency relations should be explicitly represented

Morphological Features of a word may enhance parsing accuracy


Tree-stack LSTM Overview

Figure: Tree-stack LSTM architecture (β-LSTM, σ-LSTM, and Action-LSTM states and the t-RNN head embeddings are concatenated and fed to an MLP)

We propose the Tree-stack LSTM model with 4 components:
1. β-LSTM
2. σ-LSTM
3. Action-LSTM
4. Tree-RNN


Tree-stack LSTM

Input Representation


Input Representation

Action and Dependency Relation Embeddings

Every action is represented with a continuous vector

Every dependency relation is represented with a continuous vector


Input Representation

We do not include an explicit feature extractor. We initialize the word representation by concatenating:

Character-based LSTM's word vectors

Word-based BiLSTM's context vectors

Part-of-speech (POS) vectors

Morph-feat vectors
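The concatenation step can be sketched as follows; plain lists stand in for the four embedding blocks and the dimensions are purely illustrative:

```python
def word_representation(word_vec, context_vec, pos_vec, morph_vec):
    # Input to the parser: one flat vector per word, formed by
    # concatenating the four embedding blocks.
    return word_vec + context_vec + pos_vec + morph_vec
```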


Input Representation

Morph-feat Vectors

Example (the pronoun "It"): Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs

Figure Morph-feat Embeddings
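A CoNLL-U FEATS string like the one above can be split into feature-value pairs before lookup in a morph-feat embedding table; a minimal sketch:

```python
def parse_feats(feats):
    # Split 'Case=Nom|Gender=Neut|...' into (feature, value) pairs;
    # in CoNLL-U an underscore means no morphological features.
    if feats == "_":
        return []
    return [tuple(kv.split("=", 1)) for kv in feats.split("|")]

pairs = parse_feats("Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs")
```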


Tree-stack LSTM

Model Components
1. β-LSTM
2. σ-LSTM
3. Action-LSTM
4. Tree-RNN


β-LSTM

Figure: Tree-stack LSTM architecture (β-LSTM, σ-LSTM, and Action-LSTM states and the t-RNN head embeddings are concatenated and fed to an MLP)


β-LSTM

Figure: Buffer's β-LSTM, an LSTM over the buffer word embeddings w_i, w_i+1, w_i+2


σ-LSTM

Figure: Tree-stack LSTM architecture (β-LSTM, σ-LSTM, and Action-LSTM states and the t-RNN head embeddings are concatenated and fed to an MLP)


σ-LSTM

Figure: Stack's σ-LSTM, an LSTM over the stack word embeddings s_i, s_i+1, s_i+2


Action-LSTM

Figure: Tree-stack LSTM architecture (β-LSTM, σ-LSTM, and Action-LSTM states and the t-RNN head embeddings are concatenated and fed to an MLP)


Action-LSTM

Figure: Action-LSTM, an LSTM over the sequence of past transition embeddings


How are the components of the tree-stack LSTM connected?


Tree-RNN


Tree-RNN (t-RNN)

Figure: t-RNN combines the head word, dependency relation, and dependent word embeddings

w_head_new = tanh(W_rnn * [w_head_old; d_l; w_dep] + b_rnn)    (1)
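Equation (1) in plain Python, with vectors as lists and W_rnn as a list of rows; the sizes are illustrative:

```python
import math

def t_rnn(w_head_old, d_l, w_dep, W_rnn, b_rnn):
    # Eq. (1): new head embedding from the concatenation
    # [w_head_old; d_l; w_dep], an affine map, and a tanh.
    x = w_head_old + d_l + w_dep
    return [math.tanh(sum(w * xi for w, xi in zip(row, x)) + b)
            for row, b in zip(W_rnn, b_rnn)]
```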


Tree-RNN with

1. Left Transition
2. Right Transition


Left Transition


Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: Each embedding is initialized by concatenating POS, language, and morph-feat embeddings


Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: Stack's top LSTM is reduced

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: t-RNN calculates the new head embedding


Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: β-LSTM recalculates its hidden state based on the new input


Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: Tree-stack LSTM is ready for the next transition


Right Transition


Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: Each embedding is initialized by concatenating POS, language, and morph-feat embeddings


Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: Stack's top LSTM is reduced


Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: t-RNN calculates the new head embedding


Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: σ-LSTM recalculates its hidden state from the new input


Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: Tree-stack LSTM is ready for the next transition
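The two arc transitions and shift, as given by the formulas on these slides, can be sketched as operations on a stack, a buffer, and an arc set (word indices only; this sketch ignores the LSTM state updates):

```python
def shift(stack, buffer, arcs):
    stack.append(buffer.pop(0))

def left_arc(stack, buffer, arcs, d):
    # left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)}):
    # the buffer front b becomes head of the popped stack top s.
    s = stack.pop()
    arcs.add((buffer[0], d, s))

def right_arc(stack, buffer, arcs, d):
    # right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)}):
    # the second stack item s becomes head of the popped stack top t.
    t = stack.pop()
    arcs.add((stack[-1], d, t))
```

For example, starting from stack [0] (root) and buffer [1, 2], the sequence shift, left_arc("SBJ"), shift, right_arc("ROOT") yields the arcs {(2, "SBJ", 1), (0, "ROOT", 2)}.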


Final overview of Tree-stack LSTM

Figure: Tree-stack LSTM architecture (β-LSTM, σ-LSTM, and Action-LSTM states and the t-RNN head embeddings are concatenated and fed to an MLP)


Overview

1 Introduction
  • Overview of Dependency Parsing
  • Transition Based Dependency Parsing

2 Related Work
  • Linear Models and their Drawbacks
  • Neural Network Models

3 Model
  • Language Model
  • MLP Parser
  • Tree-stack LSTM Parser

4 Results
  • MLP vs Tree-stack LSTM
  • Morphological Feature Embeddings
  • Static vs Dynamic Oracle Training
  • Transfer Learning

5 Conclusion

6 Future Work & Discussions


4 Results amp Comparisons


Results amp Comparisons

Dataset

CoNLL17:
  • Dependency parsing of 81 treebanks in 49 languages
  • All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations
  • Koç-University ranked 7th out of 33 participants (1st among transition-based parsers)

CoNLL18:
  • Dependency parsing of 82 treebanks in 57 languages
  • All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations
  • Koç-University ranked 16th out of 30 participants (2nd among transition-based parsers)

Changes from CoNLL17 to CoNLL18: 1. train/test split, 2. annotation


MLP vs Tree-stack LSTM

The CoNLL 2018 committee released comparison results of CoNLL17 and CoNLL18 systems tested under the same test sets


MLP vs Tree-stack LSTM

Two possible problems with the official comparison:

1. If the annotation of the treebank has improved, the older parser is handicapped.

2. If the training-test split has changed and old training data are now in the test data, the old parser is favored undeservedly.


MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code        MLP    Tree-stack
ru taiga (10k)   58.89  60.55
hu szeged (20k)  66.21  68.18
tr imst (50k)    56.78  58.75
ar padt (120k)   67.83  68.14
en ewt (205k)    74.87  75.77
cs cac (473k)    83.39  83.57

Tree-stack LSTM outperforms MLP


Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM


MLP Parser

Figure: Initial model (MLP only)


Only Action LSTM

Figure: Only Action-LSTM


Only β-LSTM

Figure: Only β-LSTM


Only σ-LSTM

Figure: Only σ-LSTM


Ablation Analysis Results

Lang Code   MLP    Only Action  Only-β  Only-σ
hu szeged   66.21  66.87        66.94   67.03
sv lines    71.12  72.05        72.17   72.45
tr imst     57.12  56.87        57.02   57.12
ar padt     67.83  66.67        66.89   66.92
cs cac      83.89  82.23        83.13   83.17
en ewt      75.54  75.43        75.56   75.67

Table: Comparison between MLP and "Only" models


Ablation of t-RNN

Figure: Tree-stack LSTM architecture (β-LSTM, σ-LSTM, and Action-LSTM states and the t-RNN head embeddings are concatenated and fed to an MLP)


Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang Code            without t-RNN  with t-RNN
no nynorsklia (3k)   51.78          53.33
ru taiga (11k)       59.13          60.55
gl treegal (15k)     69.76          70.45
hu szeged (20k)      66.12          68.18
sv lines (49k)       74.04          75.46
tr imst (50k)        58.12          58.75
ar padt (120k)       68.04          68.14
en ewt (204k)        74.87          75.77
cs cac (473k)        82.89          83.57
cs pdt (1M)          81.17          81.164

t-RNN provides a comparative advantage for low-resource languages


Ablation Analysis

Overall results of ablation analysis

Lang        MLP    Only A  Only-β  Only-σ  w/o t-RNN  all
hu szeged   66.21  66.87   66.94   67.03   66.12      68.18
sv lines    71.12  72.05   72.17   74.04   72.17      75.46
tr imst     57.12  56.87   57.02   57.12   58.12      58.75
ar padt     67.83  66.67   66.89   66.92   68.04      68.14
cs cac      83.89  82.23   83.13   83.17   82.89      83.57
en ewt      75.54  75.43   75.56   75.67   74.87      75.77

Tree-stack LSTM beats other model variations


Ablation Analysis

Conclusions of Ablation Experiments

t-RNN's performance contribution increases as the training size decreases

σ-LSTM provides more useful information independent of dataset size

Interconnecting the model's components with the t-RNN makes the tree-stack LSTM more powerful for low-resource languages (ranked 10th overall and 2nd among transition-based parsers)


What does Morphological Feature Embedding provide


Contribution of Morph-feat Embeddings

Experimental Settings: We divide the CoNLL18 UD dataset into 4 parts based on the number of training tokens for each language, to better understand our contributions

Languages having less than 20k tokens

Languages having more than 20k and less than 50k tokens

Languages having more than 50k and less than 100k tokens

Languages having 100k tokens or more
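The grouping can be written as a simple bucketing function; the thresholds come from the list above, and the token counts used below are taken from the result tables:

```python
def size_bucket(n_train_tokens):
    # Assign a treebank to one of the four training-size groups.
    if n_train_tokens < 20_000:
        return "<20k"
    if n_train_tokens < 50_000:
        return "20k-50k"
    if n_train_tokens < 100_000:
        return "50k-100k"
    return ">=100k"
```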


Contribution of Morph-feat embeddings

Morph-feat experiments for languages having less than 20k training tokens

Lang code      Morph-Feats  no Morph-Feats  # of tokens
no nynorsklia  51.13        53.33             3,583
ru taiga       58.32        60.55            10,479
sme giella     52.78        53.39            16,385
la perseus     49.93        51.60            18,184
ug udt         52.78        53.39            19,262
sl sst         46.72        48.77            19,473
hu szeged      66.23        68.18            20,166

Not useful for languages having less than 20k training tokens


Contribution of Morph-feat embeddings

Morph-feat experiments for languages having between 50k and 100k training tokens

Lang code      Morph-Feats  no Morph-Feats  # of tokens
sv lines       72.18        74.81            48,325
fr sequoia     84.36        82.17            50,543
en gum         76.44        75.34            53,686
ko gsd         73.74        72.54            56,687
eu bdt         74.55        73.32            72,974
nl lassysmall  76.7         75.8             75,134
gl ctg         79.02        79.018           79,327
lv lvtb        72.33        72.24            80,666
id gsd         75.76        73.97            97,531

Beneficial for languages with 50k-100k training tokens


Contribution of Morph-feat embeddings

Morph-feat experiments for languages having more than 100k training tokens

Lang code  Morph-Feats  no Morph-Feats  # of tokens
fa seraji  81.18        81.12             121,064
bg btb     84.53        84.55             124,336
en ewt     75.77        75.682            204,585
ar padt    68.02        68.14             223,881
de gsd     71.59        71.32             263,804
ca ancora  85.89        85.874            417,587
es ancora  84.99        84.78             444,617
cs cac     83.57        83.63             472,608
cs pdt     81.43        82.12           1,173,282

Neutral for languages having more than 100k training tokens


Static vs Dynamic Oracle Training

Static oracle: training transitions follow the gold moves
Dynamic oracle: training transitions follow the predicted moves

In both cases the log-probability of the gold moves is maximized
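A sketch of one training step under each regime; the softmax scorer and the exploration rate are illustrative assumptions, while in both regimes the loss is the negative log-probability of the gold move:

```python
import math, random

def softmax(scores):
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def oracle_step(scores, gold_move, dynamic, rng, explore=1.0):
    # The loss always maximizes log p(gold move); what differs is the
    # move used to advance the parser: static follows the gold move,
    # dynamic (with some probability) follows the model's prediction.
    loss = -math.log(softmax(scores)[gold_move])
    predicted = max(range(len(scores)), key=scores.__getitem__)
    taken = predicted if dynamic and rng.random() < explore else gold_move
    return loss, taken
```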

Figure: Tree-stack LSTM architecture (β-LSTM, σ-LSTM, and Action-LSTM states and the t-RNN head embeddings are concatenated and fed to an MLP)


Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k


Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k


Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k


How about languages with less than 20k training tokens?


Transfer Learning

There are 4 possible types of transfer learning:

1. Using very limited data to train an LM for word and context vectors, and using them to train a parser from scratch
2. Using Facebook's word vectors to train a parser [Bojanowski et al. 2017]
3. Using my own word and context vectors trained on a different language from the same language family
4. Applying transfer learning with a pre-trained parser

Language       (1)           (2)    (3)    (4)
af afribooms   not provided  75.46  77.43  78.12
kk ktb         20.19         22.31  21.96  23.86
bxr bdt         7.64          9.76   9.93   8.98
kmr mg         20.12         22.57  22.78  23.39

Table: LAS values for strategies (1), (2), (3), and (4)


Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the most beneficial

Training an LM from scratch on very limited data does not yield useful word and context vectors

Our word and context vectors are still more useful than Facebook's [Bojanowski et al. 2017]


Projectivity

Transition-based parsers can only build projective trees

Figure from http://stp.lingfil.uu.se/~sara/kurser/5LN455-2014/lectures/5LN455-F8.pdf
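Projectivity can be checked by testing whether any two dependency arcs cross; a small sketch using 1-based word indices with 0 for the root:

```python
def is_projective(heads):
    # heads[i-1] is the head of word i; an arc spans (min, max) of its
    # two endpoints, and the tree is projective iff no two arcs cross.
    arcs = [tuple(sorted((i, h))) for i, h in enumerate(heads, start=1)]
    for a, b in arcs:
        for c, d in arcs:
            if a < c < b < d:  # arcs (a,b) and (c,d) cross
                return False
    return True
```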


Projective vs Non-projective

We compared our model with the best model at different projectivity ratios

Language      Projectivity  Best (LAS)  Ours (LAS)
grc perseus    90.7         79.39       55.03 (20)
eu bdt         95.13        84.22       74.13 (17)
hu szeged      97.8         82.66       68.18 (14)
da ddt         98.26        86.28       76.40 (17)
en gum         99.6         85.05       76.44 (15)
gl treegal    100           74.25       70.45 (10)
gl ctg        100           82.12       79.45 (14)

Table: Our model's performance gap decreases as the projectivity ratio increases (from the official results page and our projectivity table)

Conclusions


Conclusion

In conclusion: we introduced Context, Word, and Morph-feat embeddings and showed their contribution in transition-based dependency parsing

Our Tree-stack LSTM outperformed the MLP by removing hand-crafted feature engineering

Tree-stack LSTM performed better on low-resource languages

When the training dataset size increases, the tree-stack LSTM loses its advantage


Future Research Direction

End-to-End Training

Systems jointly trained for tokenization, morphological tagging, and dependency parsing performed better. Some also jointly train a language model together with pre-trained embeddings.

Attention Mechanism

Applying attention between the σ-LSTM, β-LSTM, or Action-LSTM states may bring a performance improvement.

Morphological Features

Finding a different way to represent morphological features.

Dynamic Oracle vs Beam Training

Although I tried both of them, I could not obtain a performance improvement. There may be convergence problems with our loss function, and other losses (e.g. CRF) may solve this problem.


Publications

Ömer Kırnap, Erenay Dayanık, and Deniz Yuret. 2018. Tree-stack LSTM in Transition Based Dependency Parsing. In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Ömer Kırnap, Berkay Furkan Önder, and Deniz Yuret. 2017. Parsing with Context Embeddings. In Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.


References

Marco Kuhlmann, Carlos Gómez-Rodríguez, and Giorgio Satta. 2011. Dynamic programming algorithms for transition-based dependency parsers. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Volume 1. Association for Computational Linguistics, pages 673-682.

S. Kübler, R. McDonald, and J. Nivre. 2009. Dependency Parsing. Morgan & Claypool, US.

Chris Dyer, Miguel Ballesteros, Wang Ling, Austin Matthews, and Noah A. Smith. 2015. Transition-based dependency parsing with stack long short-term memory. CoRR abs/1505.08075.


Thank you for your attention


Questions


  • Introduction
    • Overview of Dependency Parsing
    • Transition Based Dependency Parsing
      • Related Work
        • Linear Models and their Drawbacks
        • Neural Network Models
          • Model
            • Language Model
            • MLP Parser
            • Tree-stack LSTM Parser
              • Results
                • MLP vs Tree-stack LSTM
                • Morphological Feature Embeddings
                • Static vs Dynamic Oracle Training
                • Transfer Learning
                  • Conclusion
                  • Future Work amp Discussions
Page 29: Transition Based Dependency Parsing with Deep LearningTransition Based Dependency Parsing with Deep Learning Omer K rnap Ko˘c University okirnap@ku.edu.tr September 27, 2018 Omer

Related Work

Neural Networks for Feature Conjunctions

Neural networks can handle feature conjunctions and nonlinearityHoweverImpractical for high dimensional inputs they scale linearly in inputdimensions (in both time and space assuming fixed number of hiddenunits)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 29 123

Related Work

Solution Using dense embeddings for input features

Omer Kırnap (Koc University) MSc Thesis September 27 2018 30 123

Overview

1 IntroductionOverview of Dependency ParsingTransition Based Dependency Parsing

2 Related WorkLinear Models and their DrawbacksNeural Network Models

3 ModelLanguage ModelMLP ParserTree-stack LSTM Parser

4 ResultsMLP vs Tree-stack LSTMMorphological Feature EmbeddingsStatic vs Dynamic Oracle TrainingTransfer Learning

5 Conclusion6 Future Work amp Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 31 123

3 Model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 32 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to UniversalDependencies

CoNLL17bull Koc-University team with MLP Parser using Context Embeddings

CoNLL18bull KParse team with Tree-stack LSTM Parser using Context and

Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 33 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to UniversalDependencies

CoNLL17

bull Koc-University team with MLP Parser using ContextEmbeddings

CoNLL18

bull KParse team with Tree-stack LSTM Parser using Context and Morph-featEmbeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 34 123

a Language Model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 35 123

Language Model (LM)

LM is used to obtain Context and Word embeddings with twocomponents

Character Based LSTM extracts word vectors

Word Based BiLSTM extracts context vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 36 123

Language Model - Word vectors

Character based LSTM generates word Vectors

Figure Character LSTM from Kırnap et al 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 37 123

Language Model - Context Vectors

Word based BiLSTM generates Context Vectors

Figure Word BiLSTM from Kırnap et al 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 38 123

b MLP Parser (CoNLL17)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 39 123

MLP Parser

MLP Parser consists of 4 components

Character Based LSTM extracts word vectors

Word Based BiLSTM extracts context vectors

Feature extractor describes current state

Decision module (MLP) decides the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 40 123

MLP Parser - Feature Extraction

Feature extractor describes current state

Figure Kırnap et al 2017Omer Kırnap (Koc University) MSc Thesis September 27 2018 41 123

MLP Parser - Decision Module

Decision module (MLP) decides the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 42 123

Experiments amp Dataset (MLP) CoNLL17

CoNLL17 Dataset

Dependency parsing of 81 treebanks in 49 languages

All treebanks use standardized annotationbull 17 universal part-of-speech tagsbull 37 universal dependency relations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 43 123

Experiments - Evaluation Metric

Labeled Attachment Score (LAS)The percentage of words correctly assigned both the correct syntactic headand the correct dependency label

Economic news hadGold Tree LAS 1

SBJATT

Economic news had

PREDOBJPred 1 LAS 0

Economic news had

OBJATTPred 2 LAS (frac12)100

Omer Kırnap (Koc University) MSc Thesis September 27 2018 44 123

Experiments (MLP)

CoNLL 2017 Results (all treebanks LAS)

Ranked 1st among transition based parsers 5

5Source CoNLL17 official results pageOmer Kırnap (Koc University) MSc Thesis September 27 2018 45 123

Contributions in CoNLL17

Omer Kırnap (Koc University) MSc Thesis September 27 2018 46 123

Context and Word Embeddings

Relative contributions of part-of-speech (p) word vector (v)context vector (c)

Feats Hungarian En-ParTUT Latvianp 636 766 559

v 735 759 63

c 722 76 635

v-c 76 79 676

p-c 78 825 706

p-v 766 808 677

p-fb 747 797 663

p-v-c 793 832 742

Omer Kırnap (Koc University) MSc Thesis September 27 2018 47 123

Context and Word Embeddings

Relative contributions of part-of-speech (p) word vector (v)context vector (c)

Feats Hungarian En-ParTUT Latvianp 636 766 559

v 735 759 63

c 722 76 635

v-c 76 79 676

p-c 78 825 706

p-v 766 808 677

p-fb 747 797 663

p-v-c 793 832 742

Context vectors provide independent contribution on top ofPOS tags

Omer Kırnap (Koc University) MSc Thesis September 27 2018 48 123

Context and Word embeddings

Feats Hungarian En-ParTUT Latvianp 636 766 559

v 735 759 63

c 722 76 635

v-c 76 79 676

p-c 78 825 706

p-v 766 808 677p-fb 747 797 663

p-v-c 793 832 742

Our BiLSTM language model word vectors perform betterthan FB vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 49 123

Context and Word embeddings

Feats Hungarian En-ParTUT Latvianp 636 766 559

v 735 759 63

c 722 76 635

v-c 76 79 676

p-c 78 825 706

p-v 766 808 677

p-fb 747 797 663

p-v-c 793 832 742

Both POS tags and context vectors have significantcontributions on top of word vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 50 123

Issues with MLP

However

Choosing correct state of parser still remains critical

We are unable to represent whole parsing history with featureextracting

Omer Kırnap (Koc University) MSc Thesis September 27 2018 51 123

Solution

Find a recurrent architecture such that it can summarize the parsinghistory as well as word sequences in a buffer and stack

Omer Kırnap (Koc University) MSc Thesis September 27 2018 52 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to UniversalDependencies

CoNLL17

bull Koc-University team with MLP Parser using Context Embeddings

CoNLL18

bull KParse team with Tree-stack LSTM Parser usingContext and Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 53 123

c Tree-stack LSTM Parser (CoNLL18)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 54 123

Related Work - Stack LSTM

Figure Stack LSTM [Dyer et al 2015]

Represent each component (σ β A) with an LSTMModifying head wordrsquos embedding with dependent embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 55 123

Problems with Stack LSTM

They only modify stackrsquos word embeddings

Hidden states of LSTMS are not updated unless reduce

Actions are not explicitly represented

They only used word2vec embeddings [Mikolov et al 2013]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 56 123

Our solution

We propose

Context embeddings should improve parsing accuracy

Dependency relations should be explicitly represented

Morphological Features of a word may enhance parsing accuracy

Omer Kırnap (Koc University) MSc Thesis September 27 2018 57 123

Tree-stack LSTM Overview

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

We propose Tree-stack LSTM model with 4 components

β-LSTMσ-LSTMAction-LSTMTree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 58 123

Tree-stack LSTM

Input Representation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 59 123

Input Representation

Action and Dependency Relation Embeddings

Every action is represented with continuous vector

Every dependency relation is represented with continuous vector

Omer Kırnap (Koc University) MSc Thesis September 27 2018 60 123

Input Representation

We do not include explicit feature extractor We initiated wordrepresentation by concatenating

Character Based LSTMrsquos word vectors

Word Based BiLSTMrsquos context vectors

Part-of-speech (POS) vectors

Morph-feat vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 61 123

Input Representation

Morp-feat Vectors

Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs IT It

Figure Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 62 123

Tree-stack LSTM

Model Components1 β-LSTM2 σ-LSTM3 Action-LSTM4 Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 63 123

β-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 64 123

β-LSTM

LSTM LSTM LSTM

wi+2wi+1wi

Figure Bufferrsquos β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 65 123

σ-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 66 123

σ-LSTM

LSTM LSTM LSTM

si si+1 si+2

Figure Stackrsquos σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 67 123

Action-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 68 123

Action-LSTM

LSTM LSTM LSTM

Figure Action-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 69 123

How are the components of tree-stack LSTM connected?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 70 123

Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 71 123

Tree-RNN (t-RNN)

t-RNN

Dependent word

Dependency Relation

Head word

Figure t-RNN

w_head^new = tanh(W_rnn · [w_head^old; d_l; w_dep] + b_rnn) (1)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 72 123
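Eq. (1) can be written out directly; the weight values and the embedding size below are random placeholders, only the functional form follows the slide.

```python
import numpy as np

rng = np.random.default_rng(1)
D = 6                                  # illustrative embedding size
W_rnn = rng.normal(size=(D, 3 * D))    # maps [head; relation; dependent] -> D
b_rnn = np.zeros(D)

def t_rnn(w_head_old, d_l, w_dep):
    # Eq. (1): compose the head, dependency relation, and dependent
    # embeddings into a new head embedding.
    x = np.concatenate([w_head_old, d_l, w_dep])
    return np.tanh(W_rnn @ x + b_rnn)

new_head = t_rnn(np.ones(D), np.zeros(D), np.ones(D))
```

The tanh keeps the composed head embedding in the same range as the original embeddings, so the update can be applied repeatedly as heads collect dependents.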

Tree-RNN with

1 Left Transition2 Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 73 123

Left Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 74 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

LSTM LSTM LSTM LSTM

Left transition

t-RNN

Dependency Relation

HeadDependent

Figure Each embedding is initialized by concatenating POS, language, and morph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 75 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

Figure Stack's top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 76 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 77 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure β-LSTM recalculates its hidden state based on the new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 78 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure Tree-stack LSTM is ready to predict the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 79 123

Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 80 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

LSTM LSTM LSTM LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure Each embedding is initialized by concatenating POS, language, and morph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 81 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure Stack's top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 82 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 83 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure σ-LSTM recalculates its hidden state from the new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 84 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure Tree-stack LSTM is ready to predict the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 85 123
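The left and right transitions above can be sketched as pure functions on a (stack, buffer, arcs) state; the toy tokens below are hypothetical and the functions only mirror the set-notation definitions from the slides.

```python
def left_arc(stack, buffer, arcs, d):
    # left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)}):
    # pop stack top s and attach it to buffer front b with label d.
    s, b = stack[-1], buffer[0]
    return stack[:-1], buffer, arcs | {(b, d, s)}

def right_arc(stack, buffer, arcs, d):
    # right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)}):
    # pop stack top t and attach it to the next stack item s with label d.
    t, s = stack[-1], stack[-2]
    return stack[:-1], buffer, arcs | {(s, d, t)}

stack, buffer, arcs = ["ROOT", "news"], ["had"], set()
stack, buffer, arcs = left_arc(stack, buffer, arcs, "nsubj")
```

After the left transition the t-RNN would additionally fold the dependent's embedding into the head's embedding, as shown on the preceding slides.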

Final overview of Tree-stack LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 86 123

Overview

1. Introduction: Overview of Dependency Parsing; Transition Based Dependency Parsing
2. Related Work: Linear Models and their Drawbacks; Neural Network Models
3. Model: Language Model; MLP Parser; Tree-stack LSTM Parser
4. Results: MLP vs Tree-stack LSTM; Morphological Feature Embeddings; Static vs Dynamic Oracle Training; Transfer Learning
5. Conclusion
6. Future Work & Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 87 123

4. Results & Comparisons

Omer Kırnap (Koc University) MSc Thesis September 27 2018 88 123

Results & Comparisons

Dataset

CoNLL17:
Dependency parsing of 81 treebanks in 49 languages
All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations
Koc-University ranked 7th out of 33 participants (1st among transition based parsers)

CoNLL18:
Dependency parsing of 82 treebanks in 57 languages
All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations
Koc-University ranked 16th out of 30 participants (2nd among transition based parsers)

Changes from CoNLL17 to CoNLL18: 1. Train/test split change 2. Annotation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 89 123

MLP vs Tree-stack LSTM

The CoNLL 2018 committee released comparison results of CoNLL17 and CoNLL18 systems tested on the same test sets

Omer Kırnap (Koc University) MSc Thesis September 27 2018 90 123

MLP vs Tree-stack LSTM

2 possible problems of the official comparison:

1. If the annotation of the treebank is improved, the older parser is handicapped

2. If the training-test split has changed and the old training data are now in the test data, the old parser is favored undeservedly

Omer Kırnap (Koc University) MSc Thesis September 27 2018 91 123

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code | MLP | Tree-stack
ru taiga (10k) | 58.89 | 60.55
hu szeged (20k) | 66.21 | 68.18
tr imst (50k) | 56.78 | 58.75
ar padt (120k) | 67.83 | 68.14
en ewt (205k) | 74.87 | 75.77
cs cac (473k) | 83.39 | 83.57

Tree-stack LSTM outperforms MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 92 123

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 93 123

MLP Parser

MLP

Figure Initial model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 94 123

Only Action LSTM

LSTM LSTM

Figure Only action LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 95 123

Only β-LSTM

LSTM LSTM LSTM

LSTM LSTM MLP

Figure Only β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 96 123

Only σ-LSTM

LSTM LSTM

LSTM LSTM MLP

Figure Only σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang Code | MLP | Only Action | Only-β | Only-σ
hu szeged | 66.21 | 66.87 | 66.94 | 67.03
sv lines | 71.12 | 72.05 | 72.17 | 72.45
tr imst | 57.12 | 56.87 | 57.02 | 57.12
ar padt | 67.83 | 66.67 | 66.89 | 66.92
cs cac | 83.89 | 82.23 | 83.13 | 83.17
en ewt | 75.54 | 75.43 | 75.56 | 75.67

Table Comparison between MLP and "Only" models

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 99 123

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang Code | without t-RNN | with t-RNN
no nynorsklia (3k) | 51.78 | 53.33
ru taiga (11k) | 59.13 | 60.55
gl treegal (15k) | 69.76 | 70.45
hu szeged (20k) | 66.12 | 68.18
sv lines (49k) | 74.04 | 75.46
tr imst (50k) | 58.12 | 58.75
ar padt (120k) | 68.04 | 68.14
en ewt (204k) | 74.87 | 75.77
cs cac (473k) | 82.89 | 83.57
cs pdt (1M) | 81.17 | 81.164

t-RNN provides a comparative advantage for low-resource languages

Omer Kırnap (Koc University) MSc Thesis September 27 2018 100 123

Ablation Analysis

Overall results of ablation analysis

Lang | MLP | Only A | Only-β | Only-σ | w/o t-RNN | all
hu szeged | 66.21 | 66.87 | 66.94 | 67.03 | 66.12 | 68.18
sv lines | 71.12 | 72.05 | 72.17 | 74.04 | 72.17 | 75.46
tr imst | 57.12 | 56.87 | 57.02 | 57.12 | 58.12 | 58.75
ar padt | 67.83 | 66.67 | 66.89 | 66.92 | 68.04 | 68.14
cs cac | 83.89 | 82.23 | 83.13 | 83.17 | 82.89 | 83.57
en ewt | 75.54 | 75.43 | 75.56 | 75.67 | 74.87 | 75.77

Tree-stack LSTM beats other model variations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of Ablation Experiments

t-RNN's performance contribution increases when the training size decreases

σ-LSTM provides more useful information independent of dataset size

Interconnecting the model's components with t-RNN makes tree-stack LSTM more powerful for low-resource languages (ranked 10th overall and 2nd among transition based parsers)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What does Morphological Feature Embedding provide

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental Settings: We divide the CoNLL18 UD v2.2 dataset into 4 parts based on the number of training tokens for each language, to better understand our contributions:

Languages having less than 20k tokens

Languages having more than 20k and less than 50k tokens

Languages having more than 50k and less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123
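The four groups can be stated as a simple bucketing rule; the thresholds are taken from the list above, while the group labels are my own shorthand.

```python
def bucket(n_tokens):
    # Assign a language to one of the four training-size groups.
    if n_tokens < 20_000:
        return "<20k"
    if n_tokens < 50_000:
        return "20k-50k"
    if n_tokens < 100_000:
        return "50k-100k"
    return ">=100k"
```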

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having less than 20k training tokens:

Lang code | Morph-Feats | no Morph-Feats | # of tokens
no nynorsklia | 51.13 | 53.33 | 3583
ru taiga | 58.32 | 60.55 | 10479
sme giella | 52.78 | 53.39 | 16385
la perseus | 49.93 | 51.6 | 18184
ug udt | 52.78 | 53.39 | 19262
sl sst | 46.72 | 48.77 | 19473
hu szeged | 66.23 | 68.18 | 20166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having between 50k and 100k training tokens:

Lang code | Morph-Feats | no Morph-Feats | # of tokens
sv lines | 72.18 | 74.81 | 48325
fr sequoia | 84.36 | 82.17 | 50543
en gum | 76.44 | 75.34 | 53686
ko gsd | 73.74 | 72.54 | 56687
eu bdt | 74.55 | 73.32 | 72974
nl lassymal | 76.7 | 75.8 | 75134
gl ctg | 79.02 | 79.018 | 79327
lv lvtb | 72.33 | 72.24 | 80666
id gsd | 75.76 | 73.97 | 97531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having more than 100k training tokens:

Lang code | Morph-Feats | no Morph-Feats | # of tokens
fa seraji | 81.18 | 81.12 | 121064
bg btb | 84.53 | 84.55 | 124336
en ewt | 75.77 | 75.682 | 204585
ar padt | 68.02 | 68.14 | 223881
de gsd | 71.59 | 71.32 | 263804
ca ancora | 85.89 | 85.874 | 417587
es ancora | 84.99 | 84.78 | 444617
cs cac | 83.57 | 83.63 | 472608
cs pdt | 81.43 | 82.12 | 1173282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle: transitions using gold moves. Dynamic oracle: transitions using predicted moves.

In both cases, the log-probability of gold moves is maximized.

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123
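The difference between the two regimes can be sketched with a toy state space; the "model" here is just a fixed probability of predicting the gold move, a stand-in for the real parser, and the integer states are purely illustrative.

```python
import math
import random

def train_step(n_transitions, p_gold, mode, rng):
    # Sum of -log p(gold move) over a transition sequence. In "static"
    # mode the parser follows gold moves; in "dynamic" mode it follows
    # its own predictions, so it also visits states the gold sequence
    # would never reach. The objective is identical in both modes.
    loss, state, visited = 0.0, 0, []
    for _ in range(n_transitions):
        loss += -math.log(p_gold)          # maximize log p of the gold move
        if mode == "static":
            move = +1                      # follow the gold transition
        else:
            move = +1 if rng.random() < p_gold else -1  # follow prediction
        state += move
        visited.append(state)
    return loss, visited

rng = random.Random(0)
loss_s, path_s = train_step(5, 0.8, "static", rng)
loss_d, path_d = train_step(5, 0.8, "dynamic", rng)
```

Both runs accumulate the same loss; only the visited states differ, which is exactly what exposes the dynamic-oracle parser to its own mistakes.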

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning:

1. Using very limited data to train an LM for word and context vectors, and using them to train a parser from scratch
2. Using Facebook's word vectors to train a parser [Bojanowski et al. 2017]
3. Using my own word and context vectors trained on a different language from the same language family
4. Applying transfer learning with a pre-trained parser

Language | (1) | (2) | (3) | (4)
af afribooms | not provided | 75.46 | 77.43 | 78.12
kk ktb | 20.19 | 22.31 | 21.96 | 23.86
bxr bdt | 7.64 | 9.76 | 9.93 | 8.98
kmr mg | 20.12 | 22.57 | 22.78 | 23.39

Table LAS values for strategies (1) (2) (3) and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments:

Applying transfer learning with a pre-trained parser is the most beneficial

From-scratch LM training does not produce useful word and context vectors

Our word and context vectors are still more useful than Facebook's [Bojanowski et al. 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition based parsers can only build projective trees 6

6 Figure from https://stp.lingfil.uu.se/~sara/kurser/5LN455-2014/lectures/5LN455-F8.pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123
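A tree is projective when no two dependency arcs cross; a brute-force check is only a few lines. The `heads` encoding, with 0 as the artificial root, follows the usual CoNLL convention; the example sentences are toy inputs.

```python
def is_projective(heads):
    # heads[i-1] is the head index of word i; index 0 is the artificial root.
    # A tree is projective iff no two arcs cross when drawn above the words.
    arcs = [(min(h, d), max(h, d)) for d, h in enumerate(heads, start=1)]
    for a, b in arcs:
        for c, d in arcs:
            if a < c < b < d:      # arcs (a, b) and (c, d) cross
                return False
    return True

# word 2 is the root; words 1 and 3 attach to it (projective chain)
projective = is_projective([2, 0, 2])
```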

Projective vs Non-projective

We compared our model with the best model for different projectivity ratios

Language | Projectivity % | Best (LAS) | Our (LAS)

grc perseus | 90.7 | 79.39 | 55.03 (20)
eu bdt | 95.13 | 84.22 | 74.13 (17)
hu szeged | 97.8 | 82.66 | 68.18 (14)
da ddt | 98.26 | 86.28 | 76.40 (17)
en gum | 99.6 | 85.05 | 76.44 (15)
gl treegal | 100 | 74.25 | 70.45 (10)
gl ctg | 100 | 82.12 | 79.45 (14)

Table Our model's performance gap decreases as the projectivity ratio increases

7

7 From the official results page and our projectivity table

Omer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusion: We introduced "Context, Word and Morph-feat" embeddings and showed their contribution in transition based dependency parsing

Our Tree-stack LSTM outperformed MLP by removing hand-crafted feature engineering

Tree-stack LSTM performed better with low-resource languages

When the training dataset size increases, tree-stack LSTM loses its advantage

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization, morphological tagging, and dependency parsing performed better. Some also jointly train a language model together with pre-trained embeddings.

Attention Mechanism

Applying attention between σ-LSTM, β-LSTM, or Action-LSTM states may bring a performance improvement

Morphological Features

Finding a different way to represent morphological features

Dynamic Oracle vs Beam Training

Although I tried both of them, I could not obtain a performance improvement. There may be convergence problems with our loss function, and other losses (e.g., CRF) may solve this problem.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap, Erenay Dayanık, and Deniz Yuret. 2018. Tree-stack LSTM in Transition Based Dependency Parsing. In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap, Berkay Furkan Onder, and Deniz Yuret. 2017. Parsing with Context Embeddings. In Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann, Carlos Gómez-Rodríguez, and Giorgio Satta. 2011. Dynamic programming algorithms for transition-based dependency parsers. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Volume 1, Association for Computational Linguistics, pages 673-682.

S. Kübler, R. McDonald, and J. Nivre. 2009. Dependency Parsing. Morgan & Claypool, US.

Chris Dyer, Miguel Ballesteros, Wang Ling, Austin Matthews, and Noah A. Smith. 2015. Transition-based dependency parsing with stack long short-term memory. CoRR abs/1505.08075.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123

Page 30: Transition Based Dependency Parsing with Deep Learning. Omer Kırnap, Koç University, [email protected]. September 27, 2018.

Related Work

Solution: Using dense embeddings for input features

Omer Kırnap (Koc University) MSc Thesis September 27 2018 30 123

Overview

1. Introduction: Overview of Dependency Parsing; Transition Based Dependency Parsing
2. Related Work: Linear Models and their Drawbacks; Neural Network Models
3. Model: Language Model; MLP Parser; Tree-stack LSTM Parser
4. Results: MLP vs Tree-stack LSTM; Morphological Feature Embeddings; Static vs Dynamic Oracle Training; Transfer Learning
5. Conclusion
6. Future Work & Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 31 123

3 Model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 32 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to Universal Dependencies:

CoNLL17
• Koc-University team with MLP Parser using Context Embeddings

CoNLL18
• KParse team with Tree-stack LSTM Parser using Context and Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 33 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to Universal Dependencies:

CoNLL17

• Koc-University team with MLP Parser using Context Embeddings

CoNLL18

• KParse team with Tree-stack LSTM Parser using Context and Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 34 123

a Language Model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 35 123

Language Model (LM)

The LM is used to obtain Context and Word embeddings with two components:

Character Based LSTM extracts word vectors

Word Based BiLSTM extracts context vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 36 123
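The two components can be sketched with plain recurrences; a simple tanh RNN cell stands in for the LSTM cells here, and all weights and sizes are random placeholders, not the trained language model.

```python
import numpy as np

rng = np.random.default_rng(2)
D = 8
Wc = rng.normal(scale=0.1, size=(D, D + 1))   # char cell: [state; char] -> state

def word_vector(word):
    # Character-level recurrence standing in for the char-LSTM.
    h = np.zeros(D)
    for ch in word:
        h = np.tanh(Wc @ np.concatenate([h, [ord(ch) / 128.0]]))
    return h

Wf = rng.normal(scale=0.1, size=(D, 2 * D))   # forward word-level cell
Wb = rng.normal(scale=0.1, size=(D, 2 * D))   # backward word-level cell

def context_vectors(words):
    # Word-level bidirectional recurrence standing in for the BiLSTM.
    vecs = [word_vector(w) for w in words]
    fwd, h = [], np.zeros(D)
    for v in vecs:
        h = np.tanh(Wf @ np.concatenate([h, v]))
        fwd.append(h)
    bwd, h = [], np.zeros(D)
    for v in reversed(vecs):
        h = np.tanh(Wb @ np.concatenate([h, v]))
        bwd.append(h)
    return [np.concatenate([f, b]) for f, b in zip(fwd, reversed(bwd))]

ctx = context_vectors(["Economic", "news", "had"])
```

Each word gets a vector from its characters, and each position gets a context vector from the forward and backward passes over the whole sentence.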

Language Model - Word vectors

A character based LSTM generates word vectors

Figure Character LSTM from Kırnap et al 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 37 123

Language Model - Context Vectors

A word based BiLSTM generates context vectors

Figure Word BiLSTM from Kırnap et al 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 38 123

b MLP Parser (CoNLL17)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 39 123

MLP Parser

MLP Parser consists of 4 components

Character Based LSTM extracts word vectors

Word Based BiLSTM extracts context vectors

Feature extractor describes current state

Decision module (MLP) decides the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 40 123

MLP Parser - Feature Extraction

Feature extractor describes current state

Figure Kırnap et al 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 41 123

MLP Parser - Decision Module

Decision module (MLP) decides the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 42 123
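The decision module can be sketched as a one-hidden-layer MLP with a softmax over transitions; the feature dimension, layer sizes, and weights below are illustrative placeholders, not the actual trained parameters.

```python
import numpy as np

rng = np.random.default_rng(3)
N_FEATURES, HIDDEN, N_TRANSITIONS = 12, 16, 4   # illustrative sizes

W1 = rng.normal(scale=0.1, size=(HIDDEN, N_FEATURES))
W2 = rng.normal(scale=0.1, size=(N_TRANSITIONS, HIDDEN))

def next_transition(features):
    # Score all transitions with a one-hidden-layer MLP and return the
    # argmax together with the softmax distribution.
    h = np.tanh(W1 @ features)
    scores = W2 @ h
    probs = np.exp(scores - scores.max())
    probs /= probs.sum()
    return int(np.argmax(probs)), probs

move, probs = next_transition(rng.normal(size=N_FEATURES))
```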

Experiments amp Dataset (MLP) CoNLL17

CoNLL17 Dataset

Dependency parsing of 81 treebanks in 49 languages

All treebanks use standardized annotation:
• 17 universal part-of-speech tags
• 37 universal dependency relations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 43 123

Experiments - Evaluation Metric

Labeled Attachment Score (LAS): the percentage of words correctly assigned both the correct syntactic head and the correct dependency label

Gold tree (Economic news had): arcs ATT, SBJ; LAS = 1
Pred 1: arcs PRED, OBJ; LAS = 0
Pred 2: arcs ATT, OBJ; LAS = (1/2) · 100 = 50

Omer Kırnap (Koc University) MSc Thesis September 27 2018 44 123
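The metric itself is a short function over (head, label) pairs; the toy trees reproduce the slide's example, where Pred 2 gets one of two words fully correct.

```python
def las(gold, pred):
    # Labeled Attachment Score: percentage of words whose predicted
    # (head, label) pair matches the gold tree exactly.
    # Trees are dicts: dependent -> (head, label).
    correct = sum(1 for w in gold if pred.get(w) == gold[w])
    return 100.0 * correct / len(gold)

gold  = {"Economic": ("news", "ATT"), "news": ("had", "SBJ")}
pred1 = {"Economic": ("news", "PRED"), "news": ("had", "OBJ")}
pred2 = {"Economic": ("news", "ATT"), "news": ("had", "OBJ")}
```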

Experiments (MLP)

CoNLL 2017 Results (all treebanks LAS)

Ranked 1st among transition based parsers 5

5 Source: CoNLL17 official results page

Omer Kırnap (Koc University) MSc Thesis September 27 2018 45 123

Contributions in CoNLL17

Omer Kırnap (Koc University) MSc Thesis September 27 2018 46 123

Context and Word Embeddings

Relative contributions of part-of-speech (p), word vector (v), and context vector (c):

Feats | Hungarian | En-ParTUT | Latvian
p | 63.6 | 76.6 | 55.9
v | 73.5 | 75.9 | 63
c | 72.2 | 76 | 63.5
v-c | 76 | 79 | 67.6
p-c | 78 | 82.5 | 70.6
p-v | 76.6 | 80.8 | 67.7
p-fb | 74.7 | 79.7 | 66.3
p-v-c | 79.3 | 83.2 | 74.2

Omer Kırnap (Koc University) MSc Thesis September 27 2018 47 123

Context and Word Embeddings

Relative contributions of part-of-speech (p), word vector (v), and context vector (c):

Feats | Hungarian | En-ParTUT | Latvian
p | 63.6 | 76.6 | 55.9
v | 73.5 | 75.9 | 63
c | 72.2 | 76 | 63.5
v-c | 76 | 79 | 67.6
p-c | 78 | 82.5 | 70.6
p-v | 76.6 | 80.8 | 67.7
p-fb | 74.7 | 79.7 | 66.3
p-v-c | 79.3 | 83.2 | 74.2

Context vectors provide an independent contribution on top of POS tags

Omer Kırnap (Koc University) MSc Thesis September 27 2018 48 123

Context and Word embeddings

Feats | Hungarian | En-ParTUT | Latvian
p | 63.6 | 76.6 | 55.9
v | 73.5 | 75.9 | 63
c | 72.2 | 76 | 63.5
v-c | 76 | 79 | 67.6
p-c | 78 | 82.5 | 70.6
p-v | 76.6 | 80.8 | 67.7
p-fb | 74.7 | 79.7 | 66.3
p-v-c | 79.3 | 83.2 | 74.2

Our BiLSTM language model word vectors perform better than FB vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 49 123

Context and Word embeddings

Feats | Hungarian | En-ParTUT | Latvian
p | 63.6 | 76.6 | 55.9
v | 73.5 | 75.9 | 63
c | 72.2 | 76 | 63.5
v-c | 76 | 79 | 67.6
p-c | 78 | 82.5 | 70.6
p-v | 76.6 | 80.8 | 67.7
p-fb | 74.7 | 79.7 | 66.3
p-v-c | 79.3 | 83.2 | 74.2

Both POS tags and context vectors have significant contributions on top of word vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 50 123

Issues with MLP

However

Choosing the correct state representation of the parser remains critical

We are unable to represent the whole parsing history with feature extraction

Omer Kırnap (Koc University) MSc Thesis September 27 2018 51 123

Solution

Find a recurrent architecture that can summarize the parsing history as well as the word sequences in the buffer and stack

Omer Kırnap (Koc University) MSc Thesis September 27 2018 52 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to Universal Dependencies:

CoNLL17

• Koc-University team with MLP Parser using Context Embeddings

CoNLL18

• KParse team with Tree-stack LSTM Parser using Context and Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 53 123

c Tree-stack LSTM Parser (CoNLL18)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 54 123

Related Work - Stack LSTM

Figure Stack LSTM [Dyer et al 2015]

Represent each component (σ, β, A) with an LSTM
Modify the head word's embedding with the dependent's embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 55 123

Problems with Stack LSTM

They only modify the stack's word embeddings

Hidden states of the LSTMs are not updated unless a reduce occurs

Actions are not explicitly represented

They only used word2vec embeddings [Mikolov et al 2013]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 56 123

Our solution

We propose:

Context embeddings should improve parsing accuracy

Dependency relations should be explicitly represented

Morphological Features of a word may enhance parsing accuracy

Omer Kırnap (Koc University) MSc Thesis September 27 2018 57 123

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having less than 20k training tokens

Lang code Morph-Feats no Morph-Feats of tokensno nynorsklia 5113 5333 3583

ru taiga 5832 6055 10479

sme giella 5278 5339 16385

la perseus 4993 516 18184

ug udt 5278 5339 19262

sl sst 4672 4877 19473

hu szeged 6623 6818 20166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having tokens in between 50k and100k

Lang code Morph-Feats no Morph-Feats of tokenssv lines 7218 7481 48325

fr sequoia 8436 8217 50543

en gum 7644 7534 53686

ko gsd 7374 7254 56687

eu bdt 7455 7332 72974

nl lassymal 767 758 75134

gl ctg 7902 79018 79327

lv lvtb 7233 7224 80666

id gsd 7576 7397 97531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having more than 100k trainingtokens

Lang code Morph-Feats no Morph-Feats of tokensfa seraji 8118 8112 121064

bg btb 8453 8455 124336

en ewt 7577 75682 204585

ar padt 6802 6814 223881

de gsd 7159 7132 263804

ca ancora 8589 85874 417587

es ancora 8499 8478 444617

cs cac 8357 8363 472608

cs pdt 8143 8212 1173282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle transitions using gold movesDynamic oracle transitions using predicted moves

In both cases logp of gold moves maximized

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning1 Using very limited data to train LM for word and context vectors and

use them to train a parser from scratch2 Using Facebookrsquos word vectors to train a parser [Bojanowski et al

2017]3 Using my own word and context vectors trained with different

language but from the same language family4 Applying transfer learning with a pre-trained parser

Language (1) (2) (3) (4)af afribooms not provided 7546 7743 7812kk ktb 2019 2231 2196 2386bxr bdt 764 976 993 898

kmr mg 2012 2257 2278 2339

Table LAS values for strategies (1) (2) (3) and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the mostbeneficial

From scratch LM training does not bring useful word and contextvectors

Our word and context vectors are still more useful than Facebookrsquos[Bojanowski et al 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition Based Parser can only build projective trees 6

6Figure fromhttpstplingfiluuse sarakurser5LN455-2014lectures5LN455-F8pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123

Projective vs Non-projective

We compared our model with the best model for different projectivityratios

Language Projectiviy Best (LAS) Our (LAS)

grc perseus 907 7939 5503 (20)

eu bdt 9513 8422 7413 (17)

hu szeged 978 8266 6818 (14)

da ddt 9826 8628 7640 (17)

en gum 996 8505 7644 (15)

gl treegal 100 7425 7045 (10)

gl ctg 100 8212 7945 (14)

Table Our models performance gap decreases as the projectivity ratio increases

7

7From official results page and our projectivity tableOmer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusionWe introduced ldquoContext Word and Morph-featrdquo embeddings and showedtheir contribution in transition based dependency parsing

Our Tree-stack LSTM outperformed MLP by removing hand-craftedfeature engineering

Tree-stack LSTM performed better with low resource languages

When the training dataset size increases tree-stack LSTM losses itsadvantage

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization morphological taggingand dependency parsing performed better Some are also jointly trained alanguage model together with pre-trained embeddings

Attention Mechanism

Applying attention in between σ-LSTM states or β-LSTM orAction-LSTM may bring performance improvement

Morphological Features

Finding different way to represent morphological features

Dynamic Oracle vs Beam Training

Although I tried both of them I could not obtain performanceimprovement There may be convergence problems with our loss functionand another losses (CRF) may solve this problem

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap Erenay Dayanık and Deniz Yuret 2018 Tree-stackLSTM in Transition Based Dependency Parsing In Proceedings ofthe CoNLL 2018 Shared Task Multilingual Parsing from Raw Text toUniversal Dependencies

Omer Kırnap Berkay Furkan Onder and Deniz Yuret 2017 Parsingwith Context Embeddings In Proceedings of the CoNLL 2017 SharedTask Multilingual Parsing from Raw Text to Universal Dependencies

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann Carlos Gomez-Rodriguez and Giorgio Satta 2011Dynamic programming algorithms for transition-based dependencyparsers In Proceedings of the 49th Annual Meeting of theAssociation for Computational Linguistics Human LanguageTechnologies-Volume 1 Association for Computational Linguisticspages 673682

S Kbler R McDonald and J Nivre 2009 Dependency parsingMorgan amp Claypool US

Chris Dyer Miguel Ballesteros Wang Ling Austin Matthews andNoah A Smith 2015 Transition based dependency parsing withstack long-short term memory CoRR abs150508075

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123


Overview

1 Introduction
   - Overview of Dependency Parsing
   - Transition Based Dependency Parsing

2 Related Work
   - Linear Models and their Drawbacks
   - Neural Network Models

3 Model
   - Language Model
   - MLP Parser
   - Tree-stack LSTM Parser

4 Results
   - MLP vs Tree-stack LSTM
   - Morphological Feature Embeddings
   - Static vs Dynamic Oracle Training
   - Transfer Learning

5 Conclusion

6 Future Work & Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 31 123

3 Model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 32 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to Universal Dependencies:

- CoNLL17: Koc-University team with MLP Parser using Context Embeddings
- CoNLL18: KParse team with Tree-stack LSTM Parser using Context and Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 33 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to Universal Dependencies:

- CoNLL17: Koc-University team with MLP Parser using Context Embeddings
- CoNLL18: KParse team with Tree-stack LSTM Parser using Context and Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 34 123

a Language Model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 35 123

Language Model (LM)

The LM is used to obtain Context and Word embeddings with two components:

Character Based LSTM extracts word vectors

Word Based BiLSTM extracts context vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 36 123

Language Model - Word vectors

Character based LSTM generates word Vectors

Figure Character LSTM from Kırnap et al 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 37 123

Language Model - Context Vectors

Word based BiLSTM generates Context Vectors

Figure Word BiLSTM from Kırnap et al 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 38 123

b MLP Parser (CoNLL17)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 39 123

MLP Parser

MLP Parser consists of 4 components

Character Based LSTM extracts word vectors

Word Based BiLSTM extracts context vectors

Feature extractor describes current state

Decision module (MLP) decides the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 40 123

MLP Parser - Feature Extraction

Feature extractor describes current state

Figure: Kırnap et al. 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 41 123

MLP Parser - Decision Module

Decision module (MLP) decides the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 42 123

Experiments amp Dataset (MLP) CoNLL17

CoNLL17 Dataset

Dependency parsing of 81 treebanks in 49 languages

All treebanks use standardized annotation:
- 17 universal part-of-speech tags
- 37 universal dependency relations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 43 123

Experiments - Evaluation Metric

Labeled Attachment Score (LAS)
The percentage of words correctly assigned both the correct syntactic head and the correct dependency label.

Example with "Economic news had" (two scored arcs):

- Gold tree (SBJ, ATT): LAS = 1
- Pred 1 (PRED, OBJ - both arcs wrong): LAS = 0
- Pred 2 (OBJ, ATT - 1 of 2 arcs correct): LAS = (1/2) · 100 = 50%

Omer Kırnap (Koc University) MSc Thesis September 27 2018 44 123
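The LAS metric above can be computed directly from head/label arcs. A minimal sketch (the `gold`/`pred` dictionaries mapping each dependent to its (head, label) arc are illustrative structures, not the official CoNLL evaluator):

```python
def las(gold, pred):
    """Labeled Attachment Score: percentage of dependents whose
    predicted head AND dependency label both match the gold arc."""
    correct = sum(1 for word, arc in gold.items() if pred.get(word) == arc)
    return 100.0 * correct / len(gold)

# Arcs for "Economic news had": dependent -> (head, label)
gold  = {"Economic": ("news", "ATT"), "news": ("had", "SBJ")}
pred1 = {"Economic": ("news", "OBJ"), "news": ("had", "PRED")}  # both arcs wrong
pred2 = {"Economic": ("news", "ATT"), "news": ("had", "OBJ")}   # 1 of 2 correct
```

Here `las(gold, pred1)` gives 0.0 and `las(gold, pred2)` gives 50.0, matching the slide's example.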

Experiments (MLP)

CoNLL 2017 Results (all treebanks LAS)

Ranked 1st among transition based parsers [5]

[5] Source: CoNLL17 official results page

Omer Kırnap (Koc University) MSc Thesis September 27 2018 45 123

Contributions in CoNLL17

Omer Kırnap (Koc University) MSc Thesis September 27 2018 46 123

Context and Word Embeddings

Relative contributions of part-of-speech (p), word vector (v), and context vector (c):

Feats   Hungarian  En-ParTUT  Latvian
p       63.6       76.6       55.9
v       73.5       75.9       63.0
c       72.2       76.0       63.5
v-c     76.0       79.0       67.6
p-c     78.0       82.5       70.6
p-v     76.6       80.8       67.7
p-fb    74.7       79.7       66.3
p-v-c   79.3       83.2       74.2

Omer Kırnap (Koc University) MSc Thesis September 27 2018 47 123

Context and Word Embeddings

Relative contributions of part-of-speech (p), word vector (v), and context vector (c):

Feats   Hungarian  En-ParTUT  Latvian
p       63.6       76.6       55.9
v       73.5       75.9       63.0
c       72.2       76.0       63.5
v-c     76.0       79.0       67.6
p-c     78.0       82.5       70.6
p-v     76.6       80.8       67.7
p-fb    74.7       79.7       66.3
p-v-c   79.3       83.2       74.2

Context vectors provide an independent contribution on top of POS tags

Omer Kırnap (Koc University) MSc Thesis September 27 2018 48 123

Context and Word embeddings

Feats   Hungarian  En-ParTUT  Latvian
p       63.6       76.6       55.9
v       73.5       75.9       63.0
c       72.2       76.0       63.5
v-c     76.0       79.0       67.6
p-c     78.0       82.5       70.6
p-v     76.6       80.8       67.7
p-fb    74.7       79.7       66.3
p-v-c   79.3       83.2       74.2

Our BiLSTM language model word vectors perform better than FB vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 49 123

Context and Word embeddings

Feats   Hungarian  En-ParTUT  Latvian
p       63.6       76.6       55.9
v       73.5       75.9       63.0
c       72.2       76.0       63.5
v-c     76.0       79.0       67.6
p-c     78.0       82.5       70.6
p-v     76.6       80.8       67.7
p-fb    74.7       79.7       66.3
p-v-c   79.3       83.2       74.2

Both POS tags and context vectors have significant contributions on top of word vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 50 123

Issues with MLP

However

Choosing the correct state of the parser still remains critical

We are unable to represent the whole parsing history with feature extraction

Omer Kırnap (Koc University) MSc Thesis September 27 2018 51 123

Solution

Find a recurrent architecture that can summarize the parsing history as well as the word sequences in the buffer and stack

Omer Kırnap (Koc University) MSc Thesis September 27 2018 52 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to Universal Dependencies:

- CoNLL17: Koc-University team with MLP Parser using Context Embeddings
- CoNLL18: KParse team with Tree-stack LSTM Parser using Context and Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 53 123

c Tree-stack LSTM Parser (CoNLL18)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 54 123

Related Work - Stack LSTM

Figure Stack LSTM [Dyer et al 2015]

Represent each component (σ, β, A) with an LSTM
Modify the head word's embedding with the dependent's embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 55 123

Problems with Stack LSTM

They only modify the stack's word embeddings

Hidden states of the LSTMs are not updated unless a reduce occurs

Actions are not explicitly represented

They only used word2vec embeddings [Mikolov et al. 2013]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 56 123

Our solution

We propose

Context embeddings should improve parsing accuracy

Dependency relations should be explicitly represented

Morphological Features of a word may enhance parsing accuracy

Omer Kırnap (Koc University) MSc Thesis September 27 2018 57 123

Tree-stack LSTM Overview

Figure: Tree-stack LSTM overview. The σ-LSTM, β-LSTM, and Action-LSTM hidden states are concatenated and fed to an MLP; the t-RNN combines the head word, dependent word, and dependency relation.

We propose the Tree-stack LSTM model with 4 components:

1. β-LSTM
2. σ-LSTM
3. Action-LSTM
4. Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 58 123

Tree-stack LSTM

Input Representation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 59 123

Input Representation

Action and Dependency Relation Embeddings

Every action is represented with a continuous vector

Every dependency relation is represented with a continuous vector

Omer Kırnap (Koc University) MSc Thesis September 27 2018 60 123

Input Representation

We do not include an explicit feature extractor. We initialize the word representation by concatenating:

Character Based LSTMrsquos word vectors

Word Based BiLSTMrsquos context vectors

Part-of-speech (POS) vectors

Morph-feat vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 61 123

Input Representation

Morp-feat Vectors

Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs IT It

Figure Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 62 123
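The morph-feat embedding above maps a UD feature string to a single vector. A minimal sketch: in the thesis these per-feature vectors are learned jointly with the parser, while here a deterministic hash-derived vector merely stands in for the learned lookup table so the example is self-contained:

```python
import hashlib

DIM = 8  # toy embedding size

def feat_vector(feat, dim=DIM):
    # Deterministic stand-in for a learned embedding of one
    # key=value morphological feature (hypothetical, for illustration).
    digest = hashlib.md5(feat.encode("utf-8")).digest()
    return [b / 255.0 for b in digest[:dim]]

def morph_feat_embedding(feats, dim=DIM):
    """Embed a UD feature string such as
    'Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs'
    by averaging the vectors of its key=value parts."""
    pairs = [] if feats == "_" else feats.split("|")
    if not pairs:
        return [0.0] * dim
    vectors = [feat_vector(p, dim) for p in pairs]
    return [sum(col) / len(vectors) for col in zip(*vectors)]
```

Averaging keeps the embedding size fixed no matter how many features a word carries; the `_` value (no features in CoNLL-U) maps to a zero vector.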

Tree-stack LSTM

Model Components1 β-LSTM2 σ-LSTM3 Action-LSTM4 Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 63 123

β-LSTM

Figure: the β-LSTM within the tree-stack LSTM architecture.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 64 123

β-LSTM

Figure: Buffer's β-LSTM running over the buffer words w_i, w_i+1, w_i+2.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 65 123

σ-LSTM

Figure: the σ-LSTM within the tree-stack LSTM architecture.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 66 123

σ-LSTM

Figure: Stack's σ-LSTM running over the stack words s_i, s_i+1, s_i+2.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 67 123

Action-LSTM

Figure: the Action-LSTM within the tree-stack LSTM architecture.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 68 123

Action-LSTM

Figure: Action-LSTM running over the transition history.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 69 123

How are the components of tree-stack LSTM connected?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 70 123

Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 71 123

Tree-RNN (t-RNN)

Figure: t-RNN combining the dependent word, dependency relation, and head word.

w_head^new = tanh(W_rnn · [w_head^old; d_l; w_dep] + b_rnn)    (1)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 72 123
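Equation (1) above can be sketched numerically. The weight values here are random placeholders rather than trained parameters; only the shape of the computation matters:

```python
import math
import random

random.seed(0)
DIM = 4  # toy embedding size

# W_rnn maps the concatenation [head; relation; dependent] (3*DIM) back to DIM.
W_rnn = [[random.uniform(-0.1, 0.1) for _ in range(3 * DIM)] for _ in range(DIM)]
b_rnn = [0.0] * DIM

def t_rnn(w_head, d_l, w_dep):
    """Eq. (1): new head embedding = tanh(W_rnn * [head; rel; dep] + b_rnn)."""
    x = w_head + d_l + w_dep  # list concatenation = vector concatenation
    return [math.tanh(sum(w * xi for w, xi in zip(row, x)) + b)
            for row, b in zip(W_rnn, b_rnn)]
```

After a left or right transition the composed vector replaces the head word's embedding, so a stack entry comes to summarize its whole subtree.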

Tree-RNN with

1 Left Transition2 Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 73 123

Left Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 74 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: Each embedding is initiated by concatenating POS, language, and morph-feat embeddings.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 75 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: Stack's top LSTM is reduced.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 76 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: t-RNN calculates the new head embedding.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 77 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: β-LSTM recalculates its hidden state based on the new input.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 78 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: Tree-stack LSTM is ready for the next transition.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 79 123

Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 80 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: Each embedding is initiated by concatenating POS, language, and morph-feat embeddings.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 81 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: Stack's top LSTM is reduced.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 82 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: t-RNN calculates the new head embedding.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 83 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: σ-LSTM recalculates its hidden state from the new input.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 84 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: Tree-stack LSTM is ready for the next transition.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 85 123
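The left and right transitions above can be sketched as pure functions on a parser state (stack σ, buffer β, arc set A). This toy version tracks words as strings and omits the LSTM updates; it only illustrates the state changes the formulas describe:

```python
def shift(state):
    stack, buffer, arcs = state
    return (stack + [buffer[0]], buffer[1:], arcs)

def left(state, d):
    # left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)}):
    # the buffer front b becomes the head of the stack top s.
    stack, buffer, arcs = state
    return (stack[:-1], buffer, arcs | {(buffer[0], d, stack[-1])})

def right(state, d):
    # right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)}):
    # the second stack item s becomes the head of the stack top t.
    stack, buffer, arcs = state
    return (stack[:-1], buffer, arcs | {(stack[-2], d, stack[-1])})

# Parse "Economic news had" with gold arcs news->Economic (ATT), had->news (SBJ).
state = ([], ["Economic", "news", "had"], set())
for op in [shift, lambda s: left(s, "ATT"),
           shift, lambda s: left(s, "SBJ"), shift]:
    state = op(state)
```

After this transition sequence the buffer is empty, only the root word "had" remains on the stack, and A holds the two gold arcs.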

Final overview of Tree-stack LSTM

Figure: Final overview of the Tree-stack LSTM architecture.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 86 123

Overview

1 Introduction
   - Overview of Dependency Parsing
   - Transition Based Dependency Parsing

2 Related Work
   - Linear Models and their Drawbacks
   - Neural Network Models

3 Model
   - Language Model
   - MLP Parser
   - Tree-stack LSTM Parser

4 Results
   - MLP vs Tree-stack LSTM
   - Morphological Feature Embeddings
   - Static vs Dynamic Oracle Training
   - Transfer Learning

5 Conclusion

6 Future Work & Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 87 123

4 Results & Comparisons

Omer Kırnap (Koc University) MSc Thesis September 27 2018 88 123

Results & Comparisons

Dataset

CoNLL17:
- Dependency parsing of 81 treebanks in 49 languages
- All treebanks use standardized annotation: 17 universal part-of-speech tags and 37 universal dependency relations
- Koc-University ranked 7th out of 33 participants (1st among transition based parsers)

CoNLL18:
- Dependency parsing of 82 treebanks in 57 languages
- All treebanks use standardized annotation: 17 universal part-of-speech tags and 37 universal dependency relations
- Koc-University ranked 16th out of 30 participants (2nd among transition based parsers)

Differences between CoNLL17 and CoNLL18: 1. Train/test split change, 2. Annotation changes

Omer Kırnap (Koc University) MSc Thesis September 27 2018 89 123

MLP vs Tree-stack LSTM

The CoNLL 2018 committee released comparison results of CoNLL17 and CoNLL18 systems tested on the same test sets

Omer Kırnap (Koc University) MSc Thesis September 27 2018 90 123

MLP vs Tree-stack LSTM

Two possible problems with the official comparison:

1 If the annotation of the treebank has been improved, the older parser is handicapped

2 If the training-test split has changed and the old training data are now in the test data, the old parser is favored undeservedly

Omer Kırnap (Koc University) MSc Thesis September 27 2018 91 123

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code        MLP    Tree-stack
ru_taiga (10k)   58.89  60.55
hu_szeged (20k)  66.21  68.18
tr_imst (50k)    56.78  58.75
ar_padt (120k)   67.83  68.14
en_ewt (205k)    74.87  75.77
cs_cac (473k)    83.39  83.57

Tree-stack LSTM outperforms MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 92 123

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 93 123

MLP Parser

MLP

Figure Initial model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 94 123

Only Action LSTM


Figure Only action LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 95 123

Only β-LSTM


Figure Only β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 96 123

Only σ-LSTM


Figure Only σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang Code  MLP    Only Action  Only-β  Only-σ
hu_szeged  66.21  66.87        66.94   67.03
sv_lines   71.12  72.05        72.17   72.45
tr_imst    57.12  56.87        57.02   57.12
ar_padt    67.83  66.67        66.89   66.92
cs_cac     83.89  82.23        83.13   83.17
en_ewt     75.54  75.43        75.56   75.67

Table: Comparison between MLP and "Only" models

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN

Figure: Tree-stack LSTM architecture, with the t-RNN connecting the head word, dependent word, and dependency relation.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 99 123

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN:

Lang Code           without t-RNN  with t-RNN
no_nynorsklia (3k)  51.78          53.33
ru_taiga (11k)      59.13          60.55
gl_treegal (15k)    69.76          70.45
hu_szeged (20k)     66.12          68.18
sv_lines (49k)      74.04          75.46
tr_imst (50k)       58.12          58.75
ar_padt (120k)      68.04          68.14
en_ewt (204k)       74.87          75.77
cs_cac (473k)       82.89          83.57
cs_pdt (1M)         81.17          81.164

t-RNN provides a comparative advantage for low-resource languages

Omer Kırnap (Koc University) MSc Thesis September 27 2018 100 123

Ablation Analysis

Overall results of ablation analysis:

Lang       MLP    Only A  Only-β  Only-σ  w/o t-RNN  all
hu_szeged  66.21  66.87   66.94   67.03   66.12      68.18
sv_lines   71.12  72.05   72.17   74.04   72.17      75.46
tr_imst    57.12  56.87   57.02   57.12   58.12      58.75
ar_padt    67.83  66.67   66.89   66.92   68.04      68.14
cs_cac     83.89  82.23   83.13   83.17   82.89      83.57
en_ewt     75.54  75.43   75.56   75.67   74.87      75.77

Tree-stack LSTM beats other model variations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of Ablation Experiments

t-RNN's performance contribution increases as the training size decreases

σ-LSTM provides more useful information, independent of dataset size

Interconnecting the model's components with t-RNN makes tree-stack LSTM more powerful for low-resource languages (ranked 10th overall and 2nd among transition based parsers)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What does Morphological Feature Embedding provide

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental Settings
We divide the CoNLL18 UD (v2.2) dataset into 4 parts based on the number of training tokens for each language to better understand our contributions:

Languages having less than 20k tokens

Languages having more than 20k less than 50k tokens

Languages having more than 50k less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having less than 20k training tokens:

Lang code      Morph-Feats  no Morph-Feats  # of tokens
no_nynorsklia  51.13        53.33           3,583
ru_taiga       58.32        60.55           10,479
sme_giella     52.78        53.39           16,385
la_perseus     49.93        51.6            18,184
ug_udt         52.78        53.39           19,262
sl_sst         46.72        48.77           19,473
hu_szeged      66.23        68.18           20,166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having between 50k and 100k training tokens:

Lang code      Morph-Feats  no Morph-Feats  # of tokens
sv_lines       72.18        74.81           48,325
fr_sequoia     84.36        82.17           50,543
en_gum         76.44        75.34           53,686
ko_gsd         73.74        72.54           56,687
eu_bdt         74.55        73.32           72,974
nl_lassysmall  76.7         75.8            75,134
gl_ctg         79.02        79.018          79,327
lv_lvtb        72.33        72.24           80,666
id_gsd         75.76        73.97           97,531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having more than 100k training tokens:

Lang code  Morph-Feats  no Morph-Feats  # of tokens
fa_seraji  81.18        81.12           121,064
bg_btb     84.53        84.55           124,336
en_ewt     75.77        75.682          204,585
ar_padt    68.02        68.14           223,881
de_gsd     71.59        71.32           263,804
ca_ancora  85.89        85.874          417,587
es_ancora  84.99        84.78           444,617
cs_cac     83.57        83.63           472,608
cs_pdt     81.43        82.12           1,173,282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle: transitions follow gold moves
Dynamic oracle: transitions follow predicted moves

In both cases, the log-probability of gold moves is maximized

Figure: Tree-stack LSTM architecture.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123
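The static/dynamic distinction above is purely about which move advances the parser during training. A minimal control-flow sketch, where the constant `score` function and the integer state update are placeholders for the real tree-stack LSTM scorer and parser state:

```python
import math

def train_step(gold_moves, score, dynamic):
    """Accumulate -log p(gold move) along one sentence.
    Static oracle: the parser follows the gold move.
    Dynamic oracle: the parser follows its own best-scoring move,
    but the loss still targets the gold move."""
    state, loss = 0, 0.0
    for gold in gold_moves:
        probs = score(state)                    # distribution over moves
        loss += -math.log(probs[gold])          # always maximize gold log-prob
        move = max(probs, key=probs.get) if dynamic else gold
        state = hash((state, move)) % 97        # placeholder state update
    return loss

# Hypothetical constant scorer, for illustration only.
uniform = lambda state: {"SHIFT": 0.5, "LEFT": 0.3, "RIGHT": 0.2}
```

With a real model the two regimes visit different states, so dynamic training exposes the parser to its own mistakes; with this constant scorer both regimes produce the same loss.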

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning:

1. Using very limited data to train an LM for word and context vectors, and using them to train a parser from scratch
2. Using Facebook's word vectors to train a parser [Bojanowski et al. 2017]
3. Using my own word and context vectors trained on a different language from the same language family
4. Applying transfer learning with a pre-trained parser

Language      (1)           (2)    (3)    (4)
af_afribooms  not provided  75.46  77.43  78.12
kk_ktb        20.19         22.31  21.96  23.86
bxr_bdt       7.64          9.76   9.93   8.98
kmr_mg        20.12         22.57  22.78  23.39

Table: LAS values for strategies (1), (2), (3), and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the most beneficial

Training an LM from scratch does not bring useful word and context vectors

Our word and context vectors are still more useful than Facebook's [Bojanowski et al. 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition based parsers can only build projective trees [6]

[6] Figure from https://stp.lingfil.uu.se/~sara/kurser/5LN455-2014/lectures/5LN455-F8.pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123
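Projectivity can be checked by looking for crossing arcs. A small sketch using 1-based head indices, with 0 denoting the root:

```python
def is_projective(heads):
    """heads[i-1] is the head of word i (1-based); 0 denotes the root.
    A dependency tree is projective iff no two arcs cross when drawn
    above the sentence."""
    # Normalize each arc to an (left, right) interval over positions.
    arcs = [(min(h, d), max(h, d)) for d, h in enumerate(heads, start=1)]
    for i, (l1, r1) in enumerate(arcs):
        for l2, r2 in arcs[i + 1:]:
            # Two arcs cross iff exactly one endpoint of one arc
            # lies strictly inside the other arc's interval.
            if l1 < l2 < r1 < r2 or l2 < l1 < r2 < r1:
                return False
    return True
```

For example, heads [2, 3, 0] ("Economic news had") are projective, while a tree containing the crossing arcs (1,3) and (2,4) is not; the transition systems above cannot produce the latter.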

Projective vs Non-projective

We compared our model with the best model for different projectivity ratios:

Language     Projectivity (%)  Best (LAS)  Our (LAS)
grc_perseus  90.7              79.39       55.03 (20)
eu_bdt       95.13             84.22       74.13 (17)
hu_szeged    97.8              82.66       68.18 (14)
da_ddt       98.26             86.28       76.40 (17)
en_gum       99.6              85.05       76.44 (15)
gl_treegal   100               74.25       70.45 (10)
gl_ctg       100               82.12       79.45 (14)

Table: Our model's performance gap decreases as the projectivity ratio increases [7]

[7] From the official results page and our projectivity table

Omer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusion:
We introduced "Context, Word, and Morph-feat" embeddings and showed their contribution in transition based dependency parsing.

Our Tree-stack LSTM outperformed the MLP by removing hand-crafted feature engineering

Tree-stack LSTM performed better on low resource languages

When the training dataset size increases, tree-stack LSTM loses its advantage

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization, morphological tagging, and dependency parsing performed better. Some also jointly trained a language model together with pre-trained embeddings.

Attention Mechanism

Applying attention between the σ-LSTM, β-LSTM, or Action-LSTM states may bring a performance improvement.

Morphological Features

Finding different ways to represent morphological features.

Dynamic Oracle vs Beam Training

Although I tried both of them, I could not obtain a performance improvement. There may be convergence problems with our loss function, and other losses (e.g., CRF) may solve this problem.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap, Erenay Dayanık, and Deniz Yuret. 2018. Tree-stack LSTM in Transition Based Dependency Parsing. In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap, Berkay Furkan Onder, and Deniz Yuret. 2017. Parsing with Context Embeddings. In Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann, Carlos Gomez-Rodriguez, and Giorgio Satta. 2011. Dynamic programming algorithms for transition-based dependency parsers. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Volume 1. Association for Computational Linguistics, pages 673–682.

S. Kübler, R. McDonald, and J. Nivre. 2009. Dependency Parsing. Morgan & Claypool, US.

Chris Dyer, Miguel Ballesteros, Wang Ling, Austin Matthews, and Noah A. Smith. 2015. Transition-based dependency parsing with stack long short-term memory. CoRR abs/1505.08075.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123

Page 32: Transition Based Dependency Parsing with Deep Learning

3 Model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 32 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to Universal Dependencies

CoNLL17
• Koc-University team with MLP Parser using Context Embeddings

CoNLL18
• KParse team with Tree-stack LSTM Parser using Context and Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 33 123


a Language Model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 35 123

Language Model (LM)

LM is used to obtain Context and Word embeddings with two components

Character Based LSTM extracts word vectors

Word Based BiLSTM extracts context vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 36 123

Language Model - Word vectors

Character based LSTM generates word Vectors

Figure Character LSTM from Kırnap et al 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 37 123
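To make the character-based word-vector idea concrete, here is a tiny sketch with a plain RNN cell standing in for the LSTM, using random weights (all sizes and names are illustrative assumptions, not the thesis's actual model):

```python
import numpy as np

rng = np.random.default_rng(0)
CHARS = "abcdefghijklmnopqrstuvwxyz"
EMB, HID = 8, 16                           # illustrative sizes
E  = rng.normal(size=(len(CHARS), EMB))    # one embedding per character
Wx = rng.normal(size=(HID, EMB)) * 0.1
Wh = rng.normal(size=(HID, HID)) * 0.1
b  = np.zeros(HID)

def word_vector(word):
    """Run the RNN over the word's characters; the final hidden state
    is a fixed-size vector for the whole word."""
    h = np.zeros(HID)
    for ch in word:
        h = np.tanh(Wx @ E[CHARS.index(ch)] + Wh @ h + b)
    return h
```

Different character sequences yield different final states, which is what lets the parser handle unknown words.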

Language Model - Context Vectors

Word based BiLSTM generates Context Vectors

Figure Word BiLSTM from Kırnap et al 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 38 123

b MLP Parser (CoNLL17)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 39 123

MLP Parser

MLP Parser consists of 4 components

Character Based LSTM extracts word vectors

Word Based BiLSTM extracts context vectors

Feature extractor describes current state

Decision module (MLP) decides the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 40 123

MLP Parser - Feature Extraction

Feature extractor describes current state

Figure: Kırnap et al. 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 41 / 123

MLP Parser - Decision Module

Decision module (MLP) decides the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 42 123

Experiments amp Dataset (MLP) CoNLL17

CoNLL17 Dataset

Dependency parsing of 81 treebanks in 49 languages

All treebanks use standardized annotation
• 17 universal part-of-speech tags
• 37 universal dependency relations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 43 123

Experiments - Evaluation Metric

Labeled Attachment Score (LAS)
The percentage of words correctly assigned both the correct syntactic head and the correct dependency label

Example, "Economic news had":
Gold tree (arcs ATT, SBJ): reference
Pred 1 (arcs PRED, OBJ): LAS 0
Pred 2 (arcs ATT, OBJ): LAS (1/2) × 100 = 50

Omer Kırnap (Koc University) MSc Thesis September 27 2018 44 123
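The LAS computation above can be written directly; `gold`/`pred` pair each token with its (head, label), and the arc labels below just mirror the slide's example:

```python
def las(gold, pred):
    """LAS: percentage of words with both the correct head and the correct label."""
    correct = sum(1 for g, p in zip(gold, pred) if g == p)
    return 100.0 * correct / len(gold)

# "Economic news had" with gold arcs ATT(Economic<-news), SBJ(news<-had)
gold  = [(2, "ATT"), (3, "SBJ")]
pred1 = [(3, "PRED"), (3, "OBJ")]   # both tokens wrong -> LAS 0
pred2 = [(2, "ATT"), (3, "OBJ")]    # one of two tokens fully correct -> LAS 50
```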

Experiments (MLP)

CoNLL 2017 Results (all treebanks LAS)

Ranked 1st among transition based parsers5

5 Source: CoNLL17 official results page

Omer Kırnap (Koc University) MSc Thesis September 27 2018 45 / 123

Contributions in CoNLL17

Omer Kırnap (Koc University) MSc Thesis September 27 2018 46 123

Context and Word Embeddings

Relative contributions of part-of-speech (p), word vector (v), context vector (c)

Feats   Hungarian   En-ParTUT   Latvian
p       63.6        76.6        55.9
v       73.5        75.9        63
c       72.2        76          63.5
v-c     76          79          67.6
p-c     78          82.5        70.6
p-v     76.6        80.8        67.7
p-fb    74.7        79.7        66.3
p-v-c   79.3        83.2        74.2

Omer Kırnap (Koc University) MSc Thesis September 27 2018 47 123

Context and Word Embeddings

Relative contributions of part-of-speech (p), word vector (v), context vector (c):

Context vectors provide an independent contribution on top of POS tags

Omer Kırnap (Koc University) MSc Thesis September 27 2018 48 123

Context and Word embeddings

Our BiLSTM language model word vectors perform better than FB vectors (compare p-v and p-fb)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 49 123

Context and Word embeddings

Both POS tags and context vectors have significant contributions on top of word vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 50 123

Issues with MLP

However

Choosing the correct parser state representation still remains critical

We are unable to represent the whole parsing history with feature extraction

Omer Kırnap (Koc University) MSc Thesis September 27 2018 51 123

Solution

Find a recurrent architecture that can summarize the parsing history as well as the word sequences in the buffer and stack

Omer Kırnap (Koc University) MSc Thesis September 27 2018 52 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to Universal Dependencies

CoNLL17

• Koc-University team with MLP Parser using Context Embeddings

CoNLL18

• KParse team with Tree-stack LSTM Parser using Context and Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 53 123

c Tree-stack LSTM Parser (CoNLL18)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 54 123

Related Work - Stack LSTM

Figure: Stack LSTM [Dyer et al., 2015]

Represent each component (σ, β, A) with an LSTM
Modify the head word's embedding with the dependent's embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 55 123

Problems with Stack LSTM

They only modify the stack's word embeddings

Hidden states of the LSTMs are not updated unless a reduce occurs

Actions are not explicitly represented

They only used word2vec embeddings [Mikolov et al., 2013]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 56 123

Our solution

We propose

Context embeddings should improve parsing accuracy

Dependency relations should be explicitly represented

Morphological Features of a word may enhance parsing accuracy

Omer Kırnap (Koc University) MSc Thesis September 27 2018 57 123

Tree-stack LSTM Overview

Figure: Tree-stack LSTM architecture (σ-LSTM, β-LSTM, and Action-LSTM feed a concatenation layer and an MLP; t-RNN composes head word, dependent word, and dependency relation)

We propose the Tree-stack LSTM model with 4 components:
1. β-LSTM
2. σ-LSTM
3. Action-LSTM
4. Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 58 123

Tree-stack LSTM

Input Representation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 59 123

Input Representation

Action and Dependency Relation Embeddings

Every action is represented with a continuous vector

Every dependency relation is represented with a continuous vector

Omer Kırnap (Koc University) MSc Thesis September 27 2018 60 123

Input Representation

We do not include an explicit feature extractor. We initialize the word representation by concatenating:

Character Based LSTM's word vectors

Word Based BiLSTM's context vectors

Part-of-speech (POS) vectors

Morph-feat vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 61 123
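The token representation is just a concatenation of the four vectors; a sketch with made-up dimensions (the thesis's actual sizes are not given on this slide):

```python
import numpy as np

word_vec    = np.zeros(350)   # character-LSTM word vector
context_vec = np.zeros(300)   # word-BiLSTM context vector
pos_vec     = np.zeros(128)   # POS embedding
morph_vec   = np.zeros(128)   # morph-feat embedding

# the parser sees one flat vector per token
token_repr = np.concatenate([word_vec, context_vec, pos_vec, morph_vec])
```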

Input Representation

Morph-feat Vectors

Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs IT It

Figure Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 62 123
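One plausible way to embed a FEATS string like the one above is to give each `Feature=Value` pair its own vector and sum them; this is a hedged sketch (the exact composition used in the thesis is not specified on this slide):

```python
import numpy as np

rng = np.random.default_rng(1)
DIM = 32
emb = {}   # one vector per "Feature=Value" pair, created on first use

def feat_embedding(pair):
    if pair not in emb:
        emb[pair] = rng.normal(size=DIM)
    return emb[pair]

def morph_feat_vector(feats):
    """'Case=Nom|Gender=Neut|...' -> sum of per-feature embeddings;
    '_' (no features) -> zero vector."""
    if feats == "_":
        return np.zeros(DIM)
    return sum(feat_embedding(p) for p in feats.split("|"))

v = morph_feat_vector("Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs")
```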

Tree-stack LSTM

Model Components
1. β-LSTM
2. σ-LSTM
3. Action-LSTM
4. Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 63 123

β-LSTM

Figure: Tree-stack LSTM architecture diagram (β-LSTM component)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 64 123

β-LSTM

Figure: Buffer's β-LSTM over the buffer words w_i, w_i+1, w_i+2

Omer Kırnap (Koc University) MSc Thesis September 27 2018 65 123

σ-LSTM

Figure: Tree-stack LSTM architecture diagram (σ-LSTM component)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 66 123

σ-LSTM

Figure: Stack's σ-LSTM over the stack items s_i, s_i+1, s_i+2

Omer Kırnap (Koc University) MSc Thesis September 27 2018 67 123

Action-LSTM

Figure: Tree-stack LSTM architecture diagram (Action-LSTM component)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 68 123

Action-LSTM

Figure: Action-LSTM over the transition history

Omer Kırnap (Koc University) MSc Thesis September 27 2018 69 123

How are the components of tree-stack LSTM connected?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 70 123

Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 71 123

Tree-RNN (t-RNN)

Figure: t-RNN combines the dependent word, dependency relation, and head word

w_head_new = tanh(W_rnn · [w_head_old ; d_l ; w_dep] + b_rnn)    (1)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 72 123
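Equation (1) maps the concatenation of the old head embedding, the dependency-relation embedding d_l, and the dependent embedding to a new head embedding. A direct NumPy rendering with illustrative sizes and random weights:

```python
import numpy as np

rng = np.random.default_rng(2)
D = 16                                       # embedding size (illustrative)
W_rnn = rng.normal(size=(D, 3 * D)) * 0.1    # maps [head; d_l; dep] -> new head
b_rnn = np.zeros(D)

def t_rnn(w_head, d_l, w_dep):
    """Eq. (1): compose head, relation, and dependent into the new head."""
    return np.tanh(W_rnn @ np.concatenate([w_head, d_l, w_dep]) + b_rnn)

new_head = t_rnn(rng.normal(size=D), rng.normal(size=D), rng.normal(size=D))
```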

Tree-RNN with

1. Left Transition
2. Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 73 123

Left Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 74 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ (b, d, s))

Figure: Each embedding is initialized by concatenating POS, language, and morph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 75 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ (b, d, s))

Figure: Stack's top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 76 / 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ (b, d, s))

Figure: t-RNN calculates the new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 77 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ (b, d, s))

Figure: β-LSTM recalculates its hidden state based on the new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 78 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ (b, d, s))

Figure: Tree-stack LSTM is ready for the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 79 123

Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 80 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ (s, d, t))

Figure: Each embedding is initialized by concatenating POS, language, and morph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 81 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ (s, d, t))

Figure: Stack's top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 82 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ (s, d, t))

Figure: t-RNN calculates the new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 83 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ (s, d, t))

Figure: σ-LSTM recalculates its hidden state from the new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 84 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ (s, d, t))

Figure: Tree-stack LSTM is ready for the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 85 123
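The two transition rules above can be sketched on plain Python lists (stack top is the last element; arc triples are (head, relation, dependent); all names are illustrative):

```python
def left_arc(sigma, beta, arcs, d):
    """left_d: (sigma|s, b|beta, A) -> (sigma, b|beta, A U {(b, d, s)})"""
    s = sigma.pop()          # dependent comes off the stack
    b = beta[0]              # its head is the front of the buffer
    arcs.append((b, d, s))
    return sigma, beta, arcs

def right_arc(sigma, beta, arcs, d):
    """right_d: (sigma|s|t, beta, A) -> (sigma|s, beta, A U {(s, d, t)})"""
    t = sigma.pop()          # dependent is the stack top
    s = sigma[-1]            # its head sits just below it on the stack
    arcs.append((s, d, t))
    return sigma, beta, arcs
```

In the full model, each transition also triggers the t-RNN update of the head embedding and a recomputation of the affected LSTM hidden states.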

Final overview of Tree-stack LSTM

Figure: Full Tree-stack LSTM architecture (σ-LSTM, β-LSTM, and Action-LSTM feed a concatenation layer and an MLP; t-RNN updates head embeddings)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 86 123

Overview

1 Introduction
   Overview of Dependency Parsing
   Transition Based Dependency Parsing

2 Related Work
   Linear Models and their Drawbacks
   Neural Network Models

3 Model
   Language Model
   MLP Parser
   Tree-stack LSTM Parser

4 Results
   MLP vs Tree-stack LSTM
   Morphological Feature Embeddings
   Static vs Dynamic Oracle Training
   Transfer Learning

5 Conclusion

6 Future Work & Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 87 123

4 Results & Comparisons

Omer Kırnap (Koc University) MSc Thesis September 27 2018 88 123

Results & Comparisons

Dataset

CoNLL17:
• Dependency parsing of 81 treebanks in 49 languages
• All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations
• Koc-University ranked 7th out of 33 participants (1st among transition based parsers)

CoNLL18:
• Dependency parsing of 82 treebanks in 57 languages
• All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations
• Koc-University ranked 16th out of 30 participants (2nd among transition based parsers)

Changes from CoNLL17 to CoNLL18: 1. Train/test split change 2. Annotation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 89 123

MLP vs Tree-stack LSTM

The CoNLL 2018 committee released comparison results of CoNLL17 and CoNLL18 systems tested under the same test sets

Omer Kırnap (Koc University) MSc Thesis September 27 2018 90 123

MLP vs Tree-stack LSTM

2 possible problems of the official comparison:

1. If the annotation of the treebank is improved, the older parser is handicapped

2. If the training-test split has changed and the old training data are now in the test data, the old parser is favored undeservedly

Omer Kırnap (Koc University) MSc Thesis September 27 2018 91 123

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code         MLP     Tree-stack
ru taiga (10k)    58.89   60.55
hu szeged (20k)   66.21   68.18
tr imst (50k)     56.78   58.75
ar padt (120k)    67.83   68.14
en ewt (205k)     74.87   75.77
cs cac (473k)     83.39   83.57

Tree-stack LSTM outperforms MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 92 123

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 93 123

MLP Parser

MLP

Figure Initial model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 94 123

Only Action LSTM

LSTM LSTM

Figure Only action LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 95 123

Only β-LSTM

LSTM LSTM LSTM

LSTM LSTM MLP

Figure Only β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 96 123

Only σ-LSTM

LSTM LSTM

LSTM LSTM MLP

Figure Only σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang Code   MLP     Only Action   Only-β   Only-σ
hu szeged   66.21   66.87         66.94    67.03
sv lines    71.12   72.05         72.17    72.45
tr imst     57.12   56.87         57.02    57.12
ar padt     67.83   66.67         66.89    66.92
cs cac      83.89   82.23         83.13    83.17
en ewt      75.54   75.43         75.56    75.67

Table: Comparison between MLP and "Only" models

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN

Figure: Tree-stack LSTM architecture diagram (t-RNN component)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 99 123

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang Code            without t-RNN   with t-RNN
no nynorsklia (3k)   51.78           53.33
ru taiga (11k)       59.13           60.55
gl treegal (15k)     69.76           70.45
hu szeged (20k)      66.12           68.18
sv lines (49k)       74.04           75.46
tr imst (50k)        58.12           58.75
ar padt (120k)       68.04           68.14
en ewt (204k)        74.87           75.77
cs cac (473k)        82.89           83.57
cs pdt (1M)          81.17           81.164

t-RNN provides a comparative advantage for low-resource languages

Omer Kırnap (Koc University) MSc Thesis September 27 2018 100 123

Ablation Analysis

Overall results of ablation analysis

Lang        MLP     Only A   Only-β   Only-σ   w/o t-RNN   all
hu szeged   66.21   66.87    66.94    67.03    66.12       68.18
sv lines    71.12   72.05    72.17    74.04    72.17       75.46
tr imst     57.12   56.87    57.02    57.12    58.12       58.75
ar padt     67.83   66.67    66.89    66.92    68.04       68.14
cs cac      83.89   82.23    83.13    83.17    82.89       83.57
en ewt      75.54   75.43    75.56    75.67    74.87       75.77

Tree-stack LSTM beats other model variations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of Ablation Experiments

t-RNN's performance contribution increases when the training size decreases

σ-LSTM provides more useful information independent of dataset size

Interconnecting the model's components with t-RNN makes tree-stack LSTM more powerful for low-resource languages (ranked 10th of all and 2nd among transition based parsers)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What does Morphological Feature Embedding provide

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental Settings
We divide the CoNLL18 UD dataset (v2.2) into 4 parts based on the number of training tokens for each language, to better understand our contributions

Languages having less than 20k tokens

Languages having more than 20k less than 50k tokens

Languages having more than 50k less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having less than 20k training tokens

Lang code       Morph-Feats   no Morph-Feats   # of tokens
no nynorsklia   51.13         53.33            3583
ru taiga        58.32         60.55            10479
sme giella      52.78         53.39            16385
la perseus      49.93         51.6             18184
ug udt          52.78         53.39            19262
sl sst          46.72         48.77            19473
hu szeged       66.23         68.18            20166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having between 50k and 100k training tokens

Lang code       Morph-Feats   no Morph-Feats   # of tokens
sv lines        72.18         74.81            48325
fr sequoia      84.36         82.17            50543
en gum          76.44         75.34            53686
ko gsd          73.74         72.54            56687
eu bdt          74.55         73.32            72974
nl lassysmall   76.7          75.8             75134
gl ctg          79.02         79.018           79327
lv lvtb         72.33         72.24            80666
id gsd          75.76         73.97            97531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having more than 100k training tokens

Lang code   Morph-Feats   no Morph-Feats   # of tokens
fa seraji   81.18         81.12            121064
bg btb      84.53         84.55            124336
en ewt      75.77         75.682           204585
ar padt     68.02         68.14            223881
de gsd      71.59         71.32            263804
ca ancora   85.89         85.874           417587
es ancora   84.99         84.78            444617
cs cac      83.57         83.63            472608
cs pdt      81.43         82.12            1173282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle: transitions using gold moves
Dynamic oracle: transitions using predicted moves

In both cases, log p of the gold moves is maximized

Figure: Tree-stack LSTM architecture diagram

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123
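The static/dynamic distinction only changes which move advances the parser; the loss always targets the gold move. A toy sketch (function and variable names are hypothetical):

```python
import math

def oracle_step(probs, gold_move, dynamic):
    """probs: transition -> model probability. The loss is -log p(gold) in
    both regimes; a static oracle advances with the gold move, a dynamic
    oracle advances with the model's own best-scoring move."""
    loss = -math.log(probs[gold_move])
    follow = max(probs, key=probs.get) if dynamic else gold_move
    return loss, follow

probs = {"shift": 0.5, "left": 0.2, "right": 0.3}
```

With a dynamic oracle the parser is exposed to its own mistakes during training, which is the motivation for trying it despite the static oracle's simplicity.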

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning1 Using very limited data to train LM for word and context vectors and

use them to train a parser from scratch2 Using Facebookrsquos word vectors to train a parser [Bojanowski et al

2017]3 Using my own word and context vectors trained with different

language but from the same language family4 Applying transfer learning with a pre-trained parser

Language (1) (2) (3) (4)af afribooms not provided 7546 7743 7812kk ktb 2019 2231 2196 2386bxr bdt 764 976 993 898

kmr mg 2012 2257 2278 2339

Table LAS values for strategies (1) (2) (3) and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the mostbeneficial

From scratch LM training does not bring useful word and contextvectors

Our word and context vectors are still more useful than Facebookrsquos[Bojanowski et al 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition Based Parser can only build projective trees 6

6Figure fromhttpstplingfiluuse sarakurser5LN455-2014lectures5LN455-F8pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123

Projective vs Non-projective

We compared our model with the best model for different projectivityratios

Language Projectiviy Best (LAS) Our (LAS)

grc perseus 907 7939 5503 (20)

eu bdt 9513 8422 7413 (17)

hu szeged 978 8266 6818 (14)

da ddt 9826 8628 7640 (17)

en gum 996 8505 7644 (15)

gl treegal 100 7425 7045 (10)

gl ctg 100 8212 7945 (14)

Table Our models performance gap decreases as the projectivity ratio increases

7

7From official results page and our projectivity tableOmer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusionWe introduced ldquoContext Word and Morph-featrdquo embeddings and showedtheir contribution in transition based dependency parsing

Our Tree-stack LSTM outperformed MLP by removing hand-craftedfeature engineering

Tree-stack LSTM performed better with low resource languages

When the training dataset size increases tree-stack LSTM losses itsadvantage

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization morphological taggingand dependency parsing performed better Some are also jointly trained alanguage model together with pre-trained embeddings

Attention Mechanism

Applying attention in between σ-LSTM states or β-LSTM orAction-LSTM may bring performance improvement

Morphological Features

Finding different way to represent morphological features

Dynamic Oracle vs Beam Training

Although I tried both of them I could not obtain performanceimprovement There may be convergence problems with our loss functionand another losses (CRF) may solve this problem

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap Erenay Dayanık and Deniz Yuret 2018 Tree-stackLSTM in Transition Based Dependency Parsing In Proceedings ofthe CoNLL 2018 Shared Task Multilingual Parsing from Raw Text toUniversal Dependencies

Omer Kırnap Berkay Furkan Onder and Deniz Yuret 2017 Parsingwith Context Embeddings In Proceedings of the CoNLL 2017 SharedTask Multilingual Parsing from Raw Text to Universal Dependencies

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann Carlos Gomez-Rodriguez and Giorgio Satta 2011Dynamic programming algorithms for transition-based dependencyparsers In Proceedings of the 49th Annual Meeting of theAssociation for Computational Linguistics Human LanguageTechnologies-Volume 1 Association for Computational Linguisticspages 673682

S Kbler R McDonald and J Nivre 2009 Dependency parsingMorgan amp Claypool US

Chris Dyer Miguel Ballesteros Wang Ling Austin Matthews andNoah A Smith 2015 Transition based dependency parsing withstack long-short term memory CoRR abs150508075

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123

  • Introduction
    • Overview of Dependency Parsing
    • Transition Based Dependency Parsing
      • Related Work
        • Linear Models and their Drawbacks
        • Neural Network Models
          • Model
            • Language Model
            • MLP Parser
            • Tree-stack LSTM Parser
              • Results
                • MLP vs Tree-stack LSTM
                • Morphological Feature Embeddings
                • Static vs Dynamic Oracle Training
                • Transfer Learning
                  • Conclusion
                  • Future Work amp Discussions
Page 33: Transition Based Dependency Parsing with Deep LearningTransition Based Dependency Parsing with Deep Learning Omer K rnap Ko˘c University okirnap@ku.edu.tr September 27, 2018 Omer

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to UniversalDependencies

CoNLL17bull Koc-University team with MLP Parser using Context Embeddings

CoNLL18bull KParse team with Tree-stack LSTM Parser using Context and

Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 33 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to UniversalDependencies

CoNLL17

bull Koc-University team with MLP Parser using ContextEmbeddings

CoNLL18

bull KParse team with Tree-stack LSTM Parser using Context and Morph-featEmbeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 34 123

a Language Model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 35 123

Language Model (LM)

LM is used to obtain Context and Word embeddings with twocomponents

Character Based LSTM extracts word vectors

Word Based BiLSTM extracts context vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 36 123

Language Model - Word vectors

Character based LSTM generates word Vectors

Figure Character LSTM from Kırnap et al 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 37 123

Language Model - Context Vectors

Word based BiLSTM generates Context Vectors

Figure Word BiLSTM from Kırnap et al 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 38 123

b MLP Parser (CoNLL17)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 39 123

MLP Parser

MLP Parser consists of 4 components

Character Based LSTM extracts word vectors

Word Based BiLSTM extracts context vectors

Feature extractor describes current state

Decision module (MLP) decides the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 40 123

MLP Parser - Feature Extraction

Feature extractor describes current state

Figure Kırnap et al 2017Omer Kırnap (Koc University) MSc Thesis September 27 2018 41 123

MLP Parser - Decision Module

Decision module (MLP) decides the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 42 123

Experiments amp Dataset (MLP) CoNLL17

CoNLL17 Dataset

Dependency parsing of 81 treebanks in 49 languages

All treebanks use standardized annotationbull 17 universal part-of-speech tagsbull 37 universal dependency relations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 43 123

Experiments - Evaluation Metric

Labeled Attachment Score (LAS)The percentage of words correctly assigned both the correct syntactic headand the correct dependency label

Economic news hadGold Tree LAS 1

SBJATT

Economic news had

PREDOBJPred 1 LAS 0

Economic news had

OBJATTPred 2 LAS (frac12)100

Omer Kırnap (Koc University) MSc Thesis September 27 2018 44 123

Experiments (MLP)

CoNLL 2017 Results (all treebanks LAS)

Ranked 1st among transition based parsers 5

5Source CoNLL17 official results pageOmer Kırnap (Koc University) MSc Thesis September 27 2018 45 123

Contributions in CoNLL17

Omer Kırnap (Koc University) MSc Thesis September 27 2018 46 123

Context and Word Embeddings

Relative contributions of part-of-speech (p) word vector (v)context vector (c)

Feats Hungarian En-ParTUT Latvianp 636 766 559

v 735 759 63

c 722 76 635

v-c 76 79 676

p-c 78 825 706

p-v 766 808 677

p-fb 747 797 663

p-v-c 793 832 742

Omer Kırnap (Koc University) MSc Thesis September 27 2018 47 123

Context and Word Embeddings

Relative contributions of part-of-speech (p) word vector (v)context vector (c)

Feats Hungarian En-ParTUT Latvianp 636 766 559

v 735 759 63

c 722 76 635

v-c 76 79 676

p-c 78 825 706

p-v 766 808 677

p-fb 747 797 663

p-v-c 793 832 742

Context vectors provide independent contribution on top ofPOS tags

Omer Kırnap (Koc University) MSc Thesis September 27 2018 48 123

Context and Word embeddings

Feats Hungarian En-ParTUT Latvianp 636 766 559

v 735 759 63

c 722 76 635

v-c 76 79 676

p-c 78 825 706

p-v 766 808 677p-fb 747 797 663

p-v-c 793 832 742

Our BiLSTM language model word vectors perform betterthan FB vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 49 123

Context and Word embeddings

Feats Hungarian En-ParTUT Latvianp 636 766 559

v 735 759 63

c 722 76 635

v-c 76 79 676

p-c 78 825 706

p-v 766 808 677

p-fb 747 797 663

p-v-c 793 832 742

Both POS tags and context vectors have significantcontributions on top of word vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 50 123

Issues with MLP

However

Choosing correct state of parser still remains critical

We are unable to represent whole parsing history with featureextracting


Solution

Find a recurrent architecture that can summarize the parsing history as well as the word sequences in the buffer and stack.
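One way to picture this solution is to summarize each sequence (stack, buffer, action history) with a recurrent network and concatenate the summaries. The sketch below uses a plain tanh RNN in place of the LSTMs the thesis uses; all dimensions and weights are made-up toy values:

```python
import numpy as np

def rnn_summary(seq, W, U, b):
    """Run a simple tanh RNN over a sequence of vectors and return the
    final hidden state as a fixed-size summary of the whole sequence."""
    h = np.zeros(b.shape)
    for x in seq:
        h = np.tanh(W @ x + U @ h + b)
    return h

rng = np.random.default_rng(0)
d = 8                                   # toy embedding size
W, U, b = rng.normal(size=(d, d)), rng.normal(size=(d, d)), np.zeros(d)

stack   = [rng.normal(size=d) for _ in range(3)]
buffer_ = [rng.normal(size=d) for _ in range(5)]
actions = [rng.normal(size=d) for _ in range(4)]

# Parser state = concatenation of the three summaries, fed to an MLP.
state = np.concatenate([rnn_summary(s, W, U, b)
                        for s in (stack, buffer_, actions)])
```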


Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to Universal Dependencies

CoNLL17

• Koc-University team with MLP Parser using Context Embeddings

CoNLL18

• KParse team with Tree-stack LSTM Parser using Context and Morph-feat Embeddings


c Tree-stack LSTM Parser (CoNLL18)


Related Work - Stack LSTM

Figure Stack LSTM [Dyer et al 2015]

Represent each component (σ, β, A) with an LSTM. Modify the head word's embedding with the dependent's embedding.


Problems with Stack LSTM

They only modify the stack's word embeddings

Hidden states of the LSTMs are not updated unless a reduce occurs

Actions are not explicitly represented

They only used word2vec embeddings [Mikolov et al. 2013]


Our solution

We propose

Context embeddings should improve parsing accuracy

Dependency relations should be explicitly represented

Morphological Features of a word may enhance parsing accuracy


Tree-stack LSTM Overview

[Figure: Tree-stack LSTM architecture: σ-LSTM (stack), β-LSTM (buffer), and Action-LSTM outputs, composed with the t-RNN over head word, dependent word, and dependency relation, concatenated and fed to an MLP]

We propose the Tree-stack LSTM model with 4 components:

β-LSTM, σ-LSTM, Action-LSTM, Tree-RNN


Tree-stack LSTM

Input Representation


Input Representation

Action and Dependency Relation Embeddings

Every action is represented with continuous vector

Every dependency relation is represented with continuous vector


Input Representation

We do not include an explicit feature extractor. We initialize the word representation by concatenating:

Character-based LSTM's word vectors

Word-based BiLSTM's context vectors

Part-of-speech (POS) vectors

Morph-feat vectors
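The concatenation above can be sketched as follows (the dimensions are hypothetical placeholders, not the thesis's actual sizes):

```python
import numpy as np

rng = np.random.default_rng(0)
word_vec    = rng.normal(size=350)   # character-based LSTM output
context_vec = rng.normal(size=300)   # word-based BiLSTM output
pos_vec     = rng.normal(size=128)   # learned POS embedding
morph_vec   = rng.normal(size=128)   # morph-feat embedding

# The input representation of one token is simply the concatenation:
token_repr = np.concatenate([word_vec, context_vec, pos_vec, morph_vec])
```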


Input Representation

Morph-feat Vectors

Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs IT It

Figure Morph-feat Embeddings
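One plausible way to embed such a feature string is to look up one vector per key=value pair and sum them. This is a sketch; the thesis may combine the pair embeddings differently:

```python
import numpy as np

def morph_feat_vector(feats, table, dim=64):
    """Embed a UD FEATS string such as
    'Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs' by looking
    up one embedding per key=value pair and summing them."""
    vec = np.zeros(dim)
    if feats == "_":            # CoNLL-U marks "no features" with "_"
        return vec
    for pair in feats.split("|"):
        if pair not in table:   # allocate a fresh random embedding
            table[pair] = np.random.default_rng(len(table)).normal(size=dim)
        vec += table[pair]
    return vec

table = {}
v = morph_feat_vector(
    "Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs", table)
```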


Tree-stack LSTM

Model Components: 1. β-LSTM 2. σ-LSTM 3. Action-LSTM 4. Tree-RNN


β-LSTM

[Figure: Tree-stack LSTM architecture: σ-LSTM, β-LSTM, Action-LSTM, and t-RNN outputs concatenated and fed to an MLP]


β-LSTM

[Figure: Buffer's β-LSTM, an LSTM running over the buffer words w_i, w_i+1, w_i+2]


σ-LSTM

[Figure: Tree-stack LSTM architecture: σ-LSTM, β-LSTM, Action-LSTM, and t-RNN outputs concatenated and fed to an MLP]


σ-LSTM

[Figure: Stack's σ-LSTM, an LSTM running over the stack words s_i, s_i+1, s_i+2]


Action-LSTM

[Figure: Tree-stack LSTM architecture: σ-LSTM, β-LSTM, Action-LSTM, and t-RNN outputs concatenated and fed to an MLP]


Action-LSTM

[Figure: Action-LSTM, an LSTM running over the transition history]


How are the components of the tree-stack LSTM connected?


Tree-RNN


Tree-RNN (t-RNN)

t-RNN

Dependent word

Dependency Relation

Head word

Figure t-RNN

w_head_new = tanh(W_rnn * [w_head_old ; d_l ; w_dep] + b_rnn)    (1)

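Equation (1) can be written out directly (toy dimensions; W_rnn and b_rnn are randomly initialized stand-ins here, not trained weights):

```python
import numpy as np

d = 4                                  # toy embedding size
rng = np.random.default_rng(1)
W_rnn = rng.normal(size=(d, 3 * d))    # maps [head; deprel; dep] -> head
b_rnn = np.zeros(d)

def t_rnn(w_head, d_l, w_dep):
    # Eq. (1): compose head, relation, and dependent into a new head
    return np.tanh(W_rnn @ np.concatenate([w_head, d_l, w_dep]) + b_rnn)

new_head = t_rnn(rng.normal(size=d), rng.normal(size=d), rng.normal(size=d))
```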

Tree-RNN with

1. Left Transition 2. Right Transition


Left Transition


Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

LSTM LSTM LSTM LSTM

Left transition

t-RNN

Dependency Relation

HeadDependent

Figure: Each embedding is initiated by concatenating POS, language, and morph-feat embeddings


Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

Figure: Stack's top LSTM is reduced

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

New Head

Figure t-RNN calculates new head embedding


Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure: β-LSTM recalculates its hidden state based on the new input


Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure Tree-stack LSTM is ready to give new transition


Right Transition


Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

LSTM LSTM LSTM LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure: Each embedding is initiated by concatenating POS, language, and morph-feat embeddings


Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure Stackrsquos top LSTM is reduced


Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

New Head

Figure t-RNN calculates new head embedding


Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure: σ-LSTM recalculates its hidden state from the new input


Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure Tree-stack LSTM is ready to give new transition

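The two transitions can be sketched as operations on Python lists standing in for σ (stack) and β (buffer); the word ids and labels below are hypothetical:

```python
def left(sigma, beta, arcs, d):
    """left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)}):
    pop s from the stack and attach it as a dependent of the buffer
    front b with label d. The buffer is unchanged."""
    s = sigma.pop()
    b = beta[0]
    arcs.add((b, d, s))

def right(sigma, beta, arcs, d):
    """right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)}):
    pop t from the stack and attach it as a dependent of the new
    stack top s with label d."""
    t = sigma.pop()
    s = sigma[-1]
    arcs.add((s, d, t))

# Hypothetical word ids: stack [0, 1, 2], buffer front 3
sigma, beta, arcs = [0, 1, 2], [3], set()
left(sigma, beta, arcs, "nsubj")   # adds arc 3 -> 2
right(sigma, beta, arcs, "obj")    # adds arc 0 -> 1
```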

Final overview of Tree-stack LSTM

[Figure: Tree-stack LSTM architecture: σ-LSTM, β-LSTM, Action-LSTM, and t-RNN outputs concatenated and fed to an MLP]


Overview

1. Introduction: Overview of Dependency Parsing; Transition Based Dependency Parsing

2. Related Work: Linear Models and their Drawbacks; Neural Network Models

3. Model: Language Model; MLP Parser; Tree-stack LSTM Parser

4. Results: MLP vs Tree-stack LSTM; Morphological Feature Embeddings; Static vs Dynamic Oracle Training; Transfer Learning

5. Conclusion

6. Future Work & Discussions


4. Results & Comparisons


Results & Comparisons

Dataset

CoNLL17:
• Dependency parsing of 81 treebanks in 49 languages
• All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations
• Koc-University ranked 7th out of 33 participants (1st among transition based parsers)

CoNLL18:
• Dependency parsing of 82 treebanks in 57 languages
• All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations
• Koc-University ranked 16th out of 30 participants (2nd among transition based parsers)

Changes from CoNLL17 to CoNLL18: 1. Train/test split change 2. Annotation


MLP vs Tree-stack LSTM

The CoNLL 2018 committee released comparison results of CoNLL17 and CoNLL18 systems tested under the same test sets.


MLP vs Tree-stack LSTM

Two possible problems with the official comparison:

1. If the annotation of the treebank is improved, the older parser is handicapped.

2. If the training-test split has changed and the old training data are now in the test data, the old parser is favored undeservedly.


MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code        MLP    Tree-stack
ru taiga (10k)   58.89  60.55
hu szeged (20k)  66.21  68.18
tr imst (50k)    56.78  58.75
ar padt (120k)   67.83  68.14
en ewt (205k)    74.87  75.77
cs cac (473k)    83.39  83.57

Tree-stack LSTM outperforms MLP


Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM


MLP Parser


Figure Initial model


Only Action LSTM


Figure Only action LSTM


Only β-LSTM


Figure Only β-LSTM


Only σ-LSTM


Figure Only σ-LSTM


Ablation Analysis Results

Lang Code   MLP    Only Action  Only-β  Only-σ
hu szeged   66.21  66.87        66.94   67.03
sv lines    71.12  72.05        72.17   72.45
tr imst     57.12  56.87        57.02   57.12
ar padt     67.83  66.67        66.89   66.92
cs cac      83.89  82.23        83.13   83.17
en ewt      75.54  75.43        75.56   75.67

Table: Comparison between MLP and "Only" models


Ablation of t-RNN

[Figure: Tree-stack LSTM architecture: σ-LSTM, β-LSTM, Action-LSTM, and t-RNN outputs concatenated and fed to an MLP]


Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang Code            without t-RNN  with t-RNN
no nynorsklia (3k)   51.78          53.33
ru taiga (11k)       59.13          60.55
gl treegal (15k)     69.76          70.45
hu szeged (20k)      66.12          68.18
sv lines (49k)       74.04          75.46
tr imst (50k)        58.12          58.75
ar padt (120k)       68.04          68.14
en ewt (204k)        74.87          75.77
cs cac (473k)        82.89          83.57
cs pdt (1M)          81.17          81.164

t-RNN provides a comparative advantage for low-resource languages


Ablation Analysis

Overall results of ablation analysis

Lang       MLP    Only A  Only-β  Only-σ  w/o t-RNN  all
hu szeged  66.21  66.87   66.94   67.03   66.12      68.18
sv lines   71.12  72.05   72.17   74.04   72.17      75.46
tr imst    57.12  56.87   57.02   57.12   58.12      58.75
ar padt    67.83  66.67   66.89   66.92   68.04      68.14
cs cac     83.89  82.23   83.13   83.17   82.89      83.57
en ewt     75.54  75.43   75.56   75.67   74.87      75.77

Tree-stack LSTM beats other model variations


Ablation Analysis

Conclusions of Ablation Experiments

t-RNN's performance contribution increases when the training size decreases

σ-LSTM provides more useful information independent of dataset size

Interconnecting the model's components with t-RNN makes the tree-stack LSTM more powerful for low-resource languages (ranked 10th overall and 2nd among transition based parsers)


What does Morphological Feature Embedding provide


Contribution of Morph-feat Embeddings

Experimental Settings: We divide the CoNLL18 UD v2.2 dataset into 4 parts based on the number of training tokens per language, to better understand our contributions.

Languages having less than 20k tokens

Languages having more than 20k less than 50k tokens

Languages having more than 50k less than 100k tokens

Languages having 100k tokens or more


Contribution of Morph-feat embeddings

Morph-feat experiments for languages having less than 20k training tokens:

Lang code      Morph-Feats  no Morph-Feats  # of tokens
no nynorsklia  51.13        53.33           3583
ru taiga       58.32        60.55           10479
sme giella     52.78        53.39           16385
la perseus     49.93        51.6            18184
ug udt         52.78        53.39           19262
sl sst         46.72        48.77           19473
hu szeged      66.23        68.18           20166

Not useful for languages having less than 20k training tokens


Contribution of Morph-feat embeddings

Morph-feat experiments for languages having between 50k and 100k training tokens:

Lang code      Morph-Feats  no Morph-Feats  # of tokens
sv lines       72.18        74.81           48325
fr sequoia     84.36        82.17           50543
en gum         76.44        75.34           53686
ko gsd         73.74        72.54           56687
eu bdt         74.55        73.32           72974
nl lassysmall  76.7         75.8            75134
gl ctg         79.02        79.018          79327
lv lvtb        72.33        72.24           80666
id gsd         75.76        73.97           97531

Beneficial for languages with 50k-100k training tokens


Contribution of Morph-feat embeddings

Morph-feat experiments for languages having more than 100k training tokens:

Lang code   Morph-Feats  no Morph-Feats  # of tokens
fa seraji   81.18        81.12           121064
bg btb      84.53        84.55           124336
en ewt      75.77        75.682          204585
ar padt     68.02        68.14           223881
de gsd      71.59        71.32           263804
ca ancora   85.89        85.874          417587
es ancora   84.99        84.78           444617
cs cac      83.57        83.63           472608
cs pdt      81.43        82.12           1173282

Neutral for languages having more than 100k training tokens


Static vs Dynamic Oracle Training

Static oracle: transitions follow gold moves. Dynamic oracle: transitions follow predicted moves.

In both cases, the log-probability of the gold moves is maximized.
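The difference between the two regimes can be sketched for a single decision. The scores below are hypothetical logits; the loss is cross-entropy on the gold move in both cases, and only the move that advances the parser state differs:

```python
import math
import random

def train_step(scores, gold, dynamic, rng):
    """One transition decision under a static or dynamic oracle.
    `scores` maps each candidate move to a logit. The loss maximizes
    log p(gold move) in BOTH regimes; the regimes differ only in
    which move advances the parser state."""
    log_z = math.log(sum(math.exp(s) for s in scores.values()))
    loss = log_z - scores[gold]               # -log p(gold move)
    if dynamic:
        taken = max(scores, key=scores.get)   # follow own prediction
        if rng.random() < 0.1:                # optional exploration
            taken = rng.choice(sorted(scores))
    else:
        taken = gold                          # always follow gold
    return loss, taken

loss, taken = train_step({"shift": 2.0, "left": 0.5}, "left",
                         dynamic=False, rng=random.Random(1))
```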

[Figure: Tree-stack LSTM architecture: σ-LSTM, β-LSTM, Action-LSTM, and t-RNN outputs concatenated and fed to an MLP]


Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k


Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k


Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k


How about languages with fewer than 20k training tokens?


Transfer Learning

There are 4 possible types of transfer learning:

1. Using very limited data to train an LM for word and context vectors, and using them to train a parser from scratch

2. Using Facebook's word vectors to train a parser [Bojanowski et al. 2017]

3. Using my own word and context vectors trained on a different language from the same language family

4. Applying transfer learning with a pre-trained parser

Language      (1)           (2)    (3)    (4)
af afribooms  not provided  75.46  77.43  78.12
kk ktb        20.19         22.31  21.96  23.86
bxr bdt       7.64          9.76   9.93   8.98
kmr mg        20.12         22.57  22.78  23.39

Table: LAS values for strategies (1), (2), (3), and (4)


Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the most beneficial

Training an LM from scratch on very limited data does not yield useful word and context vectors

Our word and context vectors are still more useful than Facebook's [Bojanowski et al. 2017]


Projectivity

A transition-based parser can only build projective trees 6

6Figure fromhttpstplingfiluuse sarakurser5LN455-2014lectures5LN455-F8pdf
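Projectivity can be checked directly from the head array: a tree is projective iff no two dependency arcs cross when drawn above the sentence (a minimal sketch):

```python
def is_projective(heads):
    """heads[i] is the head of word i; index 0 is the artificial root
    (heads[0] is ignored). A tree is projective iff no two arcs cross
    when drawn above the sentence."""
    arcs = [(min(h, d), max(h, d)) for d, h in enumerate(heads[1:], start=1)]
    for l1, r1 in arcs:
        for l2, r2 in arcs:
            if l1 < l2 < r1 < r2:   # strictly interleaved endpoints cross
                return False
    return True
```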


Projective vs Non-projective

We compared our model with the best model for different projectivity ratios.

Language     Projectivity  Best (LAS)  Our (LAS)
grc perseus  90.7          79.39       55.03 (20)
eu bdt       95.13         84.22       74.13 (17)
hu szeged    97.8          82.66       68.18 (14)
da ddt       98.26         86.28       76.40 (17)
en gum       99.6          85.05       76.44 (15)
gl treegal   100           74.25       70.45 (10)
gl ctg       100           82.12       79.45 (14)

Table: Our model's performance gap decreases as the projectivity ratio increases

7 From the official results page and our projectivity table.

Conclusions


Conclusion

In conclusion, we introduced "Context, Word, and Morph-feat" embeddings and showed their contribution to transition based dependency parsing.

Our Tree-stack LSTM outperformed the MLP by removing hand-crafted feature engineering.

Tree-stack LSTM performed better on low-resource languages.

When the training dataset size increases, the tree-stack LSTM loses its advantage.


Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization, morphological tagging, and dependency parsing performed better. Some also jointly train a language model together with pre-trained embeddings.

Attention Mechanism

Applying attention across the σ-LSTM, β-LSTM, or Action-LSTM states may bring a performance improvement.

Morphological Features

Finding different ways to represent morphological features.

Dynamic Oracle vs Beam Training

Although I tried both of them, I could not obtain a performance improvement. There may be convergence problems with our loss function, and other losses (e.g., CRF) may solve this problem.


Publications

Ömer Kırnap, Erenay Dayanık, and Deniz Yuret. 2018. Tree-stack LSTM in Transition Based Dependency Parsing. In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Ömer Kırnap, Berkay Furkan Önder, and Deniz Yuret. 2017. Parsing with Context Embeddings. In Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.


References

Marco Kuhlmann, Carlos Gómez-Rodríguez, and Giorgio Satta. 2011. Dynamic programming algorithms for transition-based dependency parsers. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Volume 1, pages 673-682. Association for Computational Linguistics.

S. Kübler, R. McDonald, and J. Nivre. 2009. Dependency Parsing. Morgan & Claypool, US.

Chris Dyer, Miguel Ballesteros, Wang Ling, Austin Matthews, and Noah A. Smith. 2015. Transition-based dependency parsing with stack long short-term memory. CoRR abs/1505.08075.


Thank you for your attention


Questions


Page 34: Transition Based Dependency Parsing with Deep LearningTransition Based Dependency Parsing with Deep Learning Omer K rnap Ko˘c University okirnap@ku.edu.tr September 27, 2018 Omer

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to UniversalDependencies

CoNLL17

bull Koc-University team with MLP Parser using ContextEmbeddings

CoNLL18

bull KParse team with Tree-stack LSTM Parser using Context and Morph-featEmbeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 34 123

a Language Model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 35 123

Language Model (LM)

LM is used to obtain Context and Word embeddings with twocomponents

Character Based LSTM extracts word vectors

Word Based BiLSTM extracts context vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 36 123

Language Model - Word vectors

Character based LSTM generates word Vectors

Figure Character LSTM from Kırnap et al 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 37 123

Language Model - Context Vectors

Word based BiLSTM generates Context Vectors

Figure Word BiLSTM from Kırnap et al 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 38 123

b MLP Parser (CoNLL17)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 39 123

MLP Parser

MLP Parser consists of 4 components

Character Based LSTM extracts word vectors

Word Based BiLSTM extracts context vectors

Feature extractor describes current state

Decision module (MLP) decides the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 40 123

MLP Parser - Feature Extraction

Feature extractor describes current state

Figure Kırnap et al 2017Omer Kırnap (Koc University) MSc Thesis September 27 2018 41 123

MLP Parser - Decision Module

Decision module (MLP) decides the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 42 123

Experiments amp Dataset (MLP) CoNLL17

CoNLL17 Dataset

Dependency parsing of 81 treebanks in 49 languages

All treebanks use standardized annotationbull 17 universal part-of-speech tagsbull 37 universal dependency relations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 43 123

Experiments - Evaluation Metric

Labeled Attachment Score (LAS)The percentage of words correctly assigned both the correct syntactic headand the correct dependency label

Economic news hadGold Tree LAS 1

SBJATT

Economic news had

PREDOBJPred 1 LAS 0

Economic news had

OBJATTPred 2 LAS (frac12)100

Omer Kırnap (Koc University) MSc Thesis September 27 2018 44 123

Experiments (MLP)

CoNLL 2017 Results (all treebanks LAS)

Ranked 1st among transition based parsers 5

5Source CoNLL17 official results pageOmer Kırnap (Koc University) MSc Thesis September 27 2018 45 123

Contributions in CoNLL17

Omer Kırnap (Koc University) MSc Thesis September 27 2018 46 123

Context and Word Embeddings

Relative contributions of part-of-speech (p) word vector (v)context vector (c)

Feats Hungarian En-ParTUT Latvianp 636 766 559

v 735 759 63

c 722 76 635

v-c 76 79 676

p-c 78 825 706

p-v 766 808 677

p-fb 747 797 663

p-v-c 793 832 742

Omer Kırnap (Koc University) MSc Thesis September 27 2018 47 123

Context and Word Embeddings

Relative contributions of part-of-speech (p) word vector (v)context vector (c)

Feats Hungarian En-ParTUT Latvianp 636 766 559

v 735 759 63

c 722 76 635

v-c 76 79 676

p-c 78 825 706

p-v 766 808 677

p-fb 747 797 663

p-v-c 793 832 742

Context vectors provide independent contribution on top ofPOS tags

Omer Kırnap (Koc University) MSc Thesis September 27 2018 48 123

Context and Word embeddings

Feats Hungarian En-ParTUT Latvianp 636 766 559

v 735 759 63

c 722 76 635

v-c 76 79 676

p-c 78 825 706

p-v 766 808 677p-fb 747 797 663

p-v-c 793 832 742

Our BiLSTM language model word vectors perform betterthan FB vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 49 123

Context and Word embeddings

Feats Hungarian En-ParTUT Latvianp 636 766 559

v 735 759 63

c 722 76 635

v-c 76 79 676

p-c 78 825 706

p-v 766 808 677

p-fb 747 797 663

p-v-c 793 832 742

Both POS tags and context vectors have significantcontributions on top of word vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 50 123

Issues with MLP

However

Choosing correct state of parser still remains critical

We are unable to represent whole parsing history with featureextracting

Omer Kırnap (Koc University) MSc Thesis September 27 2018 51 123

Solution

Find a recurrent architecture such that it can summarize the parsinghistory as well as word sequences in a buffer and stack

Omer Kırnap (Koc University) MSc Thesis September 27 2018 52 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to UniversalDependencies

CoNLL17

bull Koc-University team with MLP Parser using Context Embeddings

CoNLL18

bull KParse team with Tree-stack LSTM Parser usingContext and Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 53 123

c Tree-stack LSTM Parser (CoNLL18)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 54 123

Related Work - Stack LSTM

Figure Stack LSTM [Dyer et al 2015]

Represent each component (σ β A) with an LSTMModifying head wordrsquos embedding with dependent embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 55 123

Problems with Stack LSTM

They only modify stackrsquos word embeddings

Hidden states of LSTMS are not updated unless reduce

Actions are not explicitly represented

They only used word2vec embeddings [Mikolov et al 2013]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 56 123

Our solution

We propose

Context embeddings should improve parsing accuracy

Dependency relations should be explicitly represented

Morphological Features of a word may enhance parsing accuracy

Omer Kırnap (Koc University) MSc Thesis September 27 2018 57 123

Tree-stack LSTM Overview

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

We propose Tree-stack LSTM model with 4 components

β-LSTMσ-LSTMAction-LSTMTree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 58 123

Tree-stack LSTM

Input Representation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 59 123

Input Representation

Action and Dependency Relation Embeddings

Every action is represented with continuous vector

Every dependency relation is represented with continuous vector

Omer Kırnap (Koc University) MSc Thesis September 27 2018 60 123

Input Representation

We do not include explicit feature extractor We initiated wordrepresentation by concatenating

Character Based LSTMrsquos word vectors

Word Based BiLSTMrsquos context vectors

Part-of-speech (POS) vectors

Morph-feat vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 61 123

Input Representation

Morp-feat Vectors

Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs IT It

Figure Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 62 123

Tree-stack LSTM

Model Components1 β-LSTM2 σ-LSTM3 Action-LSTM4 Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 63 123

β-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 64 123

β-LSTM

LSTM LSTM LSTM

wi+2wi+1wi

Figure Bufferrsquos β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 65 123

σ-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 66 123

σ-LSTM

LSTM LSTM LSTM

si si+1 si+2

Figure Stackrsquos σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 67 123

Action-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 68 123

Action-LSTM

LSTM LSTM LSTM

Figure Action-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 69 123

How do components of tree-stack LSTM are connected

Omer Kırnap (Koc University) MSc Thesis September 27 2018 70 123

Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 71 123

Tree-RNN (t-RNN)

t-RNN

Dependent word

Dependency Relation

Head word

Figure t-RNN

whead new = tanh(Wrnn lowast [whead old dl wdep] + brnn) (1)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 72 123

Tree-RNN with

1 Left Transition2 Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 73 123

Left Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 74 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM LSTM

Left transition

t-RNN

Dependency Relation

HeadDependent

Figure Each embedding initiated by concatenating POS language andmorph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 75 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

Figure Stackrsquos top LSTM is reducedOmer Kırnap (Koc University) MSc Thesis September 27 2018 76 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 77 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure β-LSTM recalculates its hidden based on new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 78 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure Tree-stack LSTM is ready to give new transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 79 123

Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 80 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure Each embedding initiated by concatenating POS language andmorph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 81 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure Stackrsquos top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 82 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 83 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure σ-LSTM recalculates its hidden from new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 84 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure Tree-stack LSTM is ready to give new transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 85 123

Final overview of Tree-stack LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 86 123

Overview

1 IntroductionOverview of Dependency ParsingTransition Based Dependency Parsing

2 Related WorkLinear Models and their DrawbacksNeural Network Models

3 ModelLanguage ModelMLP ParserTree-stack LSTM Parser

4 ResultsMLP vs Tree-stack LSTMMorphological Feature EmbeddingsStatic vs Dynamic Oracle TrainingTransfer Learning

5 Conclusion6 Future Work amp Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 87 123

4 Results amp Comparisons

Omer Kırnap (Koc University) MSc Thesis September 27 2018 88 123

Results amp Comparisons

Dataset

CoNLL17:
• Dependency parsing of 81 treebanks in 49 languages
• All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations
• Koç University ranked 7th out of 33 participants (1st among transition-based parsers)

CoNLL18:
• Dependency parsing of 82 treebanks in 57 languages
• All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations
• Koç University ranked 16th out of 30 participants (2nd among transition-based parsers)

Differences between the two tasks: 1. Train/test split change 2. Annotation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 89 123

MLP vs Tree-stack LSTM

The CoNLL 2018 committee released comparison results of CoNLL17 and CoNLL18 systems tested on the same test sets

Omer Kırnap (Koc University) MSc Thesis September 27 2018 90 123

MLP vs Tree-stack LSTM

2 possible problems with the official comparison:

1 If the annotation of the treebank is improved, the older parser is handicapped

2 If the training-test split has changed and the old training data are now in the test data, the old parser is favored undeservedly

Omer Kırnap (Koc University) MSc Thesis September 27 2018 91 123

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code         MLP    Tree-stack
ru_taiga (10k)    58.89  60.55
hu_szeged (20k)   66.21  68.18
tr_imst (50k)     56.78  58.75
ar_padt (120k)    67.83  68.14
en_ewt (205k)     74.87  75.77
cs_cac (473k)     83.39  83.57

Tree-stack LSTM outperforms MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 92 123

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 93 123

MLP Parser

Figure: Initial model (MLP only)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 94 123

Only Action LSTM

Figure: Only action LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 95 123

Only β-LSTM

Figure: Only β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 96 123

Only σ-LSTM

Figure: Only σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang Code   MLP    Only Action  Only-β  Only-σ
hu_szeged   66.21  66.87        66.94   67.03
sv_lines    71.12  72.05        72.17   72.45
tr_imst     57.12  56.87        57.02   57.12
ar_padt     67.83  66.67        66.89   66.92
cs_cac      83.89  82.23        83.13   83.17
en_ewt      75.54  75.43        75.56   75.67

Table: Comparison between MLP and "Only" models

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN

Figure: Tree-stack LSTM architecture

Omer Kırnap (Koc University) MSc Thesis September 27 2018 99 123

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang Code            without t-RNN  with t-RNN
no_nynorsklia (3k)   51.78          53.33
ru_taiga (11k)       59.13          60.55
gl_treegal (15k)     69.76          70.45
hu_szeged (20k)      66.12          68.18
sv_lines (49k)       74.04          75.46
tr_imst (50k)        58.12          58.75
ar_padt (120k)       68.04          68.14
en_ewt (204k)        74.87          75.77
cs_cac (473k)        82.89          83.57
cs_pdt (1M)          81.17          81.164

t-RNN provides a comparative advantage for low-resource languages

Omer Kırnap (Koc University) MSc Thesis September 27 2018 100 123

Ablation Analysis

Overall results of ablation analysis

Lang       MLP    Only A  Only-β  Only-σ  w/o t-RNN  all
hu_szeged  66.21  66.87   66.94   67.03   66.12      68.18
sv_lines   71.12  72.05   72.17   74.04   72.17      75.46
tr_imst    57.12  56.87   57.02   57.12   58.12      58.75
ar_padt    67.83  66.67   66.89   66.92   68.04      68.14
cs_cac     83.89  82.23   83.13   83.17   82.89      83.57
en_ewt     75.54  75.43   75.56   75.67   74.87      75.77

Tree-stack LSTM beats other model variations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of Ablation Experiments

t-RNN's performance contribution increases as the training size decreases

σ-LSTM provides more useful information independent of dataset size

Interconnecting the model's components with t-RNN makes tree-stack LSTM more powerful for low-resource languages (ranked 10th overall and 2nd among transition-based parsers)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What does Morphological Feature Embedding provide?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental Settings: We divide the CoNLL18 UD dataset (v2.2) into 4 parts based on the number of training tokens for each language, to better understand our contributions

Languages having less than 20k tokens

Languages having more than 20k and less than 50k tokens

Languages having more than 50k and less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having less than 20k training tokens

Lang code      Morph-Feats  no Morph-Feats  # of tokens
no_nynorsklia  51.13        53.33           3,583
ru_taiga       58.32        60.55           10,479
sme_giella     52.78        53.39           16,385
la_perseus     49.93        51.60           18,184
ug_udt         52.78        53.39           19,262
sl_sst         46.72        48.77           19,473
hu_szeged      66.23        68.18           20,166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having between 50k and 100k training tokens

Lang code      Morph-Feats  no Morph-Feats  # of tokens
sv_lines       72.18        74.81           48,325
fr_sequoia     84.36        82.17           50,543
en_gum         76.44        75.34           53,686
ko_gsd         73.74        72.54           56,687
eu_bdt         74.55        73.32           72,974
nl_lassysmall  76.7         75.8            75,134
gl_ctg         79.02        79.018          79,327
lv_lvtb        72.33        72.24           80,666
id_gsd         75.76        73.97           97,531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having more than 100k training tokens

Lang code  Morph-Feats  no Morph-Feats  # of tokens
fa_seraji  81.18        81.12           121,064
bg_btb     84.53        84.55           124,336
en_ewt     75.77        75.682          204,585
ar_padt    68.02        68.14           223,881
de_gsd     71.59        71.32           263,804
ca_ancora  85.89        85.874          417,587
es_ancora  84.99        84.78           444,617
cs_cac     83.57        83.63           472,608
cs_pdt     81.43        82.12           1,173,282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle: transitions follow the gold moves. Dynamic oracle: transitions follow the predicted moves.

In both cases, the log-probability of the gold moves is maximized

Figure: Tree-stack LSTM architecture
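A minimal sketch of the difference between the two regimes (illustrative only: `probs` stands in for the model's transition distribution, which is an assumption, not the thesis code):

```python
import math

def oracle_step(probs, gold, dynamic=False):
    """One training step at a parser state.
    probs: model probability per transition, e.g. {"shift": 0.7, ...}
    gold:  the gold transition at this state.
    The loss is -log p(gold) in BOTH regimes; only the move used to
    advance the parser differs (gold move vs. predicted move)."""
    loss = -math.log(probs[gold])
    move = max(probs, key=probs.get) if dynamic else gold
    return loss, move

probs = {"shift": 0.7, "left": 0.2, "right": 0.1}
print(oracle_step(probs, "left", dynamic=False))  # advances with the gold move "left"
print(oracle_step(probs, "left", dynamic=True))   # advances with the prediction "shift"
```

Either way the gradient pushes up p(gold); the dynamic oracle simply exposes the model to the states its own predictions reach.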

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123

Static vs Dynamic Oracle Training

Figure: Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure: Results are very close for training tokens between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure: Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning:

1 Using very limited data to train an LM for word and context vectors, and using them to train a parser from scratch

2 Using Facebook's word vectors to train a parser [Bojanowski et al. 2017]

3 Using my own word and context vectors, trained on a different language from the same language family

4 Applying transfer learning with a pre-trained parser

Language      (1)           (2)    (3)    (4)
af_afribooms  not provided  75.46  77.43  78.12
kk_ktb        20.19         22.31  21.96  23.86
bxr_bdt       7.64          9.76   9.93   8.98
kmr_mg        20.12         22.57  22.78  23.39

Table: LAS values for strategies (1), (2), (3), and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the most beneficial

From-scratch LM training does not yield useful word and context vectors

Our word and context vectors are still more useful than Facebook's [Bojanowski et al. 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition-based parsers can only build projective trees 6

6Figure fromhttpstplingfiluuse sarakurser5LN455-2014lectures5LN455-F8pdf
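Projectivity can be checked directly from the head indices; a small illustrative helper (`is_projective` is a hypothetical name, using the CoNLL-U convention that heads[i] is the head of word i+1 and 0 marks the root):

```python
def is_projective(heads):
    """heads[i] is the head of word i+1 (0 denotes the root).
    The tree is projective iff no two arcs cross when drawn
    above the sentence."""
    arcs = [(min(h, i + 1), max(h, i + 1)) for i, h in enumerate(heads)]
    return not any(l1 < l2 < r1 < r2
                   for (l1, r1) in arcs for (l2, r2) in arcs)

print(is_projective([2, 0, 2]))     # projective: no crossing arcs
print(is_projective([3, 4, 0, 3]))  # non-projective: arcs (1,3) and (2,4) cross
```

Sentences with crossing arcs (common in e.g. ancient Greek treebanks, see the projectivity table below in the deck) cannot be produced by this transition system.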

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123

Projective vs Non-projective

We compared our model with the best model for different projectivity ratios

Language     Projectivity  Best (LAS)  Our (LAS)
grc_perseus  90.7          79.39       55.03 (20)
eu_bdt       95.13         84.22       74.13 (17)
hu_szeged    97.8          82.66       68.18 (14)
da_ddt       98.26         86.28       76.40 (17)
en_gum       99.6          85.05       76.44 (15)
gl_treegal   100           74.25       70.45 (10)
gl_ctg       100           82.12       79.45 (14)

Table: Our model's performance gap decreases as the projectivity ratio increases

7

7 From the official results page and our projectivity table
Omer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusion: We introduced "Context, Word, and Morph-feat" embeddings and showed their contribution to transition-based dependency parsing

Our Tree-stack LSTM outperformed the MLP by removing hand-crafted feature engineering

Tree-stack LSTM performed better with low resource languages

When the training dataset size increases, tree-stack LSTM loses its advantage

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization, morphological tagging, and dependency parsing performed better. Some also jointly train a language model together with pre-trained embeddings

Attention Mechanism

Applying attention between σ-LSTM, β-LSTM, or Action-LSTM states may bring a performance improvement

Morphological Features

Finding different ways to represent morphological features

Dynamic Oracle vs Beam Training

Although I tried both of them, I could not obtain a performance improvement. There may be convergence problems with our loss function, and other losses (e.g. CRF) may solve this problem

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Ömer Kırnap, Erenay Dayanık, and Deniz Yuret. 2018. Tree-stack LSTM in Transition Based Dependency Parsing. In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Ömer Kırnap, Berkay Furkan Önder, and Deniz Yuret. 2017. Parsing with Context Embeddings. In Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann, Carlos Gómez-Rodríguez, and Giorgio Satta. 2011. Dynamic programming algorithms for transition-based dependency parsers. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Volume 1. Association for Computational Linguistics, pages 673-682.

S. Kübler, R. McDonald, and J. Nivre. 2009. Dependency Parsing. Morgan & Claypool, US.

Chris Dyer, Miguel Ballesteros, Wang Ling, Austin Matthews, and Noah A. Smith. 2015. Transition-based dependency parsing with stack long short-term memory. CoRR abs/1505.08075.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123

• Introduction
  • Overview of Dependency Parsing
  • Transition Based Dependency Parsing
• Related Work
  • Linear Models and their Drawbacks
  • Neural Network Models
• Model
  • Language Model
  • MLP Parser
  • Tree-stack LSTM Parser
• Results
  • MLP vs Tree-stack LSTM
  • Morphological Feature Embeddings
  • Static vs Dynamic Oracle Training
  • Transfer Learning
• Conclusion
• Future Work & Discussions
Page 35: Transition Based Dependency Parsing with Deep Learning

a Language Model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 35 123

Language Model (LM)

The LM is used to obtain Context and Word embeddings, with two components:

Character Based LSTM extracts word vectors

Word Based BiLSTM extracts context vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 36 123

Language Model - Word vectors

Character based LSTM generates word Vectors

Figure: Character LSTM, from Kırnap et al. 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 37 123

Language Model - Context Vectors

Word based BiLSTM generates Context Vectors

Figure: Word BiLSTM, from Kırnap et al. 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 38 123

b MLP Parser (CoNLL17)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 39 123

MLP Parser

MLP Parser consists of 4 components

Character Based LSTM extracts word vectors

Word Based BiLSTM extracts context vectors

Feature extractor describes current state

Decision module (MLP) decides the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 40 123

MLP Parser - Feature Extraction

Feature extractor describes current state

Figure: Kırnap et al. 2017
Omer Kırnap (Koc University) MSc Thesis September 27 2018 41 123

MLP Parser - Decision Module

Decision module (MLP) decides the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 42 123

Experiments amp Dataset (MLP) CoNLL17

CoNLL17 Dataset

Dependency parsing of 81 treebanks in 49 languages

All treebanks use standardized annotation: 17 universal part-of-speech tags; 37 universal dependency relations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 43 123

Experiments - Evaluation Metric

Labeled Attachment Score (LAS): the percentage of words assigned both the correct syntactic head and the correct dependency label

Gold tree for "Economic news had": arcs labeled ATT and SBJ (LAS 1)
Pred 1: arcs labeled PRED and OBJ, both wrong (LAS 0)
Pred 2: arcs labeled OBJ and ATT, one of two correct (LAS = (1/2) × 100 = 50)
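Computed on the two scored words of the example, LAS behaves as follows (a minimal sketch; `las` is an illustrative helper, not the shared-task evaluation script):

```python
def las(gold, pred):
    """Labeled Attachment Score: percentage of words whose predicted
    head AND dependency label both match the gold annotation.
    gold, pred: lists of (head_index, label) pairs, one per scored word."""
    correct = sum(g == p for g, p in zip(gold, pred))
    return 100.0 * correct / len(gold)

# "Economic news had": Economic -> news (ATT), news -> had (SBJ)
gold  = [(2, "ATT"), (3, "SBJ")]
pred1 = [(2, "PRED"), (3, "OBJ")]  # both labels wrong
pred2 = [(2, "ATT"), (3, "OBJ")]   # one of two correct
print(las(gold, pred1))  # 0.0
print(las(gold, pred2))  # 50.0
```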

Omer Kırnap (Koc University) MSc Thesis September 27 2018 44 123

Experiments (MLP)

CoNLL 2017 Results (all treebanks LAS)

Ranked 1st among transition based parsers 5

5 Source: CoNLL17 official results page
Omer Kırnap (Koc University) MSc Thesis September 27 2018 45 123

Contributions in CoNLL17

Omer Kırnap (Koc University) MSc Thesis September 27 2018 46 123

Context and Word Embeddings

Relative contributions of part-of-speech (p), word vector (v), and context vector (c)

Feats  Hungarian  En-ParTUT  Latvian
p      63.6       76.6       55.9
v      73.5       75.9       63.0
c      72.2       76.0       63.5
v-c    76.0       79.0       67.6
p-c    78.0       82.5       70.6
p-v    76.6       80.8       67.7
p-fb   74.7       79.7       66.3
p-v-c  79.3       83.2       74.2

Omer Kırnap (Koc University) MSc Thesis September 27 2018 47 123

Context and Word Embeddings

Relative contributions of part-of-speech (p), word vector (v), and context vector (c)

Feats  Hungarian  En-ParTUT  Latvian
p      63.6       76.6       55.9
v      73.5       75.9       63.0
c      72.2       76.0       63.5
v-c    76.0       79.0       67.6
p-c    78.0       82.5       70.6
p-v    76.6       80.8       67.7
p-fb   74.7       79.7       66.3
p-v-c  79.3       83.2       74.2

Context vectors provide an independent contribution on top of POS tags

Omer Kırnap (Koc University) MSc Thesis September 27 2018 48 123

Context and Word embeddings

Feats  Hungarian  En-ParTUT  Latvian
p      63.6       76.6       55.9
v      73.5       75.9       63.0
c      72.2       76.0       63.5
v-c    76.0       79.0       67.6
p-c    78.0       82.5       70.6
p-v    76.6       80.8       67.7
p-fb   74.7       79.7       66.3
p-v-c  79.3       83.2       74.2

Our BiLSTM language model word vectors perform better than FB vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 49 123

Context and Word embeddings

Feats  Hungarian  En-ParTUT  Latvian
p      63.6       76.6       55.9
v      73.5       75.9       63.0
c      72.2       76.0       63.5
v-c    76.0       79.0       67.6
p-c    78.0       82.5       70.6
p-v    76.6       80.8       67.7
p-fb   74.7       79.7       66.3
p-v-c  79.3       83.2       74.2

Both POS tags and context vectors have significant contributions on top of word vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 50 123

Issues with MLP

However

Choosing the correct parser state representation remains critical

We are unable to represent the whole parsing history with feature extraction

Omer Kırnap (Koc University) MSc Thesis September 27 2018 51 123

Solution

Find a recurrent architecture that can summarize the parsing history as well as the word sequences in the buffer and stack

Omer Kırnap (Koc University) MSc Thesis September 27 2018 52 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to Universal Dependencies

CoNLL17

• Koç University team with MLP Parser using Context Embeddings

CoNLL18

• KParse team with Tree-stack LSTM Parser using Context and Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 53 123

c Tree-stack LSTM Parser (CoNLL18)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 54 123

Related Work - Stack LSTM

Figure: Stack LSTM [Dyer et al. 2015]

Represent each component (σ, β, A) with an LSTM. Modify the head word's embedding with the dependent's embedding.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 55 123

Problems with Stack LSTM

They only modify the stack's word embeddings

Hidden states of the LSTMs are not updated except on reduce

Actions are not explicitly represented

They only used word2vec embeddings [Mikolov et al 2013]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 56 123

Our solution

We propose

Context embeddings should improve parsing accuracy

Dependency relations should be explicitly represented

Morphological Features of a word may enhance parsing accuracy

Omer Kırnap (Koc University) MSc Thesis September 27 2018 57 123

Tree-stack LSTM Overview

Figure: Tree-stack LSTM architecture

We propose Tree-stack LSTM model with 4 components

β-LSTM
σ-LSTM
Action-LSTM
Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 58 123

Tree-stack LSTM

Input Representation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 59 123

Input Representation

Action and Dependency Relation Embeddings

Every action is represented with a continuous vector

Every dependency relation is represented with a continuous vector

Omer Kırnap (Koc University) MSc Thesis September 27 2018 60 123

Input Representation

We do not include an explicit feature extractor. We initialize the word representation by concatenating:

Character Based LSTMrsquos word vectors

Word Based BiLSTMrsquos context vectors

Part-of-speech (POS) vectors

Morph-feat vectors
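As a sketch, the resulting input vector is just the concatenation of those four embeddings (toy dimensions; the real sizes are hyperparameters):

```python
import numpy as np

def word_representation(word_vec, context_vec, pos_vec, feat_vec):
    """Concatenate the char-LSTM word vector, BiLSTM context vector,
    POS embedding, and morph-feat embedding into one input vector."""
    return np.concatenate([word_vec, context_vec, pos_vec, feat_vec])

x = word_representation(np.zeros(4), np.ones(4), np.full(2, 2.0), np.full(2, 3.0))
print(x.shape)  # (12,)
```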

Omer Kırnap (Koc University) MSc Thesis September 27 2018 61 123

Input Representation

Morph-feat Vectors

Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs IT It

Figure: Morph-feat Embeddings
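One plausible realization of the figure, assuming each Feature=Value pair has its own embedding and the pairs are summed (an illustrative sketch, not necessarily the thesis implementation):

```python
import numpy as np

def morph_feat_vector(feat_string, table):
    """Embed a UD morphological feature string such as
    'Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs'
    by summing one embedding per Feature=Value pair."""
    return sum(table[f] for f in feat_string.split("|"))

dim = 3
feats = ["Case=Nom", "Gender=Neut", "Number=Sing", "Person=3", "PronType=Prs"]
table = {f: np.full(dim, float(i)) for i, f in enumerate(feats)}  # toy embeddings
v = morph_feat_vector("Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs", table)
print(v)  # [10. 10. 10.]  (0 + 1 + 2 + 3 + 4 in each dimension)
```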

Omer Kırnap (Koc University) MSc Thesis September 27 2018 62 123

Tree-stack LSTM

Model Components:
1 β-LSTM
2 σ-LSTM
3 Action-LSTM
4 Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 63 123

β-LSTM

Figure: Tree-stack LSTM architecture

Omer Kırnap (Koc University) MSc Thesis September 27 2018 64 123

β-LSTM

Figure: Buffer's β-LSTM over words w_i, w_{i+1}, w_{i+2}

Omer Kırnap (Koc University) MSc Thesis September 27 2018 65 123

σ-LSTM

Figure: Tree-stack LSTM architecture

Omer Kırnap (Koc University) MSc Thesis September 27 2018 66 123

σ-LSTM

Figure: Stack's σ-LSTM over stack items s_i, s_{i+1}, s_{i+2}

Omer Kırnap (Koc University) MSc Thesis September 27 2018 67 123

Action-LSTM

Figure: Tree-stack LSTM architecture

Omer Kırnap (Koc University) MSc Thesis September 27 2018 68 123

Action-LSTM

Figure: Action-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 69 123

How are the components of tree-stack LSTM connected?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 70 123

Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 71 123

Tree-RNN (t-RNN)

Figure: t-RNN composing head word, dependent word, and dependency relation

w_head_new = tanh(W_rnn [w_head_old; d_l; w_dep] + b_rnn)    (1)
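Equation (1) as code, with toy dimensions (W_rnn and b_rnn are the learned t-RNN parameters; the actual embedding sizes are hyperparameters):

```python
import numpy as np

def trnn(w_head, d_rel, w_dep, W_rnn, b_rnn):
    """w_head_new = tanh(W_rnn [w_head; d_l; w_dep] + b_rnn):
    the head embedding is rewritten from the old head embedding,
    the dependency-relation embedding, and the dependent embedding."""
    x = np.concatenate([w_head, d_rel, w_dep])
    return np.tanh(W_rnn @ x + b_rnn)

dim, rel_dim = 4, 2
rng = np.random.default_rng(0)
W_rnn = rng.normal(size=(dim, 2 * dim + rel_dim))
b_rnn = np.zeros(dim)
new_head = trnn(rng.normal(size=dim), rng.normal(size=rel_dim),
                rng.normal(size=dim), W_rnn, b_rnn)
print(new_head.shape)  # (4,)
```

Because tanh keeps the output in the same range as the inputs, the new head embedding can replace the old one in the stack and be composed again by later transitions.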

Omer Kırnap (Koc University) MSc Thesis September 27 2018 72 123

Tree-RNN with

1 Left Transition
2 Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 73 123

Left Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 74 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: Each embedding is initialized by concatenating POS, language, and morph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 75 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: Stack's top LSTM is reduced
Omer Kırnap (Koc University) MSc Thesis September 27 2018 76 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: t-RNN calculates the new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 77 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: β-LSTM recalculates its hidden state based on the new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 78 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: Tree-stack LSTM is ready to predict the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 79 123
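The left_d and right_d definitions above, together with shift, can be sketched as pure state updates over (stack σ, buffer β, arc set A); this is an illustrative reading of the formal definitions, with arcs stored as (head, label, dependent) triples:

```python
def shift(stack, buffer, arcs):
    """shift: (σ, b|β, A) -> (σ|b, β, A): move the buffer front onto the stack."""
    return stack + [buffer[0]], buffer[1:], arcs

def left_arc(stack, buffer, arcs, d):
    """left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)}):
    the buffer front b becomes the head of the stack top s."""
    s, b = stack[-1], buffer[0]
    return stack[:-1], buffer, arcs | {(b, d, s)}

def right_arc(stack, buffer, arcs, d):
    """right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)}):
    the second stack item s becomes the head of the stack top t."""
    s, t = stack[-2], stack[-1]
    return stack[:-1], buffer, arcs | {(s, d, t)}

# "economic news had": shift, left(ATT), shift, left(SBJ)
state = (["root"], ["economic", "news", "had"], frozenset())
state = shift(*state)              # stack: root, economic
state = left_arc(*state, d="ATT")  # news -ATT-> economic
state = shift(*state)              # stack: root, news
state = left_arc(*state, d="SBJ")  # had -SBJ-> news
print(sorted(state[2]))
```

Each transition only rearranges the stack/buffer and adds at most one arc, which is what lets the σ-, β-, and action-LSTMs update incrementally.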

Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 80 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure Each embedding initiated by concatenating POS language andmorph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 81 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure Stackrsquos top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 82 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 83 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure σ-LSTM recalculates its hidden from new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 84 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure Tree-stack LSTM is ready to give new transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 85 123

Final overview of Tree-stack LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 86 123

Overview

1 IntroductionOverview of Dependency ParsingTransition Based Dependency Parsing

2 Related WorkLinear Models and their DrawbacksNeural Network Models

3 ModelLanguage ModelMLP ParserTree-stack LSTM Parser

4 ResultsMLP vs Tree-stack LSTMMorphological Feature EmbeddingsStatic vs Dynamic Oracle TrainingTransfer Learning

5 Conclusion6 Future Work amp Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 87 123

4 Results amp Comparisons

Omer Kırnap (Koc University) MSc Thesis September 27 2018 88 123

Results amp Comparisons

Dataset

Dependency parsing of 81

treebanks in 49 languages

All treebanks use standardized

annotation

17 universal

part-of-speech tags

37 universal dependency

relations

Koc-University ranked 7th out

of 33 participants (1st among

transition based parsers)

Dependency parsing of 82

treebanks in 57 languages

All treebanks use

standardized annotation

17 universal

part-of-speech tags

37 universal dependency

relations

Koc-University ranked 16th

out of 30 participants (2nd

among transition based

parsers)

CoNLL17 CoNLL181 Traintest split change 2 Annotation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 89 123

MLP vs Tree-stack LSTM

CoNLL 2018 committee released comparison results of CoNLL17 andCoNLL18 systems tested under the same test sets

Omer Kırnap (Koc University) MSc Thesis September 27 2018 90 123

MLP vs Tree-stack LSTM

2 possible problems of official comparison

1 If the annotation of the tree bank is improved the older parser ishandicapped

2 If the training-test split has changed and the old training data arenow in test data the old parser is favored undeservedly

Omer Kırnap (Koc University) MSc Thesis September 27 2018 91 123

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code MLP Tree-stackru taiga (10k) 5889 6055hu szeged (20k) 6621 6818tr imst (50k) 5678 5875ar padt (120k) 6783 6814en ewt (205k) 7487 7577cs cac (473k) 8339 8357

Tree-stack LSTM outperforms MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 92 123

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 93 123

MLP Parser

MLP

Figure Initial model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 94 123

Only Action LSTM

LSTM LSTM

Figure Only action LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 95 123

Only β-LSTM

LSTM LSTM LSTM

LSTM LSTM MLP

Figure Only β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 96 123

Only σ-LSTM

LSTM LSTM

LSTM LSTM MLP

Figure Only σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang Code MLP Only Action Only-β Only-σhu szeged 6621 6687 6694 6703sv lines 7112 7205 7217 7245tr imst 5712 5687 5702 5712ar padt 6783 6667 6689 6692

cs cac 8389 8223 8313 8317

en ewt 7554 7543 7556 7567

Table Comparison between MLP and rdquoOnlyrdquo models

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 99 123

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang Code without t-RNN with t-RNNno nynorsklia (3k) 5178 5333ru taiga (11k) 5913 6055gl treegal (15k) 6976 7045hu szeged (20k) 6612 6818sv lines (49k) 7404 7546tr imst (50k) 5812 5875

ar padt (120k) 6804 6814

en ewt (204k) 7487 7577

cs cac (473k) 8289 8357

cs pdt (1M) 8117 81164

t-RNN provides comparative advantage for low-resourcelanguages

Omer Kırnap (Koc University) MSc Thesis September 27 2018 100 123

Ablation Analysis

Overall results of ablation analysis

Lang MLP Only A Only-β Only-σ wot-RNN allhu szeged 6621 6687 6694 6703 6612 6818sv lines 7112 7205 7217 7404 7217 7546tr imst 5712 5687 5702 5712 5812 5875ar padt 6783 6667 6689 6692 6804 6814cs cac 8389 8223 8313 8317 8289 8357en ewt 7554 7543 7556 7567 7487 7577

Tree-stack LSTM beats other model variations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of Ablation Experiments

t-RNNrsquos performance contribution increases when the training sizedecreases

σ-LSTM provides more useful information independent from datasetsize

Interconnecting modelrsquos component with t-RNN makes tree-stackLSTM more powerful for low-resource languages (ranked 10th of alland 2nd among transition based parsers)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What does Morphological Feature Embedding provide

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental SettingsWe divide Conll18 UD dataset 22 into 4 parts based on number oftraining tokens for each language to better understand our contributions

Languages having less than 20k tokens

Languages having more than 20k less than 50k tokens

Languages having more than 50k less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having less than 20k training tokens

Lang code Morph-Feats no Morph-Feats of tokensno nynorsklia 5113 5333 3583

ru taiga 5832 6055 10479

sme giella 5278 5339 16385

la perseus 4993 516 18184

ug udt 5278 5339 19262

sl sst 4672 4877 19473

hu szeged 6623 6818 20166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having tokens in between 50k and100k

Lang code Morph-Feats no Morph-Feats of tokenssv lines 7218 7481 48325

fr sequoia 8436 8217 50543

en gum 7644 7534 53686

ko gsd 7374 7254 56687

eu bdt 7455 7332 72974

nl lassymal 767 758 75134

gl ctg 7902 79018 79327

lv lvtb 7233 7224 80666

id gsd 7576 7397 97531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having more than 100k trainingtokens

Lang code Morph-Feats no Morph-Feats of tokensfa seraji 8118 8112 121064

bg btb 8453 8455 124336

en ewt 7577 75682 204585

ar padt 6802 6814 223881

de gsd 7159 7132 263804

ca ancora 8589 85874 417587

es ancora 8499 8478 444617

cs cac 8357 8363 472608

cs pdt 8143 8212 1173282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle transitions using gold movesDynamic oracle transitions using predicted moves

In both cases logp of gold moves maximized

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning1 Using very limited data to train LM for word and context vectors and

use them to train a parser from scratch2 Using Facebookrsquos word vectors to train a parser [Bojanowski et al

2017]3 Using my own word and context vectors trained with different

language but from the same language family4 Applying transfer learning with a pre-trained parser

Language (1) (2) (3) (4)af afribooms not provided 7546 7743 7812kk ktb 2019 2231 2196 2386bxr bdt 764 976 993 898

kmr mg 2012 2257 2278 2339

Table LAS values for strategies (1) (2) (3) and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the mostbeneficial

From scratch LM training does not bring useful word and contextvectors

Our word and context vectors are still more useful than Facebookrsquos[Bojanowski et al 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition Based Parsers can only build projective trees. 6

6Figure from https://stp.lingfil.uu.se/~sarakurser/5LN455-2014/lectures/5LN455-F8.pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123
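A tree is projective when no two dependency arcs cross. A minimal check, assuming a head array with 1-based word positions and 0 for the root (arcs from the artificial root are ignored in this sketch):

```python
def is_projective(heads):
    """heads[i] is the head (1-based position) of word i+1; 0 marks the root.

    Returns False if any two arcs cross, i.e. there exist sorted arcs
    (a, b) and (c, d) with a < c < b < d.
    """
    arcs = [tuple(sorted((i + 1, h))) for i, h in enumerate(heads) if h != 0]
    for a, b in arcs:
        for c, d in arcs:
            if a < c < b < d:
                return False
    return True
```

For example, `is_projective([2, 0, 2])` is True, while a tree containing the crossing arcs (1, 3) and (2, 4) is not.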

Projective vs Non-projective

We compared our model with the best model for different projectivity ratios.

Language     Projectivity  Best (LAS)  Our (LAS)
grc perseus  90.7          79.39       55.03 (20)
eu bdt       95.13         84.22       74.13 (17)
hu szeged    97.8          82.66       68.18 (14)
da ddt       98.26         86.28       76.40 (17)
en gum       99.6          85.05       76.44 (15)
gl treegal   100           74.25       70.45 (10)
gl ctg       100           82.12       79.45 (14)

Table: Our model's performance gap decreases as the projectivity ratio increases. 7

7From the official results page and our projectivity table
Omer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusion:

We introduced "Context, Word, and Morph-feat" embeddings and showed their contribution in transition based dependency parsing.

Our Tree-stack LSTM outperformed the MLP by removing hand-crafted feature engineering.

Tree-stack LSTM performed better with low resource languages.

When the training dataset size increases, tree-stack LSTM loses its advantage.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization, morphological tagging, and dependency parsing performed better. Some also jointly train a language model together with pre-trained embeddings.

Attention Mechanism

Applying attention between the σ-LSTM, β-LSTM, or Action-LSTM states may bring a performance improvement.

Morphological Features

Finding different ways to represent morphological features.

Dynamic Oracle vs Beam Training

Although I tried both of them, I could not obtain a performance improvement. There may be convergence problems with our loss function, and other losses (e.g., CRF) may solve this problem.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap, Erenay Dayanık, and Deniz Yuret. 2018. Tree-stack LSTM in Transition Based Dependency Parsing. In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap, Berkay Furkan Onder, and Deniz Yuret. 2017. Parsing with Context Embeddings. In Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann, Carlos Gómez-Rodríguez, and Giorgio Satta. 2011. Dynamic programming algorithms for transition-based dependency parsers. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Volume 1. Association for Computational Linguistics, pages 673-682.

S. Kübler, R. McDonald, and J. Nivre. 2009. Dependency Parsing. Morgan & Claypool, US.

Chris Dyer, Miguel Ballesteros, Wang Ling, Austin Matthews, and Noah A. Smith. 2015. Transition-based dependency parsing with stack long short-term memory. CoRR abs/1505.08075.

Piotr Bojanowski, Edouard Grave, Armand Joulin, and Tomas Mikolov. 2017. Enriching word vectors with subword information. Transactions of the Association for Computational Linguistics 5: 135-146.

Tomas Mikolov, Ilya Sutskever, Kai Chen, Greg Corrado, and Jeffrey Dean. 2013. Distributed representations of words and phrases and their compositionality. In Advances in Neural Information Processing Systems.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123

Page 36: Transition Based Dependency Parsing with Deep Learning

Language Model (LM)

The LM is used to obtain Context and Word embeddings with two components:

Character Based LSTM extracts word vectors

Word Based BiLSTM extracts context vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 36 123

Language Model - Word vectors

Character based LSTM generates word Vectors

Figure Character LSTM from Kırnap et al 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 37 123

Language Model - Context Vectors

Word based BiLSTM generates Context Vectors

Figure Word BiLSTM from Kırnap et al 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 38 123

b. MLP Parser (CoNLL17)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 39 123

MLP Parser

MLP Parser consists of 4 components

Character Based LSTM extracts word vectors

Word Based BiLSTM extracts context vectors

Feature extractor describes current state

Decision module (MLP) decides the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 40 123

MLP Parser - Feature Extraction

Feature extractor describes current state

Figure: Kırnap et al. 2017
Omer Kırnap (Koc University) MSc Thesis September 27 2018 41 123

MLP Parser - Decision Module

Decision module (MLP) decides the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 42 123

Experiments & Dataset (MLP) CoNLL17

CoNLL17 Dataset

Dependency parsing of 81 treebanks in 49 languages

All treebanks use standardized annotation:
• 17 universal part-of-speech tags
• 37 universal dependency relations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 43 123

Experiments - Evaluation Metric

Labeled Attachment Score (LAS): the percentage of words correctly assigned both the correct syntactic head and the correct dependency label.

Example, for the fragment "Economic news had":

Gold tree (arcs labeled SBJ, ATT): LAS = 1
Pred 1 (arcs labeled PRED, OBJ): LAS = 0
Pred 2 (arcs labeled OBJ, ATT): LAS = (1/2) × 100 = 50

Omer Kırnap (Koc University) MSc Thesis September 27 2018 44 123
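The metric can be computed directly from per-word (head, label) pairs; a minimal sketch (the function name is illustrative):

```python
def las(gold, pred):
    """Labeled Attachment Score: percentage of words whose predicted
    (head, label) pair exactly matches the gold annotation."""
    assert len(gold) == len(pred)
    correct = sum(g == p for g, p in zip(gold, pred))
    return 100.0 * correct / len(gold)
```

For the slide's example, Pred 2 gets one of the two attachments fully right, so `las([(2, "ATT"), (3, "SBJ")], [(2, "ATT"), (3, "OBJ")])` returns 50.0.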

Experiments (MLP)

CoNLL 2017 Results (all treebanks LAS)

Ranked 1st among transition based parsers 5

5Source: CoNLL17 official results page
Omer Kırnap (Koc University) MSc Thesis September 27 2018 45 123

Contributions in CoNLL17

Omer Kırnap (Koc University) MSc Thesis September 27 2018 46 123

Context and Word Embeddings

Relative contributions of part-of-speech (p), word vector (v), and context vector (c):

Feats   Hungarian  En-ParTUT  Latvian
p       63.6       76.6       55.9
v       73.5       75.9       63
c       72.2       76         63.5
v-c     76         79         67.6
p-c     78         82.5       70.6
p-v     76.6       80.8       67.7
p-fb    74.7       79.7       66.3
p-v-c   79.3       83.2       74.2

Omer Kırnap (Koc University) MSc Thesis September 27 2018 47 123

Context and Word Embeddings

Relative contributions of part-of-speech (p), word vector (v), and context vector (c):

Feats   Hungarian  En-ParTUT  Latvian
p       63.6       76.6       55.9
v       73.5       75.9       63
c       72.2       76         63.5
v-c     76         79         67.6
p-c     78         82.5       70.6
p-v     76.6       80.8       67.7
p-fb    74.7       79.7       66.3
p-v-c   79.3       83.2       74.2

Context vectors provide an independent contribution on top of POS tags.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 48 123

Context and Word embeddings

Feats   Hungarian  En-ParTUT  Latvian
p       63.6       76.6       55.9
v       73.5       75.9       63
c       72.2       76         63.5
v-c     76         79         67.6
p-c     78         82.5       70.6
p-v     76.6       80.8       67.7
p-fb    74.7       79.7       66.3
p-v-c   79.3       83.2       74.2

Our BiLSTM language model word vectors perform better than FB vectors.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 49 123

Context and Word embeddings

Feats   Hungarian  En-ParTUT  Latvian
p       63.6       76.6       55.9
v       73.5       75.9       63
c       72.2       76         63.5
v-c     76         79         67.6
p-c     78         82.5       70.6
p-v     76.6       80.8       67.7
p-fb    74.7       79.7       66.3
p-v-c   79.3       83.2       74.2

Both POS tags and context vectors have significant contributions on top of word vectors.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 50 123

Issues with MLP

However

Choosing the correct state representation of the parser still remains critical.

We are unable to represent the whole parsing history with feature extraction.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 51 123

Solution

Find a recurrent architecture that can summarize the parsing history as well as the word sequences in the buffer and stack.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 52 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to Universal Dependencies:

CoNLL17
• Koc-University team with MLP Parser using Context Embeddings

CoNLL18
• KParse team with Tree-stack LSTM Parser using Context and Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 53 123

c. Tree-stack LSTM Parser (CoNLL18)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 54 123

Related Work - Stack LSTM

Figure Stack LSTM [Dyer et al 2015]

Represent each component (σ, β, A) with an LSTM.
Modify the head word's embedding with the dependent's embedding.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 55 123

Problems with Stack LSTM

They only modify the stack's word embeddings.

Hidden states of the LSTMs are not updated unless a reduce occurs.

Actions are not explicitly represented.

They only used word2vec embeddings [Mikolov et al. 2013].

Omer Kırnap (Koc University) MSc Thesis September 27 2018 56 123

Our solution

We propose

Context embeddings should improve parsing accuracy

Dependency relations should be explicitly represented

Morphological Features of a word may enhance parsing accuracy

Omer Kırnap (Koc University) MSc Thesis September 27 2018 57 123

Tree-stack LSTM Overview

Figure: Tree-stack LSTM architecture (σ-LSTM, β-LSTM, and Action-LSTM outputs concatenated and fed to an MLP; the t-RNN composes head word, dependent word, and dependency relation embeddings)

We propose the Tree-stack LSTM model with 4 components:

1. β-LSTM
2. σ-LSTM
3. Action-LSTM
4. Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 58 123

Tree-stack LSTM

Input Representation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 59 123

Input Representation

Action and Dependency Relation Embeddings

Every action is represented with a continuous vector.

Every dependency relation is represented with a continuous vector.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 60 123

Input Representation

We do not include an explicit feature extractor. We initialize the word representation by concatenating:

Character Based LSTMrsquos word vectors

Word Based BiLSTMrsquos context vectors

Part-of-speech (POS) vectors

Morph-feat vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 61 123

Input Representation

Morph-feat Vectors

Example: the word "It" with FEATS Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs

Figure: Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 62 123
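A morph-feat embedding of this kind can be sketched as a lookup over the `Key=Value` pairs of a UD FEATS string. Here a hashed pseudo-random vector stands in for a learned embedding table; the hashing trick is purely illustrative, not the thesis implementation:

```python
import hashlib

def morph_feat_vector(feats, dim=8):
    """Sum of one vector per `Key=Value` pair in a UD FEATS string.

    A real model would look each pair up in a trained embedding table;
    here the vector is derived deterministically from an MD5 digest.
    """
    vec = [0.0] * dim
    if feats in ("", "_"):          # UD uses "_" for "no features"
        return vec
    for pair in feats.split("|"):
        digest = hashlib.md5(pair.encode("utf-8")).digest()
        for i in range(dim):
            vec[i] += digest[i] / 255.0 - 0.5
    return vec
```

Summing per-feature vectors keeps the representation a fixed size regardless of how many morphological features a word carries.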

Tree-stack LSTM

Model Components:
1. β-LSTM
2. σ-LSTM
3. Action-LSTM
4. Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 63 123

β-LSTM

Figure: Tree-stack LSTM architecture with the β-LSTM component in focus

Omer Kırnap (Koc University) MSc Thesis September 27 2018 64 123

β-LSTM

Figure: Buffer's β-LSTM over words wi, wi+1, wi+2

Omer Kırnap (Koc University) MSc Thesis September 27 2018 65 123

σ-LSTM

Figure: Tree-stack LSTM architecture with the σ-LSTM component in focus

Omer Kırnap (Koc University) MSc Thesis September 27 2018 66 123

σ-LSTM

Figure: Stack's σ-LSTM over stack items si, si+1, si+2

Omer Kırnap (Koc University) MSc Thesis September 27 2018 67 123

Action-LSTM

Figure: Tree-stack LSTM architecture with the Action-LSTM component in focus

Omer Kırnap (Koc University) MSc Thesis September 27 2018 68 123

Action-LSTM

Figure: Action-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 69 123

How are the components of tree-stack LSTM connected?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 70 123

Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 71 123

Tree-RNN (t-RNN)

Figure: t-RNN composing the head word, dependent word, and dependency relation embeddings

w_head_new = tanh(W_rnn ∗ [w_head_old; d_l; w_dep] + b_rnn)    (1)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 72 123
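Equation (1) is a single affine map over the concatenated embeddings followed by a tanh. A pure-Python sketch with toy dimensions (names are illustrative):

```python
import math

def trnn_compose(w_head, d_rel, w_dep, W, b):
    """Eq. (1): tanh(W_rnn * [w_head; d_rel; w_dep] + b_rnn).

    w_head, d_rel, w_dep are plain lists; W has shape
    len(b) x (len(w_head) + len(d_rel) + len(w_dep)).
    """
    x = w_head + d_rel + w_dep      # list concatenation = vector concat
    return [math.tanh(sum(W[i][j] * x[j] for j in range(len(x))) + b[i])
            for i in range(len(b))]
```

The output replaces the head word's old embedding, so the head carries information about its attached dependents into later transitions.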

Tree-RNN with

1. Left Transition
2. Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 73 123

Left Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 74 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: Each embedding is initiated by concatenating POS, language, and morph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 75 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: Stack's top LSTM is reduced
Omer Kırnap (Koc University) MSc Thesis September 27 2018 76 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: t-RNN calculates the new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 77 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: β-LSTM recalculates its hidden state based on the new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 78 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: Tree-stack LSTM is ready to make a new transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 79 123

Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 80 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: Each embedding is initiated by concatenating POS, language, and morph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 81 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: Stack's top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 82 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: t-RNN calculates the new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 83 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: σ-LSTM recalculates its hidden state from the new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 84 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: Tree-stack LSTM is ready to make a new transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 85 123
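The two transitions above, plus shift, can be sketched as operations on a stack, a buffer, and an arc set (matching the arc-hybrid style of the formulas; variable names are illustrative):

```python
def shift(stack, buffer):
    """Move the buffer front onto the stack."""
    stack.append(buffer.pop(0))

def left(stack, buffer, arcs, d):
    """left_d(σ|s, b|β, A) => (σ, b|β, A ∪ {(b, d, s)}):
    the buffer front b becomes the head of the popped stack top s."""
    s = stack.pop()
    arcs.add((buffer[0], d, s))

def right(stack, buffer, arcs, d):
    """right_d(σ|s|t, β, A) => (σ|s, β, A ∪ {(s, d, t)}):
    the second stack item s becomes the head of the popped top t."""
    t = stack.pop()
    arcs.add((stack[-1], d, t))
```

For "Economic news had" (words 1-3, with news the head of Economic and had the head of news), the sequence shift, left(ATT), shift, left(SBJ), shift leaves the root word 3 alone on the stack with both gold arcs built.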

Final overview of Tree-stack LSTM

Figure: Final Tree-stack LSTM architecture (σ-LSTM, β-LSTM, and Action-LSTM outputs concatenated and fed to an MLP; the t-RNN composes head word, dependent word, and dependency relation embeddings)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 86 123

Overview

1 Introduction
  Overview of Dependency Parsing
  Transition Based Dependency Parsing

2 Related Work
  Linear Models and their Drawbacks
  Neural Network Models

3 Model
  Language Model
  MLP Parser
  Tree-stack LSTM Parser

4 Results
  MLP vs Tree-stack LSTM
  Morphological Feature Embeddings
  Static vs Dynamic Oracle Training
  Transfer Learning

5 Conclusion

6 Future Work & Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 87 123

4. Results & Comparisons

Omer Kırnap (Koc University) MSc Thesis September 27 2018 88 123

Results & Comparisons

Dataset

CoNLL17:
• Dependency parsing of 81 treebanks in 49 languages
• All treebanks use standardized annotation (17 universal part-of-speech tags, 37 universal dependency relations)
• Koc-University ranked 7th out of 33 participants (1st among transition based parsers)

CoNLL18:
• Dependency parsing of 82 treebanks in 57 languages
• All treebanks use standardized annotation (17 universal part-of-speech tags, 37 universal dependency relations)
• Koc-University ranked 16th out of 30 participants (2nd among transition based parsers)

Changes from CoNLL17 to CoNLL18: 1. Train/test split change 2. Annotation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 89 123

MLP vs Tree-stack LSTM

The CoNLL 2018 committee released comparison results of CoNLL17 and CoNLL18 systems tested under the same test sets.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 90 123

MLP vs Tree-stack LSTM

2 possible problems of the official comparison:

1. If the annotation of the treebank has improved, the older parser is handicapped.

2. If the training-test split has changed and the old training data are now in the test data, the old parser is favored undeservedly.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 91 123

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code        MLP    Tree-stack
ru taiga (10k)   58.89  60.55
hu szeged (20k)  66.21  68.18
tr imst (50k)    56.78  58.75
ar padt (120k)   67.83  68.14
en ewt (205k)    74.87  75.77
cs cac (473k)    83.39  83.57

Tree-stack LSTM outperforms MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 92 123

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 93 123

MLP Parser

Figure: Initial model (MLP only)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 94 123

Only Action LSTM

Figure: Only action LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 95 123

Only β-LSTM

Figure: Only β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 96 123

Only σ-LSTM

Figure: Only σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang Code  MLP    Only Action  Only-β  Only-σ
hu szeged  66.21  66.87        66.94   67.03
sv lines   71.12  72.05        72.17   72.45
tr imst    57.12  56.87        57.02   57.12
ar padt    67.83  66.67        66.89   66.92
cs cac     83.89  82.23        83.13   83.17
en ewt     75.54  75.43        75.56   75.67

Table: Comparison between MLP and "Only" models

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN

Figure: Tree-stack LSTM architecture (σ-LSTM, β-LSTM, Action-LSTM, and t-RNN feeding a Concat layer and an MLP)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 99 123

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN:

Lang Code           without t-RNN  with t-RNN
no nynorsklia (3k)  51.78          53.33
ru taiga (11k)      59.13          60.55
gl treegal (15k)    69.76          70.45
hu szeged (20k)     66.12          68.18
sv lines (49k)      74.04          75.46
tr imst (50k)       58.12          58.75
ar padt (120k)      68.04          68.14
en ewt (204k)       74.87          75.77
cs cac (473k)       82.89          83.57
cs pdt (1M)         81.17          81.164

t-RNN provides comparative advantage for low-resourcelanguages

Omer Kırnap (Koc University) MSc Thesis September 27 2018 100 123

Ablation Analysis

Overall results of ablation analysis

Lang       MLP    Only A  Only-β  Only-σ  w/o t-RNN  all
hu szeged  66.21  66.87   66.94   67.03   66.12      68.18
sv lines   71.12  72.05   72.17   74.04   72.17      75.46
tr imst    57.12  56.87   57.02   57.12   58.12      58.75
ar padt    67.83  66.67   66.89   66.92   68.04      68.14
cs cac     83.89  82.23   83.13   83.17   82.89      83.57
en ewt     75.54  75.43   75.56   75.67   74.87      75.77

Tree-stack LSTM beats other model variations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of Ablation Experiments

t-RNN's performance contribution increases when the training size decreases.

σ-LSTM provides more useful information independent of dataset size.

Interconnecting the model's components with t-RNN makes tree-stack LSTM more powerful for low-resource languages (ranked 10th of all and 2nd among transition based parsers).

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What does Morphological Feature Embedding provide

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental Settings: We divide the CoNLL18 UD dataset into 4 parts based on the number of training tokens for each language, to better understand our contributions:

Languages having less than 20k tokens

Languages having more than 20k, less than 50k tokens

Languages having more than 50k, less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having less than 20k training tokens:

Lang code      Morph-Feats  no Morph-Feats  # of tokens
no nynorsklia  51.13        53.33           3583
ru taiga       58.32        60.55           10479
sme giella     52.78        53.39           16385
la perseus     49.93        51.6            18184
ug udt         52.78        53.39           19262
sl sst         46.72        48.77           19473
hu szeged      66.23        68.18           20166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having between 50k and 100k tokens:

Lang code      Morph-Feats  no Morph-Feats  # of tokens
sv lines       72.18        74.81           48325
fr sequoia     84.36        82.17           50543
en gum         76.44        75.34           53686
ko gsd         73.74        72.54           56687
eu bdt         74.55        73.32           72974
nl lassysmall  76.7         75.8            75134
gl ctg         79.02        79.018          79327
lv lvtb        72.33        72.24           80666
id gsd         75.76        73.97           97531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having more than 100k training tokens:

Lang code  Morph-Feats  no Morph-Feats  # of tokens
fa seraji  81.18        81.12           121064
bg btb     84.53        84.55           124336
en ewt     75.77        75.682          204585
ar padt    68.02        68.14           223881
de gsd     71.59        71.32           263804
ca ancora  85.89        85.874          417587
es ancora  84.99        84.78           444617
cs cac     83.57        83.63           472608
cs pdt     81.43        82.12           1173282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle transitions using gold movesDynamic oracle transitions using predicted moves

In both cases logp of gold moves maximized

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning1 Using very limited data to train LM for word and context vectors and

use them to train a parser from scratch2 Using Facebookrsquos word vectors to train a parser [Bojanowski et al

2017]3 Using my own word and context vectors trained with different

language but from the same language family4 Applying transfer learning with a pre-trained parser

Language (1) (2) (3) (4)af afribooms not provided 7546 7743 7812kk ktb 2019 2231 2196 2386bxr bdt 764 976 993 898

kmr mg 2012 2257 2278 2339

Table LAS values for strategies (1) (2) (3) and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the mostbeneficial

From scratch LM training does not bring useful word and contextvectors

Our word and context vectors are still more useful than Facebookrsquos[Bojanowski et al 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition Based Parser can only build projective trees 6

6Figure fromhttpstplingfiluuse sarakurser5LN455-2014lectures5LN455-F8pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123

Projective vs Non-projective

We compared our model with the best model for different projectivityratios

Language Projectiviy Best (LAS) Our (LAS)

grc perseus 907 7939 5503 (20)

eu bdt 9513 8422 7413 (17)

hu szeged 978 8266 6818 (14)

da ddt 9826 8628 7640 (17)

en gum 996 8505 7644 (15)

gl treegal 100 7425 7045 (10)

gl ctg 100 8212 7945 (14)

Table Our models performance gap decreases as the projectivity ratio increases

7

7From official results page and our projectivity tableOmer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusionWe introduced ldquoContext Word and Morph-featrdquo embeddings and showedtheir contribution in transition based dependency parsing

Our Tree-stack LSTM outperformed MLP by removing hand-craftedfeature engineering

Tree-stack LSTM performed better with low resource languages

When the training dataset size increases tree-stack LSTM losses itsadvantage

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization morphological taggingand dependency parsing performed better Some are also jointly trained alanguage model together with pre-trained embeddings

Attention Mechanism

Applying attention in between σ-LSTM states or β-LSTM orAction-LSTM may bring performance improvement

Morphological Features

Finding different way to represent morphological features

Dynamic Oracle vs Beam Training

Although I tried both of them I could not obtain performanceimprovement There may be convergence problems with our loss functionand another losses (CRF) may solve this problem

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap Erenay Dayanık and Deniz Yuret 2018 Tree-stackLSTM in Transition Based Dependency Parsing In Proceedings ofthe CoNLL 2018 Shared Task Multilingual Parsing from Raw Text toUniversal Dependencies

Omer Kırnap Berkay Furkan Onder and Deniz Yuret 2017 Parsingwith Context Embeddings In Proceedings of the CoNLL 2017 SharedTask Multilingual Parsing from Raw Text to Universal Dependencies

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann Carlos Gomez-Rodriguez and Giorgio Satta 2011Dynamic programming algorithms for transition-based dependencyparsers In Proceedings of the 49th Annual Meeting of theAssociation for Computational Linguistics Human LanguageTechnologies-Volume 1 Association for Computational Linguisticspages 673682

S Kbler R McDonald and J Nivre 2009 Dependency parsingMorgan amp Claypool US

Chris Dyer Miguel Ballesteros Wang Ling Austin Matthews andNoah A Smith 2015 Transition based dependency parsing withstack long-short term memory CoRR abs150508075

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123

  • Introduction
    • Overview of Dependency Parsing
    • Transition Based Dependency Parsing
      • Related Work
        • Linear Models and their Drawbacks
        • Neural Network Models
          • Model
            • Language Model
            • MLP Parser
            • Tree-stack LSTM Parser
              • Results
                • MLP vs Tree-stack LSTM
                • Morphological Feature Embeddings
                • Static vs Dynamic Oracle Training
                • Transfer Learning
                  • Conclusion
                  • Future Work amp Discussions
Page 37: Transition Based Dependency Parsing with Deep LearningTransition Based Dependency Parsing with Deep Learning Omer K rnap Ko˘c University okirnap@ku.edu.tr September 27, 2018 Omer

Language Model - Word vectors

Character based LSTM generates word Vectors

Figure Character LSTM from Kırnap et al 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 37 123

Language Model - Context Vectors

Word based BiLSTM generates Context Vectors

Figure Word BiLSTM from Kırnap et al 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 38 123

b MLP Parser (CoNLL17)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 39 123

MLP Parser

MLP Parser consists of 4 components

Character Based LSTM extracts word vectors

Word Based BiLSTM extracts context vectors

Feature extractor describes current state

Decision module (MLP) decides the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 40 123

MLP Parser - Feature Extraction

Feature extractor describes current state

Figure Kırnap et al 2017Omer Kırnap (Koc University) MSc Thesis September 27 2018 41 123

MLP Parser - Decision Module

Decision module (MLP) decides the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 42 123

Experiments amp Dataset (MLP) CoNLL17

CoNLL17 Dataset

Dependency parsing of 81 treebanks in 49 languages

All treebanks use standardized annotationbull 17 universal part-of-speech tagsbull 37 universal dependency relations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 43 123

Experiments - Evaluation Metric

Labeled Attachment Score (LAS)The percentage of words correctly assigned both the correct syntactic headand the correct dependency label

Economic news hadGold Tree LAS 1

SBJATT

Economic news had

PREDOBJPred 1 LAS 0

Economic news had

OBJATTPred 2 LAS (frac12)100

Omer Kırnap (Koc University) MSc Thesis September 27 2018 44 123

Experiments (MLP)

CoNLL 2017 Results (all treebanks LAS)

Ranked 1st among transition based parsers 5

5Source CoNLL17 official results pageOmer Kırnap (Koc University) MSc Thesis September 27 2018 45 123

Contributions in CoNLL17

Omer Kırnap (Koc University) MSc Thesis September 27 2018 46 123

Context and Word Embeddings

Relative contributions of part-of-speech (p) word vector (v)context vector (c)

Feats Hungarian En-ParTUT Latvianp 636 766 559

v 735 759 63

c 722 76 635

v-c 76 79 676

p-c 78 825 706

p-v 766 808 677

p-fb 747 797 663

p-v-c 793 832 742

Omer Kırnap (Koc University) MSc Thesis September 27 2018 47 123

Context and Word Embeddings

Relative contributions of part-of-speech (p) word vector (v)context vector (c)

Feats Hungarian En-ParTUT Latvianp 636 766 559

v 735 759 63

c 722 76 635

v-c 76 79 676

p-c 78 825 706

p-v 766 808 677

p-fb 747 797 663

p-v-c 793 832 742

Context vectors provide independent contribution on top ofPOS tags

Omer Kırnap (Koc University) MSc Thesis September 27 2018 48 123

Context and Word embeddings

Feats Hungarian En-ParTUT Latvianp 636 766 559

v 735 759 63

c 722 76 635

v-c 76 79 676

p-c 78 825 706

p-v 766 808 677p-fb 747 797 663

p-v-c 793 832 742

Our BiLSTM language model word vectors perform betterthan FB vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 49 123

Context and Word embeddings

Feats Hungarian En-ParTUT Latvianp 636 766 559

v 735 759 63

c 722 76 635

v-c 76 79 676

p-c 78 825 706

p-v 766 808 677

p-fb 747 797 663

p-v-c 793 832 742

Both POS tags and context vectors have significantcontributions on top of word vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 50 123

Issues with MLP

However

Choosing correct state of parser still remains critical

We are unable to represent whole parsing history with featureextracting

Omer Kırnap (Koc University) MSc Thesis September 27 2018 51 123

Solution

Find a recurrent architecture such that it can summarize the parsinghistory as well as word sequences in a buffer and stack

Omer Kırnap (Koc University) MSc Thesis September 27 2018 52 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to UniversalDependencies

CoNLL17

bull Koc-University team with MLP Parser using Context Embeddings

CoNLL18

• KParse team with Tree-stack LSTM Parser using Context and Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 53 123

c Tree-stack LSTM Parser (CoNLL18)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 54 123

Related Work - Stack LSTM

Figure Stack LSTM [Dyer et al 2015]

Represent each component (σ, β, A) with an LSTM. Modify the head word's embedding with the dependent's embedding.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 55 123

Problems with Stack LSTM

They only modify the stack's word embeddings

Hidden states of LSTMs are not updated unless a reduce occurs

Actions are not explicitly represented

They only used word2vec embeddings [Mikolov et al 2013]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 56 123

Our solution

We propose

Context embeddings should improve parsing accuracy

Dependency relations should be explicitly represented

Morphological Features of a word may enhance parsing accuracy

Omer Kırnap (Koc University) MSc Thesis September 27 2018 57 123

Tree-stack LSTM Overview

[Figure: Tree-stack LSTM overview. The hidden states of the β-LSTM, σ-LSTM and Action-LSTM are concatenated and fed to an MLP; the t-RNN composes head word, dependent word and dependency relation.]

We propose the Tree-stack LSTM model with 4 components:

1 β-LSTM
2 σ-LSTM
3 Action-LSTM
4 Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 58 123

Tree-stack LSTM

Input Representation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 59 123

Input Representation

Action and Dependency Relation Embeddings

Every action is represented with a continuous vector

Every dependency relation is represented with a continuous vector

Omer Kırnap (Koc University) MSc Thesis September 27 2018 60 123

Input Representation

We do not include an explicit feature extractor. We initiate the word representation by concatenating:

Character-based LSTM's word vectors

Word-based BiLSTM's context vectors

Part-of-speech (POS) vectors

Morph-feat vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 61 123

Input Representation

Morph-feat Vectors

Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs IT It

Figure Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 62 123
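The FEATS string in the figure can be split into Feature=Value tokens and mapped to a single morph-feat vector. A toy sketch under stated assumptions (random, untrained embeddings grown lazily; `MorphFeatEmbedder` is illustrative, not the thesis code):

```python
# Toy sketch: turn a UD FEATS string like
# "Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs"
# into one vector by averaging per-feature embeddings.
import random

random.seed(0)
DIM = 8  # illustrative embedding size

def parse_feats(feats):
    """Split 'Case=Nom|Gender=Neut|...' into Feature=Value tokens."""
    return feats.split("|") if feats and feats != "_" else []

class MorphFeatEmbedder:
    def __init__(self, dim=DIM):
        self.dim = dim
        self.table = {}  # Feature=Value -> vector, grown on first use

    def vector(self, fv):
        if fv not in self.table:
            self.table[fv] = [random.uniform(-0.1, 0.1) for _ in range(self.dim)]
        return self.table[fv]

    def embed(self, feats):
        toks = parse_feats(feats)
        if not toks:
            return [0.0] * self.dim  # e.g. FEATS column is "_"
        cols = zip(*(self.vector(t) for t in toks))
        return [sum(c) / len(toks) for c in cols]
```

Averaging is one simple choice; concatenation or an LSTM over the feature sequence are equally plausible designs.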

Tree-stack LSTM

Model Components
1 β-LSTM
2 σ-LSTM
3 Action-LSTM
4 Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 63 123

β-LSTM

[Figure: Tree-stack LSTM overview diagram]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 64 123

β-LSTM

[Figure: The buffer's β-LSTM running over words w_i, w_i+1, w_i+2]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 65 123

σ-LSTM

[Figure: Tree-stack LSTM overview diagram]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 66 123

σ-LSTM

[Figure: The stack's σ-LSTM running over stack items s_i, s_i+1, s_i+2]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 67 123

Action-LSTM

[Figure: Tree-stack LSTM overview diagram]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 68 123

Action-LSTM

[Figure: The Action-LSTM running over past transition embeddings]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 69 123

How are the components of the tree-stack LSTM connected?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 70 123

Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 71 123

Tree-RNN (t-RNN)

[Figure: t-RNN composing dependent word, dependency relation and head word]

w_head_new = tanh(W_rnn * [w_head_old; d_l; w_dep] + b_rnn)   (1)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 72 123
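Eq. (1) can be sketched in plain Python; W_rnn and b_rnn below are illustrative random parameters standing in for trained weights, not the thesis implementation:

```python
# Sketch of the t-RNN composition: the new head embedding is
# tanh(W_rnn * [w_head; d_l; w_dep] + b_rnn), where [a; b; c] is
# vector concatenation.
import math
import random

random.seed(1)
D = 4  # per-vector size; the concatenated input has size 3*D

W_rnn = [[random.uniform(-0.5, 0.5) for _ in range(3 * D)] for _ in range(D)]
b_rnn = [0.0] * D

def t_rnn(w_head, d_label, w_dep):
    x = w_head + d_label + w_dep  # concatenation [w_head; d_l; w_dep]
    return [math.tanh(sum(wi * xi for wi, xi in zip(row, x)) + b)
            for row, b in zip(W_rnn, b_rnn)]
```

The output replaces the head word's embedding, so repeated attachments accumulate dependent information in the head.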

Tree-RNN with

1 Left Transition
2 Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 73 123

Left Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 74 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

[Figure: Left transition. Each embedding is initiated by concatenating POS, language and morph-feat embeddings.]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 75 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

[Figure: Left transition. The stack's top LSTM is reduced.]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 76 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

[Figure: Left transition. The t-RNN calculates the new head embedding.]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 77 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

[Figure: Left transition. The β-LSTM recalculates its hidden state from the new input.]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 78 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

[Figure: Left transition. The tree-stack LSTM is ready for the next transition.]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 79 123

Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 80 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

[Figure: Right transition. Each embedding is initiated by concatenating POS, language and morph-feat embeddings.]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 81 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

[Figure: Right transition. The stack's top LSTM is reduced.]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 82 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

[Figure: Right transition. The t-RNN calculates the new head embedding.]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 83 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

[Figure: Right transition. The σ-LSTM recalculates its hidden state from the new input.]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 84 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

[Figure: Right transition. The tree-stack LSTM is ready for the next transition.]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 85 123
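The two transitions defined on the preceding slides can be sketched as operations on a (stack, buffer, arcs) state. This is an illustrative toy, not the thesis code; arcs are stored as (head, label, dependent) triples:

```python
# Toy sketch of the left/right transitions:
# left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})   head = buffer front
# right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})  head = next stack item
def left_arc(stack, buffer, arcs, d):
    s = stack.pop()               # dependent: stack top
    arcs.add((buffer[0], d, s))   # head: buffer front (stays in place)

def right_arc(stack, buffer, arcs, d):
    t = stack.pop()               # dependent: stack top
    arcs.add((stack[-1], d, t))   # head: the new stack top
```

Each transition shrinks the stack by one and adds exactly one labeled arc, which is why a sentence of n words is parsed in a bounded number of moves.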

Final overview of Tree-stack LSTM

[Figure: Tree-stack LSTM overview diagram]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 86 123

Overview

1 Introduction
   Overview of Dependency Parsing
   Transition Based Dependency Parsing

2 Related Work
   Linear Models and their Drawbacks
   Neural Network Models

3 Model
   Language Model
   MLP Parser
   Tree-stack LSTM Parser

4 Results
   MLP vs Tree-stack LSTM
   Morphological Feature Embeddings
   Static vs Dynamic Oracle Training
   Transfer Learning

5 Conclusion
6 Future Work & Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 87 123

4 Results & Comparisons

Omer Kırnap (Koc University) MSc Thesis September 27 2018 88 123

Results & Comparisons

Dataset

CoNLL17
  Dependency parsing of 81 treebanks in 49 languages
  All treebanks use standardized annotation
    17 universal part-of-speech tags
    37 universal dependency relations
  Koc-University ranked 7th out of 33 participants (1st among transition based parsers)

CoNLL18
  Dependency parsing of 82 treebanks in 57 languages
  All treebanks use standardized annotation
    17 universal part-of-speech tags
    37 universal dependency relations
  Koc-University ranked 16th out of 30 participants (2nd among transition based parsers)

Differences from CoNLL17 to CoNLL18: 1 Train/test split change 2 Annotation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 89 123

MLP vs Tree-stack LSTM

The CoNLL 2018 committee released comparison results of CoNLL17 and CoNLL18 systems tested under the same test sets

Omer Kırnap (Koc University) MSc Thesis September 27 2018 90 123

MLP vs Tree-stack LSTM

2 possible problems of the official comparison:

1 If the annotation of the treebank is improved, the older parser is handicapped

2 If the training-test split has changed and the old training data are now in the test data, the old parser is favored undeservedly

Omer Kırnap (Koc University) MSc Thesis September 27 2018 91 123

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code         MLP     Tree-stack
ru_taiga (10k)    58.89   60.55
hu_szeged (20k)   66.21   68.18
tr_imst (50k)     56.78   58.75
ar_padt (120k)    67.83   68.14
en_ewt (205k)     74.87   75.77
cs_cac (473k)     83.39   83.57

Tree-stack LSTM outperforms MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 92 123

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 93 123

MLP Parser

[Figure: Initial MLP model]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 94 123

Only Action LSTM

[Figure: Only Action-LSTM model]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 95 123

Only β-LSTM

[Figure: Only β-LSTM model]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 96 123

Only σ-LSTM

[Figure: Only σ-LSTM model]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang Code   MLP     Only-Action  Only-β  Only-σ
hu_szeged   66.21   66.87        66.94   67.03
sv_lines    71.12   72.05        72.17   72.45
tr_imst     57.12   56.87        57.02   57.12
ar_padt     67.83   66.67        66.89   66.92
cs_cac      83.89   82.23        83.13   83.17
en_ewt      75.54   75.43        75.56   75.67

Table: Comparison between MLP and "Only" models

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN

[Figure: Tree-stack LSTM overview diagram]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 99 123

Ablation of t-RNN

Comparison of tree-stack LSTMs with and without t-RNN

Lang Code            without t-RNN   with t-RNN
no_nynorsklia (3k)   51.78           53.33
ru_taiga (11k)       59.13           60.55
gl_treegal (15k)     69.76           70.45
hu_szeged (20k)      66.12           68.18
sv_lines (49k)       74.04           75.46
tr_imst (50k)        58.12           58.75
ar_padt (120k)       68.04           68.14
en_ewt (204k)        74.87           75.77
cs_cac (473k)        82.89           83.57
cs_pdt (1M)          81.17           81.164

t-RNN provides a comparative advantage for low-resource languages

Omer Kırnap (Koc University) MSc Thesis September 27 2018 100 123

Ablation Analysis

Overall results of ablation analysis

Lang        MLP     Only-A  Only-β  Only-σ  w/o t-RNN   all
hu_szeged   66.21   66.87   66.94   67.03   66.12       68.18
sv_lines    71.12   72.05   72.17   74.04   72.17       75.46
tr_imst     57.12   56.87   57.02   57.12   58.12       58.75
ar_padt     67.83   66.67   66.89   66.92   68.04       68.14
cs_cac      83.89   82.23   83.13   83.17   82.89       83.57
en_ewt      75.54   75.43   75.56   75.67   74.87       75.77

Tree-stack LSTM beats other model variations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of Ablation Experiments

t-RNN's performance contribution increases when the training size decreases

σ-LSTM provides more useful information independent of dataset size

Interconnecting the model's components with t-RNN makes tree-stack LSTM more powerful for low-resource languages (ranked 10th overall and 2nd among transition based parsers)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What does the Morphological Feature Embedding provide?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental Settings
We divide the CoNLL18 UD dataset into 4 parts based on the number of training tokens for each language, to better understand our contributions:

Languages having less than 20k tokens

Languages having more than 20k and less than 50k tokens

Languages having more than 50k and less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having less than 20k training tokens

Lang code       Morph-Feats   no Morph-Feats   # of tokens
no_nynorsklia   51.13         53.33            3583
ru_taiga        58.32         60.55            10479
sme_giella      52.78         53.39            16385
la_perseus      49.93         51.60            18184
ug_udt          52.78         53.39            19262
sl_sst          46.72         48.77            19473
hu_szeged       66.23         68.18            20166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having between 50k and 100k training tokens

Lang code       Morph-Feats   no Morph-Feats   # of tokens
sv_lines        72.18         74.81            48325
fr_sequoia      84.36         82.17            50543
en_gum          76.44         75.34            53686
ko_gsd          73.74         72.54            56687
eu_bdt          74.55         73.32            72974
nl_lassysmall   76.7          75.8             75134
gl_ctg          79.02         79.018           79327
lv_lvtb         72.33         72.24            80666
id_gsd          75.76         73.97            97531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having more than 100k training tokens

Lang code   Morph-Feats   no Morph-Feats   # of tokens
fa_seraji   81.18         81.12            121064
bg_btb      84.53         84.55            124336
en_ewt      75.77         75.682           204585
ar_padt     68.02         68.14            223881
de_gsd      71.59         71.32            263804
ca_ancora   85.89         85.874           417587
es_ancora   84.99         84.78            444617
cs_cac      83.57         83.63            472608
cs_pdt      81.43         82.12            1173282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle: transitions follow gold moves
Dynamic oracle: transitions follow predicted moves

In both cases, log p of the gold moves is maximized

[Figure: Tree-stack LSTM overview diagram]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123
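The static/dynamic distinction can be sketched as a toy training loop. Here `predict` is a hypothetical stand-in for the model and the state update is trivially simplified; this is not the thesis code:

```python
# Toy sketch: both oracles accumulate -log p(gold move); they differ only
# in which move is *executed* to produce the next parser state.
import math

def train_pass(gold_moves, predict, dynamic):
    """Return (total loss, executed move sequence) for one sentence."""
    loss, executed, state = 0.0, [], 0
    for gold in gold_moves:
        probs = predict(state)                         # move -> probability
        loss += -math.log(probs[gold])                 # maximize log p(gold)
        move = max(probs, key=probs.get) if dynamic else gold
        executed.append(move)                          # this move drives the state
        state += 1                                     # toy state update
    return loss, executed
```

With a static oracle the executed sequence equals the gold sequence; with a dynamic oracle the parser follows its own predictions and so visits states it will actually encounter at test time.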

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning:
1 Using very limited data to train an LM for word and context vectors, and using them to train a parser from scratch
2 Using Facebook's word vectors to train a parser [Bojanowski et al 2017]
3 Using my own word and context vectors trained on a different language from the same language family
4 Applying transfer learning with a pre-trained parser

Language       (1)            (2)     (3)     (4)
af_afribooms   not provided   75.46   77.43   78.12
kk_ktb         20.19          22.31   21.96   23.86
bxr_bdt        7.64           9.76    9.93    8.98
kmr_mg         20.12          22.57   22.78   23.39

Table: LAS values for strategies (1), (2), (3) and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the most beneficial

From-scratch LM training does not produce useful word and context vectors

Our word and context vectors are still more useful than Facebook's [Bojanowski et al 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition based parsers can only build projective trees 6

6 Figure from https://stp.lingfil.uu.se/~sara/kurser/5LN455-2014/lectures/5LN455-F8.pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123
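Projectivity can be checked directly from the head indices. A minimal sketch (words numbered from 1, head 0 = root, assuming a well-formed tree):

```python
# An arc (h, d) is projective iff every word strictly between h and d is a
# descendant of h; a tree is projective iff all its arcs are.
def is_projective(heads):
    """heads[i-1] is the head of word i (0 means root)."""
    n = len(heads)
    for d in range(1, n + 1):
        h = heads[d - 1]
        lo, hi = min(h, d), max(h, d)
        for k in range(lo + 1, hi):
            a = k
            while a != 0 and a != h:   # walk up from k toward the root
                a = heads[a - 1]
            if a != h:                 # k is not a descendant of h
                return False
    return True
```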

Projective vs Non-projective

We compared our model with the best model across different projectivity ratios

Language      Projectivity   Best (LAS)   Ours (LAS)
grc_perseus   90.7           79.39        55.03 (20)
eu_bdt        95.13          84.22        74.13 (17)
hu_szeged     97.8           82.66        68.18 (14)
da_ddt        98.26          86.28        76.40 (17)
en_gum        99.6           85.05        76.44 (15)
gl_treegal    100            74.25        70.45 (10)
gl_ctg        100            82.12        79.45 (14)

Table: Our model's performance gap decreases as the projectivity ratio increases

7 From the official results page and our projectivity table

Omer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusion:
We introduced "Context, Word and Morph-feat" embeddings and showed their contribution in transition based dependency parsing

Our Tree-stack LSTM outperformed the MLP by removing hand-crafted feature engineering

Tree-stack LSTM performed better on low-resource languages

When the training dataset size increases, tree-stack LSTM loses its advantage

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization, morphological tagging and dependency parsing performed better. Some also jointly train a language model together with pre-trained embeddings.

Attention Mechanism

Applying attention between σ-LSTM, β-LSTM or Action-LSTM states may bring a performance improvement

Morphological Features

Finding different ways to represent morphological features

Dynamic Oracle vs Beam Training

Although I tried both of them, I could not obtain a performance improvement. There may be convergence problems with our loss function, and other losses (e.g. CRF) may solve this problem

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap, Erenay Dayanık and Deniz Yuret. 2018. Tree-stack LSTM in Transition Based Dependency Parsing. In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap, Berkay Furkan Onder and Deniz Yuret. 2017. Parsing with Context Embeddings. In Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann, Carlos Gómez-Rodríguez and Giorgio Satta. 2011. Dynamic programming algorithms for transition-based dependency parsers. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Volume 1. Association for Computational Linguistics, pages 673-682.

S. Kübler, R. McDonald and J. Nivre. 2009. Dependency Parsing. Morgan & Claypool, US.

Chris Dyer, Miguel Ballesteros, Wang Ling, Austin Matthews and Noah A. Smith. 2015. Transition-based dependency parsing with stack long short-term memory. CoRR abs/1505.08075.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123


Tree-stack LSTM beats other model variations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of Ablation Experiments

t-RNNrsquos performance contribution increases when the training sizedecreases

σ-LSTM provides more useful information independent from datasetsize

Interconnecting modelrsquos component with t-RNN makes tree-stackLSTM more powerful for low-resource languages (ranked 10th of alland 2nd among transition based parsers)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What does Morphological Feature Embedding provide

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental SettingsWe divide Conll18 UD dataset 22 into 4 parts based on number oftraining tokens for each language to better understand our contributions

Languages having less than 20k tokens

Languages having more than 20k less than 50k tokens

Languages having more than 50k less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having less than 20k training tokens

Lang code Morph-Feats no Morph-Feats of tokensno nynorsklia 5113 5333 3583

ru taiga 5832 6055 10479

sme giella 5278 5339 16385

la perseus 4993 516 18184

ug udt 5278 5339 19262

sl sst 4672 4877 19473

hu szeged 6623 6818 20166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having tokens in between 50k and100k

Lang code Morph-Feats no Morph-Feats of tokenssv lines 7218 7481 48325

fr sequoia 8436 8217 50543

en gum 7644 7534 53686

ko gsd 7374 7254 56687

eu bdt 7455 7332 72974

nl lassymal 767 758 75134

gl ctg 7902 79018 79327

lv lvtb 7233 7224 80666

id gsd 7576 7397 97531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having more than 100k trainingtokens

Lang code Morph-Feats no Morph-Feats of tokensfa seraji 8118 8112 121064

bg btb 8453 8455 124336

en ewt 7577 75682 204585

ar padt 6802 6814 223881

de gsd 7159 7132 263804

ca ancora 8589 85874 417587

es ancora 8499 8478 444617

cs cac 8357 8363 472608

cs pdt 8143 8212 1173282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle transitions using gold movesDynamic oracle transitions using predicted moves

In both cases logp of gold moves maximized

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning1 Using very limited data to train LM for word and context vectors and

use them to train a parser from scratch2 Using Facebookrsquos word vectors to train a parser [Bojanowski et al

2017]3 Using my own word and context vectors trained with different

language but from the same language family4 Applying transfer learning with a pre-trained parser

Language (1) (2) (3) (4)af afribooms not provided 7546 7743 7812kk ktb 2019 2231 2196 2386bxr bdt 764 976 993 898

kmr mg 2012 2257 2278 2339

Table LAS values for strategies (1) (2) (3) and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the mostbeneficial

From scratch LM training does not bring useful word and contextvectors

Our word and context vectors are still more useful than Facebookrsquos[Bojanowski et al 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition Based Parser can only build projective trees 6

6Figure fromhttpstplingfiluuse sarakurser5LN455-2014lectures5LN455-F8pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123

Projective vs Non-projective

We compared our model with the best model for different projectivityratios

Language Projectiviy Best (LAS) Our (LAS)

grc perseus 907 7939 5503 (20)

eu bdt 9513 8422 7413 (17)

hu szeged 978 8266 6818 (14)

da ddt 9826 8628 7640 (17)

en gum 996 8505 7644 (15)

gl treegal 100 7425 7045 (10)

gl ctg 100 8212 7945 (14)

Table Our models performance gap decreases as the projectivity ratio increases

7

7From official results page and our projectivity tableOmer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusionWe introduced ldquoContext Word and Morph-featrdquo embeddings and showedtheir contribution in transition based dependency parsing

Our Tree-stack LSTM outperformed MLP by removing hand-craftedfeature engineering

Tree-stack LSTM performed better with low resource languages

When the training dataset size increases tree-stack LSTM losses itsadvantage

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization morphological taggingand dependency parsing performed better Some are also jointly trained alanguage model together with pre-trained embeddings

Attention Mechanism

Applying attention in between σ-LSTM states or β-LSTM orAction-LSTM may bring performance improvement

Morphological Features

Finding different way to represent morphological features

Dynamic Oracle vs Beam Training

Although I tried both of them I could not obtain performanceimprovement There may be convergence problems with our loss functionand another losses (CRF) may solve this problem

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap Erenay Dayanık and Deniz Yuret 2018 Tree-stackLSTM in Transition Based Dependency Parsing In Proceedings ofthe CoNLL 2018 Shared Task Multilingual Parsing from Raw Text toUniversal Dependencies

Omer Kırnap Berkay Furkan Onder and Deniz Yuret 2017 Parsingwith Context Embeddings In Proceedings of the CoNLL 2017 SharedTask Multilingual Parsing from Raw Text to Universal Dependencies

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann Carlos Gomez-Rodriguez and Giorgio Satta 2011Dynamic programming algorithms for transition-based dependencyparsers In Proceedings of the 49th Annual Meeting of theAssociation for Computational Linguistics Human LanguageTechnologies-Volume 1 Association for Computational Linguisticspages 673682

S Kbler R McDonald and J Nivre 2009 Dependency parsingMorgan amp Claypool US

Chris Dyer Miguel Ballesteros Wang Ling Austin Matthews andNoah A Smith 2015 Transition based dependency parsing withstack long-short term memory CoRR abs150508075

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123

  • Introduction
    • Overview of Dependency Parsing
    • Transition Based Dependency Parsing
      • Related Work
        • Linear Models and their Drawbacks
        • Neural Network Models
          • Model
            • Language Model
            • MLP Parser
            • Tree-stack LSTM Parser
              • Results
                • MLP vs Tree-stack LSTM
                • Morphological Feature Embeddings
                • Static vs Dynamic Oracle Training
                • Transfer Learning
                  • Conclusion
                  • Future Work amp Discussions
Page 39: Transition Based Dependency Parsing with Deep LearningTransition Based Dependency Parsing with Deep Learning Omer K rnap Ko˘c University okirnap@ku.edu.tr September 27, 2018 Omer

b MLP Parser (CoNLL17)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 39 123

MLP Parser

MLP Parser consists of 4 components

Character Based LSTM extracts word vectors

Word Based BiLSTM extracts context vectors

Feature extractor describes current state

Decision module (MLP) decides the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 40 123
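To make the decision module concrete, here is a minimal pure-Python sketch of an MLP that maps an extracted feature vector to one score per transition. The sizes and random weights are toy stand-ins (assumptions), not the thesis's trained parameters:

```python
import math
import random

random.seed(0)

# Toy sizes (assumptions, not the thesis's): 8 features, 16 hidden
# units, one score per transition (e.g. shift, left-arc, right-arc).
N_FEATS, HIDDEN, N_TRANS = 8, 16, 3

def rand_matrix(rows, cols):
    return [[random.uniform(-0.1, 0.1) for _ in range(cols)] for _ in range(rows)]

W1 = rand_matrix(HIDDEN, N_FEATS)
W2 = rand_matrix(N_TRANS, HIDDEN)

def decide(feats):
    """One forward pass: tanh hidden layer, then one score per transition."""
    hidden = [math.tanh(sum(w * x for w, x in zip(row, feats))) for row in W1]
    scores = [sum(w * h for w, h in zip(row, hidden)) for row in W2]
    return scores.index(max(scores))  # index of the highest-scoring transition

features = [random.uniform(-1.0, 1.0) for _ in range(N_FEATS)]  # stand-in for the extractor
print(decide(features))
```

A trained parser would learn W1 and W2 by maximizing the log-probability of the oracle transitions; here they are random, so the choice is arbitrary but well-formed.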

MLP Parser - Feature Extraction

Feature extractor describes current state

Figure: Kırnap et al 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 41 123

MLP Parser - Decision Module

Decision module (MLP) decides the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 42 123

Experiments amp Dataset (MLP) CoNLL17

CoNLL17 Dataset

Dependency parsing of 81 treebanks in 49 languages

All treebanks use standardized annotation
• 17 universal part-of-speech tags
• 37 universal dependency relations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 43 123

Experiments - Evaluation Metric

Labeled Attachment Score (LAS): the percentage of words assigned both the correct syntactic head and the correct dependency label.

Figure: LAS example on the fragment "Economic news had". The gold tree has the arcs SBJ and ATT, so predicting it exactly gives LAS 1. Pred 1 (arcs PRED and OBJ, both wrong) gives LAS 0. Pred 2 (arcs OBJ and ATT, one of two correct) gives LAS (1/2) * 100 = 50.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 44 123
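The LAS metric above is straightforward to compute. A small sketch, assuming each parse is given as a list of (head, label) pairs per word (heads 1-based, 0 = root); the toy trees echo the slide's example but the exact labels are illustrative:

```python
def las(gold, pred):
    """Labeled Attachment Score: percentage of words whose predicted
    (head, label) pair exactly matches the gold annotation."""
    assert len(gold) == len(pred)
    correct = sum(1 for g, p in zip(gold, pred) if g == p)
    return 100.0 * correct / len(gold)

# "Economic news had" with 1-based heads (0 = root):
gold = [(2, "ATT"), (3, "SBJ"), (0, "ROOT")]
pred = [(3, "OBJ"), (3, "SBJ"), (0, "ROOT")]  # first word's head and label wrong

print(las(gold, gold))  # 100.0
print(las(gold, pred))  # about 66.7
```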

Experiments (MLP)

CoNLL 2017 Results (all treebanks LAS)

Ranked 1st among transition based parsers 5

5 Source: CoNLL17 official results page

Omer Kırnap (Koc University) MSc Thesis September 27 2018 45 123

Contributions in CoNLL17

Omer Kırnap (Koc University) MSc Thesis September 27 2018 46 123

Context and Word Embeddings

Relative contributions of part-of-speech (p), word vector (v), and context vector (c):

Feats   Hungarian   En-ParTUT   Latvian
p       63.6        76.6        55.9
v       73.5        75.9        63.0
c       72.2        76.0        63.5
v-c     76.0        79.0        67.6
p-c     78.0        82.5        70.6
p-v     76.6        80.8        67.7
p-fb    74.7        79.7        66.3
p-v-c   79.3        83.2        74.2

Omer Kırnap (Koc University) MSc Thesis September 27 2018 47 123

Context and Word Embeddings

Relative contributions of part-of-speech (p), word vector (v), and context vector (c):

Feats   Hungarian   En-ParTUT   Latvian
p       63.6        76.6        55.9
v       73.5        75.9        63.0
c       72.2        76.0        63.5
v-c     76.0        79.0        67.6
p-c     78.0        82.5        70.6
p-v     76.6        80.8        67.7
p-fb    74.7        79.7        66.3
p-v-c   79.3        83.2        74.2

Context vectors provide an independent contribution on top of POS tags

Omer Kırnap (Koc University) MSc Thesis September 27 2018 48 123

Context and Word embeddings

Feats   Hungarian   En-ParTUT   Latvian
p       63.6        76.6        55.9
v       73.5        75.9        63.0
c       72.2        76.0        63.5
v-c     76.0        79.0        67.6
p-c     78.0        82.5        70.6
p-v     76.6        80.8        67.7
p-fb    74.7        79.7        66.3
p-v-c   79.3        83.2        74.2

Our BiLSTM language model word vectors perform better than FB vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 49 123

Context and Word embeddings

Feats   Hungarian   En-ParTUT   Latvian
p       63.6        76.6        55.9
v       73.5        75.9        63.0
c       72.2        76.0        63.5
v-c     76.0        79.0        67.6
p-c     78.0        82.5        70.6
p-v     76.6        80.8        67.7
p-fb    74.7        79.7        66.3
p-v-c   79.3        83.2        74.2

Both POS tags and context vectors have significant contributions on top of word vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 50 123

Issues with MLP

However

Choosing the correct parser state representation still remains critical

We are unable to represent the whole parsing history with feature extraction

Omer Kırnap (Koc University) MSc Thesis September 27 2018 51 123

Solution

Find a recurrent architecture that can summarize the parsing history as well as the word sequences in the buffer and stack

Omer Kırnap (Koc University) MSc Thesis September 27 2018 52 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to Universal Dependencies

CoNLL17

• Koc-University team with MLP Parser using Context Embeddings

CoNLL18

• KParse team with Tree-stack LSTM Parser using Context and Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 53 123

c Tree-stack LSTM Parser (CoNLL18)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 54 123

Related Work - Stack LSTM

Figure Stack LSTM [Dyer et al 2015]

Represent each component (σ, β, A) with an LSTM

Modify the head word's embedding with the dependent's embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 55 123

Problems with Stack LSTM

They only modify the stack's word embeddings

Hidden states of the LSTMs are not updated unless a reduce occurs

Actions are not explicitly represented

They only used word2vec embeddings [Mikolov et al 2013]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 56 123

Our solution

We propose

Context embeddings should improve parsing accuracy

Dependency relations should be explicitly represented

Morphological Features of a word may enhance parsing accuracy

Omer Kırnap (Koc University) MSc Thesis September 27 2018 57 123

Tree-stack LSTM Overview

[Figure: Tree-stack LSTM architecture. The σ-LSTM, β-LSTM, and Action-LSTM outputs are concatenated and fed to an MLP; the t-RNN combines head word, dependent word, and dependency relation embeddings]

We propose Tree-stack LSTM model with 4 components

1 β-LSTM
2 σ-LSTM
3 Action-LSTM
4 Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 58 123

Tree-stack LSTM

Input Representation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 59 123

Input Representation

Action and Dependency Relation Embeddings

Every action is represented with a continuous vector

Every dependency relation is represented with a continuous vector

Omer Kırnap (Koc University) MSc Thesis September 27 2018 60 123

Input Representation

We do not include an explicit feature extractor. We initialize the word representation by concatenating:

Character Based LSTM's word vectors

Word Based BiLSTM's context vectors

Part-of-speech (POS) vectors

Morph-feat vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 61 123
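The concatenation above can be sketched directly. The dimensions here are toy values (assumptions), far smaller than a real parser would use:

```python
# Toy dimensions (assumptions, not the thesis's actual sizes).
CHAR_DIM, CONTEXT_DIM, POS_DIM, FEAT_DIM = 4, 6, 3, 3

def word_input(char_vec, context_vec, pos_vec, feat_vec):
    """Concatenate the four embeddings into one word input vector."""
    assert (len(char_vec), len(context_vec), len(pos_vec), len(feat_vec)) == \
           (CHAR_DIM, CONTEXT_DIM, POS_DIM, FEAT_DIM)
    return char_vec + context_vec + pos_vec + feat_vec

v = word_input([0.1] * CHAR_DIM, [0.2] * CONTEXT_DIM,
               [0.3] * POS_DIM, [0.4] * FEAT_DIM)
print(len(v))  # 16
```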

Input Representation

Morph-feat Vectors

Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs IT It

Figure Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 62 123
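One simple way to realize a morph-feat embedding is sketched below, under the assumption that each key=value feature has its own small vector and that a word's feature vectors are summed (whether the thesis sums or concatenates per-feature vectors is an implementation detail; the table values are made up):

```python
DIM = 4  # toy embedding size (assumption)

# Hypothetical lookup table: one vector per key=value morphological feature.
emb = {
    "Case=Nom":     [0.1, 0.0, 0.0, 0.0],
    "Gender=Neut":  [0.0, 0.2, 0.0, 0.0],
    "Number=Sing":  [0.0, 0.0, 0.3, 0.0],
    "Person=3":     [0.0, 0.0, 0.0, 0.4],
    "PronType=Prs": [0.1, 0.1, 0.0, 0.0],
}
UNK = [0.0] * DIM  # fallback for features unseen in training

def morph_feat_vector(feats):
    """Sum the embeddings of every key=value pair in a CoNLL-U FEATS string."""
    total = [0.0] * DIM
    for f in feats.split("|"):
        for i, x in enumerate(emb.get(f, UNK)):
            total[i] += x
    return total

v = morph_feat_vector("Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs")
print(v)
```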

Tree-stack LSTM

Model Components:
1 β-LSTM
2 σ-LSTM
3 Action-LSTM
4 Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 63 123

β-LSTM

[Figure: Tree-stack LSTM architecture diagram (σ-LSTM, β-LSTM, Action-LSTM, t-RNN, Concat, MLP)]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 64 123

β-LSTM

Figure: Buffer's β-LSTM running over the upcoming words w_i, w_i+1, w_i+2

Omer Kırnap (Koc University) MSc Thesis September 27 2018 65 123

σ-LSTM

[Figure: Tree-stack LSTM architecture diagram (σ-LSTM, β-LSTM, Action-LSTM, t-RNN, Concat, MLP)]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 66 123

σ-LSTM

Figure: Stack's σ-LSTM running over the stack items s_i, s_i+1, s_i+2

Omer Kırnap (Koc University) MSc Thesis September 27 2018 67 123

Action-LSTM

[Figure: Tree-stack LSTM architecture diagram (σ-LSTM, β-LSTM, Action-LSTM, t-RNN, Concat, MLP)]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 68 123

Action-LSTM

Figure: Action-LSTM running over the transition history

Omer Kırnap (Koc University) MSc Thesis September 27 2018 69 123

How are the components of tree-stack LSTM connected?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 70 123

Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 71 123

Tree-RNN (t-RNN)

Figure: t-RNN combines the dependent word, dependency relation, and head word embeddings

w_head_new = tanh(W_rnn * [w_head_old ; d_l ; w_dep] + b_rnn)   (1)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 72 123
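Equation (1) can be sketched in pure Python. The toy dimensions and fixed weights below are placeholders (assumptions) standing in for the trained W_rnn and b_rnn:

```python
import math

H, R, D = 3, 2, 3  # toy sizes: head dim, relation dim, dependent dim
IN = H + R + D

# Placeholder parameters standing in for the trained W_rnn and b_rnn.
W_rnn = [[0.1 * ((i + j) % 3 - 1) for j in range(IN)] for i in range(H)]
b_rnn = [0.0] * H

def t_rnn(head_old, rel, dep):
    """Equation (1): w_head_new = tanh(W_rnn * [head; rel; dep] + b_rnn)."""
    x = head_old + rel + dep  # concatenation [w_head_old ; d_l ; w_dep]
    return [math.tanh(sum(w * xi for w, xi in zip(row, x)) + b)
            for row, b in zip(W_rnn, b_rnn)]

new_head = t_rnn([0.5, -0.5, 0.1], [1.0, 0.0], [0.2, 0.0, -0.3])
print(new_head)
```

The returned vector replaces the head word's embedding, so repeated attachments progressively fold the subtree into the head representation.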

Tree-RNN with

1 Left Transition
2 Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 73 123

Left Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 74 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure (left transition): each embedding is initialized by concatenating POS, language, and morph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 75 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure (left transition): the stack's top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 76 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure (left transition): t-RNN calculates the new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 77 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure (left transition): β-LSTM recalculates its hidden state based on the new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 78 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure (left transition): tree-stack LSTM is ready to predict the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 79 123

Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 80 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure (right transition): each embedding is initialized by concatenating POS, language, and morph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 81 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure (right transition): the stack's top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 82 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure (right transition): t-RNN calculates the new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 83 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure (right transition): σ-LSTM recalculates its hidden state from the new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 84 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure (right transition): tree-stack LSTM is ready to predict the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 85 123
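The left_d and right_d transitions above can be sketched as pure functions on a parser state (σ, β, A). This toy version tracks words by index; shift is the standard third transition, assumed here since the slides do not show it:

```python
def shift(state):
    """Standard third transition (assumed; not shown on the slides)."""
    sigma, beta, arcs = state
    return sigma + (beta[0],), beta[1:], arcs

def left(state, d):
    """left_d: attach stack top s as dependent of buffer front b, pop s."""
    sigma, beta, arcs = state
    s, b = sigma[-1], beta[0]
    return sigma[:-1], beta, arcs | {(b, d, s)}  # arc = (head, label, dependent)

def right(state, d):
    """right_d: attach stack top t as dependent of the item s below it, pop t."""
    sigma, beta, arcs = state
    s, t = sigma[-2], sigma[-1]
    return sigma[:-1], beta, arcs | {(s, d, t)}

# Parse "news had" (word indices 1, 2) with ROOT = 0 pre-loaded on the stack.
state = ((0,), (1, 2), frozenset())
state = shift(state)          # sigma = (0, 1), beta = (2,)
state = left(state, "SBJ")    # adds arc (2, SBJ, 1), sigma = (0,)
state = shift(state)          # sigma = (0, 2), beta = ()
state = right(state, "ROOT")  # adds arc (0, ROOT, 2), sigma = (0,)
print(sorted(state[2]))
```

Parsing terminates when the buffer is empty and only the root remains on the stack, with A holding the predicted tree.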

Final overview of Tree-stack LSTM

[Figure: Tree-stack LSTM architecture diagram (σ-LSTM, β-LSTM, Action-LSTM, t-RNN, Concat, MLP)]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 86 123

Overview

1 Introduction
  • Overview of Dependency Parsing
  • Transition Based Dependency Parsing

2 Related Work
  • Linear Models and their Drawbacks
  • Neural Network Models

3 Model
  • Language Model
  • MLP Parser
  • Tree-stack LSTM Parser

4 Results
  • MLP vs Tree-stack LSTM
  • Morphological Feature Embeddings
  • Static vs Dynamic Oracle Training
  • Transfer Learning

5 Conclusion

6 Future Work & Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 87 123

4 Results & Comparisons

Omer Kırnap (Koc University) MSc Thesis September 27 2018 88 123

Results & Comparisons

Dataset

CoNLL17
• Dependency parsing of 81 treebanks in 49 languages
• All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations
• Koc-University ranked 7th out of 33 participants (1st among transition based parsers)

CoNLL18
• Dependency parsing of 82 treebanks in 57 languages
• All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations
• Koc-University ranked 16th out of 30 participants (2nd among transition based parsers)

Changes from CoNLL17 to CoNLL18: 1. Train/test split 2. Annotation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 89 123

MLP vs Tree-stack LSTM

CoNLL 2018 committee released comparison results of CoNLL17 andCoNLL18 systems tested under the same test sets

Omer Kırnap (Koc University) MSc Thesis September 27 2018 90 123

MLP vs Tree-stack LSTM

2 possible problems of the official comparison:

1 If the annotation of the treebank is improved, the older parser is handicapped

2 If the training-test split has changed and the old training data are now in the test data, the old parser is favored undeservedly

Omer Kırnap (Koc University) MSc Thesis September 27 2018 91 123

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code        MLP     Tree-stack
ru taiga (10k)   58.89   60.55
hu szeged (20k)  66.21   68.18
tr imst (50k)    56.78   58.75
ar padt (120k)   67.83   68.14
en ewt (205k)    74.87   75.77
cs cac (473k)    83.39   83.57

Tree-stack LSTM outperforms MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 92 123

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 93 123

MLP Parser

Figure: Initial model (plain MLP)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 94 123

Only Action LSTM

Figure: Only-Action-LSTM model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 95 123

Only β-LSTM

Figure: Only β-LSTM model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 96 123

Only σ-LSTM

Figure: Only σ-LSTM model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang Code   MLP     Only Action   Only-β   Only-σ
hu szeged   66.21   66.87         66.94    67.03
sv lines    71.12   72.05         72.17    72.45
tr imst     57.12   56.87         57.02    57.12
ar padt     67.83   66.67         66.89    66.92
cs cac      83.89   82.23         83.13    83.17
en ewt      75.54   75.43         75.56    75.67

Table: Comparison between MLP and "Only" models

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN

[Figure: Tree-stack LSTM architecture diagram (σ-LSTM, β-LSTM, Action-LSTM, t-RNN, Concat, MLP)]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 99 123

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang Code            without t-RNN   with t-RNN
no nynorsklia (3k)   51.78           53.33
ru taiga (11k)       59.13           60.55
gl treegal (15k)     69.76           70.45
hu szeged (20k)      66.12           68.18
sv lines (49k)       74.04           75.46
tr imst (50k)        58.12           58.75
ar padt (120k)       68.04           68.14
en ewt (204k)        74.87           75.77
cs cac (473k)        82.89           83.57
cs pdt (1M)          81.17           81.164

t-RNN provides comparative advantage for low-resourcelanguages

Omer Kırnap (Koc University) MSc Thesis September 27 2018 100 123

Ablation Analysis

Overall results of ablation analysis

Lang        MLP     Only A   Only-β   Only-σ   w/o t-RNN   all
hu szeged   66.21   66.87    66.94    67.03    66.12       68.18
sv lines    71.12   72.05    72.17    74.04    72.17       75.46
tr imst     57.12   56.87    57.02    57.12    58.12       58.75
ar padt     67.83   66.67    66.89    66.92    68.04       68.14
cs cac      83.89   82.23    83.13    83.17    82.89       83.57
en ewt      75.54   75.43    75.56    75.67    74.87       75.77

Tree-stack LSTM beats other model variations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of Ablation Experiments

t-RNN's performance contribution increases as the training size decreases

σ-LSTM provides more useful information independent of dataset size

Interconnecting the model's components with t-RNN makes tree-stack LSTM more powerful for low-resource languages (ranked 10th overall and 2nd among transition based parsers)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What does Morphological Feature Embedding provide?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental Settings: We divide the CoNLL18 UD dataset (v2.2) into 4 parts based on the number of training tokens for each language, to better understand our contributions

Languages having less than 20k tokens

Languages having more than 20k less than 50k tokens

Languages having more than 50k less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having less than 20k training tokens

Lang code       Morph-Feats   no Morph-Feats   # of tokens
no nynorsklia   51.13         53.33            3583
ru taiga        58.32         60.55            10479
sme giella      52.78         53.39            16385
la perseus      49.93         51.6             18184
ug udt          52.78         53.39            19262
sl sst          46.72         48.77            19473
hu szeged       66.23         68.18            20166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having between 50k and 100k training tokens

Lang code       Morph-Feats   no Morph-Feats   # of tokens
sv lines        72.18         74.81            48325
fr sequoia      84.36         82.17            50543
en gum          76.44         75.34            53686
ko gsd          73.74         72.54            56687
eu bdt          74.55         73.32            72974
nl lassysmall   76.7          75.8             75134
gl ctg          79.02         79.018           79327
lv lvtb         72.33         72.24            80666
id gsd          75.76         73.97            97531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having more than 100k training tokens

Lang code   Morph-Feats   no Morph-Feats   # of tokens
fa seraji   81.18         81.12            121064
bg btb      84.53         84.55            124336
en ewt      75.77         75.682           204585
ar padt     68.02         68.14            223881
de gsd      71.59         71.32            263804
ca ancora   85.89         85.874           417587
es ancora   84.99         84.78            444617
cs cac      83.57         83.63            472608
cs pdt      81.43         82.12            1173282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle: transitions use gold moves
Dynamic oracle: transitions use predicted moves

In both cases, log p of the gold moves is maximized

[Figure: Tree-stack LSTM architecture diagram (σ-LSTM, β-LSTM, Action-LSTM, t-RNN, Concat, MLP)]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123
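The static/dynamic distinction above boils down to which move the parser follows during training, while the loss always targets the gold move. A toy sketch (the integer "state" merely stands in for a real parser configuration, and the random predictor for an untrained model):

```python
import random

random.seed(1)

def train_pairs(gold_moves, predict, dynamic):
    """Collect (state, gold_move) training pairs for one sentence.

    Static oracle: the parser follows the gold move at every step.
    Dynamic oracle: it follows its own (possibly wrong) prediction,
    but the training target, whose log p is maximized, is still gold.
    """
    state, pairs = 0, []  # integer state is a stand-in for a real configuration
    for gold in gold_moves:
        pairs.append((state, gold))              # loss target is always the gold move
        move = predict(state) if dynamic else gold
        state += 1 if move == gold else 2        # wrong moves lead to other states
    return pairs

noisy = lambda s: random.choice(["shift", "left", "right"])  # untrained model stand-in
gold = ["shift", "shift", "left", "right"]
print(train_pairs(gold, noisy, dynamic=False))
print(train_pairs(gold, noisy, dynamic=True))
```

The dynamic run visits states the gold path never reaches, which is exactly why dynamic oracles can teach the parser to recover from its own mistakes.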

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning:

1 Using very limited data to train an LM for word and context vectors, and using them to train a parser from scratch
2 Using Facebook's word vectors to train a parser [Bojanowski et al 2017]
3 Using my own word and context vectors trained on a different language from the same language family
4 Applying transfer learning with a pre-trained parser

Language       (1)            (2)     (3)     (4)
af afribooms   not provided   75.46   77.43   78.12
kk ktb         20.19          22.31   21.96   23.86
bxr bdt        7.64           9.76    9.93    8.98
kmr mg         20.12          22.57   22.78   23.39

Table LAS values for strategies (1) (2) (3) and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the mostbeneficial

From-scratch LM training does not produce useful word and context vectors

Our word and context vectors are still more useful than Facebook's [Bojanowski et al 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

A transition based parser can only build projective trees (trees with no crossing arcs) 6

6 Figure from https://stp.lingfil.uu.se/~sara/kurser/5LN455-2014/lectures/5LN455-F8.pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123
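Projectivity can be checked by testing whether any two arcs cross. A small sketch, assuming the tree is given as a list of 1-based head indices (0 = root); the quadratic loop is fine for sentence-length inputs:

```python
def is_projective(heads):
    """heads[i] is the head of word i+1 (words are 1-based, 0 = root).

    A tree is projective iff no two arcs cross when drawn above the sentence.
    """
    arcs = [(min(h, d), max(h, d)) for d, h in enumerate(heads, start=1)]
    for l1, r1 in arcs:
        for l2, r2 in arcs:
            if l1 < l2 < r1 < r2:  # the two arcs overlap without nesting
                return False
    return True

print(is_projective([2, 3, 0, 3]))  # True: a small projective tree
print(is_projective([3, 4, 0, 3]))  # False: the arcs spanning (1,3) and (2,4) cross
```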

Projective vs Non-projective

We compared our model with the best model for different projectivityratios

Language      Projectivity   Best (LAS)   Our (LAS)
grc perseus   90.7           79.39        55.03 (20)
eu bdt        95.13          84.22        74.13 (17)
hu szeged     97.8           82.66        68.18 (14)
da ddt        98.26          86.28        76.40 (17)
en gum        99.6           85.05        76.44 (15)
gl treegal    100            74.25        70.45 (10)
gl ctg        100            82.12        79.45 (14)

Table: Our model's performance gap decreases as the projectivity ratio increases

7 From the official results page and our projectivity table

Omer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusion, we introduced "Context, Word, and Morph-feat" embeddings and showed their contribution to transition based dependency parsing

Our Tree-stack LSTM outperformed MLP by removing hand-crafted feature engineering

Tree-stack LSTM performed better on low-resource languages

As the training dataset size increases, tree-stack LSTM loses its advantage

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization, morphological tagging, and dependency parsing performed better. Some also jointly train a language model together with pre-trained embeddings.

Attention Mechanism

Applying attention between σ-LSTM states, or within the β-LSTM or Action-LSTM, may bring a performance improvement

Morphological Features

Finding a different way to represent morphological features

Dynamic Oracle vs Beam Training

Although I tried both of them, I could not obtain a performance improvement. There may be convergence problems with our loss function, and other losses (e.g. CRF) may solve this problem.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap, Erenay Dayanık and Deniz Yuret. 2018. Tree-stack LSTM in Transition Based Dependency Parsing. In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap, Berkay Furkan Önder and Deniz Yuret. 2017. Parsing with Context Embeddings. In Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann, Carlos Gómez-Rodríguez and Giorgio Satta. 2011. Dynamic programming algorithms for transition-based dependency parsers. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Volume 1. Association for Computational Linguistics, pages 673-682.

S. Kübler, R. McDonald and J. Nivre. 2009. Dependency Parsing. Morgan & Claypool, US.

Chris Dyer, Miguel Ballesteros, Wang Ling, Austin Matthews and Noah A. Smith. 2015. Transition-based dependency parsing with stack long short-term memory. CoRR abs/1505.08075.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123


MLP Parser

MLP Parser consists of 4 components

Character Based LSTM extracts word vectors

Word Based BiLSTM extracts context vectors

Feature extractor describes current state

Decision module (MLP) decides the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 40 123
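The character-based component can be sketched with a plain tanh recurrence standing in for the character LSTM (all names, weights and dimensions here are illustrative; the point is that the final hidden state over a word's characters becomes its vector, so rare and unseen words still get a representation from spelling):

```python
import numpy as np

rng = np.random.default_rng(3)
dim = 16
Wh = rng.normal(scale=0.1, size=(dim, dim))  # recurrent weights (would be learned)
Wx = rng.normal(scale=0.1, size=(dim, dim))  # input weights (would be learned)
char_emb = {}

def word_vector(word):
    """Toy character-level recurrence: the final hidden state is the word vector."""
    h = np.zeros(dim)
    for ch in word:
        x = char_emb.setdefault(ch, rng.normal(scale=0.1, size=dim))
        h = np.tanh(Wh @ h + Wx @ x)
    return h

print(word_vector("parsing").shape)  # (16,)
```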

MLP Parser - Feature Extraction

Feature extractor describes current state

Figure: Kırnap et al. 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 41 123

MLP Parser - Decision Module

Decision module (MLP) decides the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 42 123
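A decision module of this shape can be sketched as a one-hidden-layer scorer over the extracted state features (the sizes, parameter names and three-transition inventory below are illustrative, not the thesis configuration):

```python
import numpy as np

rng = np.random.default_rng(2)
n_feats, hidden, n_transitions = 50, 64, 3   # illustrative sizes

W1 = rng.normal(scale=0.1, size=(hidden, n_feats)); b1 = np.zeros(hidden)
W2 = rng.normal(scale=0.1, size=(n_transitions, hidden)); b2 = np.zeros(n_transitions)

def next_transition(features):
    """One-hidden-layer MLP scoring the transitions; highest score wins."""
    h = np.tanh(W1 @ features + b1)
    scores = W2 @ h + b2
    return ["shift", "left_arc", "right_arc"][int(np.argmax(scores))]

print(next_transition(rng.normal(size=n_feats)))
```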

Experiments & Dataset (MLP): CoNLL17

CoNLL17 Dataset

Dependency parsing of 81 treebanks in 49 languages

All treebanks use standardized annotation:
• 17 universal part-of-speech tags
• 37 universal dependency relations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 43 123

Experiments - Evaluation Metric

Labeled Attachment Score (LAS): the percentage of words correctly assigned both the correct syntactic head and the correct dependency label.

[Figure: LAS example on "Economic news had". Gold tree (ATT, SBJ): LAS 1. Pred 1 (OBJ, PRED): LAS 0. Pred 2 (ATT, OBJ): LAS (1/2) × 100 = 50.]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 44 123
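The LAS computation can be written out as a short function (a minimal sketch; the tuple encoding of arcs as (head index, label) per word is illustrative):

```python
def las(gold, pred):
    """Labeled Attachment Score: percentage of words whose predicted
    head AND dependency label both match the gold annotation."""
    assert len(gold) == len(pred)
    correct = sum(1 for (gh, gl), (ph, pl) in zip(gold, pred)
                  if gh == ph and gl == pl)
    return 100.0 * correct / len(gold)

# Arcs for "Economic news had": (head index, label) per word
gold  = [(1, "ATT"), (2, "SBJ")]
pred1 = [(1, "OBJ"), (2, "PRED")]   # heads right, both labels wrong
pred2 = [(1, "ATT"), (2, "OBJ")]    # one of two words fully correct
print(las(gold, pred1))  # 0.0
print(las(gold, pred2))  # 50.0
```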

Experiments (MLP)

CoNLL 2017 Results (all treebanks LAS)

Ranked 1st among transition based parsers.

5 Source: CoNLL17 official results page.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 45 123

Contributions in CoNLL17

Omer Kırnap (Koc University) MSc Thesis September 27 2018 46 123

Context and Word Embeddings

Relative contributions of part-of-speech (p), word vector (v) and context vector (c):

Feats  Hungarian  En-ParTUT  Latvian
p      63.6       76.6       55.9
v      73.5       75.9       63
c      72.2       76         63.5
v-c    76         79         67.6
p-c    78         82.5       70.6
p-v    76.6       80.8       67.7
p-fb   74.7       79.7       66.3
p-v-c  79.3       83.2       74.2

Omer Kırnap (Koc University) MSc Thesis September 27 2018 47 123

Context and Word Embeddings


Context vectors provide an independent contribution on top of POS tags.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 48 123

Context and Word embeddings


Our BiLSTM language model word vectors perform better than FB vectors.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 49 123

Context and Word embeddings


Both POS tags and context vectors have significant contributions on top of word vectors.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 50 123

Issues with MLP

However

Choosing the correct representation of the parser state remains critical.

We are unable to represent the whole parsing history with feature extraction.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 51 123

Solution

Find a recurrent architecture that can summarize the parsing history as well as the word sequences in the buffer and stack.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 52 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to Universal Dependencies:

CoNLL17

• Koc-University team with MLP Parser using Context Embeddings

CoNLL18

• KParse team with Tree-stack LSTM Parser using Context and Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 53 123

Tree-stack LSTM Parser (CoNLL18)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 54 123

Related Work - Stack LSTM

Figure Stack LSTM [Dyer et al 2015]

Represent each component (σ, β, A) with an LSTM. Modify the head word's embedding with the dependent's embedding.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 55 123

Problems with Stack LSTM

They only modify the stack's word embeddings.

Hidden states of the LSTMs are not updated unless a reduce occurs.

Actions are not explicitly represented.

They only used word2vec embeddings [Mikolov et al. 2013].

Omer Kırnap (Koc University) MSc Thesis September 27 2018 56 123

Our solution

We propose

Context embeddings should improve parsing accuracy

Dependency relations should be explicitly represented

Morphological Features of a word may enhance parsing accuracy

Omer Kırnap (Koc University) MSc Thesis September 27 2018 57 123

Tree-stack LSTM Overview

[Figure: Tree-stack LSTM overview — β-LSTM over the buffer, σ-LSTM over the stack, Action-LSTM (A) over past transitions, and a t-RNN composing head word, dependent word and dependency relation; the component states are concatenated and fed to an MLP.]

We propose Tree-stack LSTM model with 4 components

β-LSTM
σ-LSTM
Action-LSTM
Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 58 123

Tree-stack LSTM

Input Representation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 59 123

Input Representation

Action and Dependency Relation Embeddings

Every action is represented with a continuous vector.

Every dependency relation is represented with a continuous vector.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 60 123

Input Representation

We do not include an explicit feature extractor. We initialize the word representation by concatenating:

Character based LSTM's word vectors

Word based BiLSTM's context vectors

Part-of-speech (POS) vectors

Morph-feat vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 61 123
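The concatenation above can be sketched in a few lines (the dimensions below are illustrative placeholders, not the sizes used in the thesis):

```python
import numpy as np

# Illustrative dimensions only; the actual sizes in the thesis may differ.
word_vec    = np.zeros(350)   # character-based LSTM word vector
context_vec = np.zeros(300)   # word-based BiLSTM context vector
pos_vec     = np.zeros(128)   # part-of-speech embedding
morph_vec   = np.zeros(128)   # morph-feat embedding

# The token representation is the concatenation of the four parts.
token_repr = np.concatenate([word_vec, context_vec, pos_vec, morph_vec])
print(token_repr.shape)  # (906,)
```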

Input Representation

Morph-feat Vectors

Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs IT It

Figure Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 62 123
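One plausible way to turn a UD FEATS string like the one above into a vector is to embed each Feature=Value pair and pool them — a sketch only; the thesis may embed morph-feats differently, and the table/dimension here are illustrative:

```python
import numpy as np

rng = np.random.default_rng(1)
dim = 4
emb = {}  # hypothetical embedding table keyed by "Feature=Value" strings

def morph_feat_vector(feats):
    """Embed a UD FEATS string such as 'Case=Nom|Number=Sing' by
    averaging one embedding per Feature=Value pair."""
    pairs = feats.split("|") if feats != "_" else []
    if not pairs:
        return np.zeros(dim)          # '_' means no morphological features
    for p in pairs:
        emb.setdefault(p, rng.normal(size=dim))
    return np.mean([emb[p] for p in pairs], axis=0)

v = morph_feat_vector("Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs")
print(v.shape)  # (4,)
```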

Tree-stack LSTM

Model Components:
1. β-LSTM
2. σ-LSTM
3. Action-LSTM
4. Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 63 123

β-LSTM

[Figure: Tree-stack LSTM overview (β-LSTM component).]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 64 123

β-LSTM

[Figure: Buffer's β-LSTM running over buffer words w_i, w_i+1, w_i+2.]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 65 123

σ-LSTM

[Figure: Tree-stack LSTM overview (σ-LSTM component).]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 66 123

σ-LSTM

[Figure: Stack's σ-LSTM running over stack words s_i, s_i+1, s_i+2.]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 67 123

Action-LSTM

[Figure: Tree-stack LSTM overview (Action-LSTM component).]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 68 123

Action-LSTM

[Figure: Action-LSTM running over past transitions.]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 69 123

How are the components of tree-stack LSTM connected?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 70 123

Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 71 123

Tree-RNN (t-RNN)

[Figure: t-RNN combining the head word, dependency relation and dependent word.]

w_head,new = tanh(W_rnn · [w_head,old ; d_l ; w_dep] + b_rnn)   (1)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 72 123
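The t-RNN composition above can be sketched numerically (the parameter names and dimensions are illustrative, not the thesis configuration):

```python
import numpy as np

rng = np.random.default_rng(0)
dim, rel_dim = 8, 4                       # illustrative embedding sizes
W_rnn = rng.normal(size=(dim, 2 * dim + rel_dim))  # would be learned
b_rnn = np.zeros(dim)

def t_rnn(w_head, d_label, w_dep):
    """Compose a new head embedding from the old head embedding, the
    dependency relation embedding and the dependent embedding (Eq. 1)."""
    x = np.concatenate([w_head, d_label, w_dep])
    return np.tanh(W_rnn @ x + b_rnn)

new_head = t_rnn(rng.normal(size=dim), rng.normal(size=rel_dim),
                 rng.normal(size=dim))
print(new_head.shape)  # (8,)
```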

Tree-RNN with

1. Left Transition
2. Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 73 123

Left Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 74 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: Each embedding is initiated by concatenating POS, language and morph-feat embeddings.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 75 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: Stack's top LSTM is reduced.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 76 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: t-RNN calculates the new head embedding.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 77 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: β-LSTM recalculates its hidden state based on the new input.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 78 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: Tree-stack LSTM is ready for the next transition.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 79 123

Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 80 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: Each embedding is initiated by concatenating POS, language and morph-feat embeddings.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 81 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: Stack's top LSTM is reduced.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 82 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: t-RNN calculates the new head embedding.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 83 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: σ-LSTM recalculates its hidden state from the new input.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 84 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: Tree-stack LSTM is ready for the next transition.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 85 123
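The two transitions can be sketched directly from the set notation on the slides (a toy sketch; the list-based stack/buffer encoding and function names are illustrative):

```python
def left_arc(stack, buffer, arcs, d):
    """left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)}):
    stack top s is popped and becomes a d-dependent of buffer front b."""
    s = stack.pop()
    arcs.add((buffer[0], d, s))
    return stack, buffer, arcs

def right_arc(stack, buffer, arcs, d):
    """right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)}):
    stack top t is popped and becomes a d-dependent of s below it."""
    t = stack.pop()
    arcs.add((stack[-1], d, t))
    return stack, buffer, arcs

stack, buffer, arcs = [0, 2], [3], set()   # word indices
left_arc(stack, buffer, arcs, "nsubj")
print(stack, arcs)        # [0] {(3, 'nsubj', 2)}

stack, arcs = [0, 3, 5], set()
right_arc(stack, [], arcs, "obj")
print(stack, arcs)        # [0, 3] {(3, 'obj', 5)}
```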

Final overview of Tree-stack LSTM

[Figure: Tree-stack LSTM overview — all components connected.]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 86 123

Overview

1. Introduction: Overview of Dependency Parsing; Transition Based Dependency Parsing

2. Related Work: Linear Models and their Drawbacks; Neural Network Models

3. Model: Language Model; MLP Parser; Tree-stack LSTM Parser

4. Results: MLP vs Tree-stack LSTM; Morphological Feature Embeddings; Static vs Dynamic Oracle Training; Transfer Learning

5. Conclusion
6. Future Work & Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 87 123

4. Results & Comparisons

Omer Kırnap (Koc University) MSc Thesis September 27 2018 88 123

Results & Comparisons

Dataset

CoNLL17:
• Dependency parsing of 81 treebanks in 49 languages
• All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations
• Koc-University ranked 7th out of 33 participants (1st among transition based parsers)

CoNLL18:
• Dependency parsing of 82 treebanks in 57 languages
• All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations
• Koc-University ranked 16th out of 30 participants (2nd among transition based parsers)

Differences between CoNLL17 and CoNLL18: 1. Train/test split change 2. Annotation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 89 123

MLP vs Tree-stack LSTM

The CoNLL 2018 committee released comparison results of CoNLL17 and CoNLL18 systems tested under the same test sets.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 90 123

MLP vs Tree-stack LSTM

Two possible problems with the official comparison:

1. If the annotation of the treebank is improved, the older parser is handicapped.

2. If the training-test split has changed and the old training data are now in the test data, the old parser is favored undeservedly.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 91 123

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code        MLP    Tree-stack
ru taiga (10k)   58.89  60.55
hu szeged (20k)  66.21  68.18
tr imst (50k)    56.78  58.75
ar padt (120k)   67.83  68.14
en ewt (205k)    74.87  75.77
cs cac (473k)    83.39  83.57

Tree-stack LSTM outperforms MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 92 123

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 93 123

MLP Parser

Figure: Initial model (MLP).

Omer Kırnap (Koc University) MSc Thesis September 27 2018 94 123

Only Action LSTM

Figure: Only action LSTM.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 95 123

Only β-LSTM

Figure: Only β-LSTM.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 96 123

Only σ-LSTM

Figure: Only σ-LSTM.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang Code  MLP    Only Action  Only-β  Only-σ
hu szeged  66.21  66.87        66.94   67.03
sv lines   71.12  72.05        72.17   72.45
tr imst    57.12  56.87        57.02   57.12
ar padt    67.83  66.67        66.89   66.92
cs cac     83.89  82.23        83.13   83.17
en ewt     75.54  75.43        75.56   75.67

Table: Comparison between MLP and "Only" models.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN

[Figure: Tree-stack LSTM overview (t-RNN component).]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 99 123

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN:

Lang Code           without t-RNN  with t-RNN
no nynorsklia (3k)  51.78          53.33
ru taiga (11k)      59.13          60.55
gl treegal (15k)    69.76          70.45
hu szeged (20k)     66.12          68.18
sv lines (49k)      74.04          75.46
tr imst (50k)       58.12          58.75
ar padt (120k)      68.04          68.14
en ewt (204k)       74.87          75.77
cs cac (473k)       82.89          83.57
cs pdt (1M)         81.17          81.164

t-RNN provides a comparative advantage for low-resource languages.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 100 123

Ablation Analysis

Overall results of ablation analysis:

Lang       MLP    Only A  Only-β  Only-σ  w/o t-RNN  all
hu szeged  66.21  66.87   66.94   67.03   66.12      68.18
sv lines   71.12  72.05   72.17   74.04   72.17      75.46
tr imst    57.12  56.87   57.02   57.12   58.12      58.75
ar padt    67.83  66.67   66.89   66.92   68.04      68.14
cs cac     83.89  82.23   83.13   83.17   82.89      83.57
en ewt     75.54  75.43   75.56   75.67   74.87      75.77

Tree-stack LSTM beats other model variations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of Ablation Experiments

t-RNN's performance contribution increases when the training size decreases.

σ-LSTM provides more useful information independent of dataset size.

Interconnecting the model's components with t-RNN makes tree-stack LSTM more powerful for low-resource languages (ranked 10th overall and 2nd among transition based parsers).

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What does Morphological Feature Embedding provide?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental Settings: We divide the CoNLL18 UD v2.2 dataset into 4 parts based on the number of training tokens for each language, to better understand our contributions.

Languages having less than 20k tokens

Languages having more than 20k and less than 50k tokens

Languages having more than 50k and less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having less than 20k training tokens:

Lang code      Morph-Feats  no Morph-Feats  # of tokens
no nynorsklia  51.13        53.33           3,583
ru taiga       58.32        60.55           10,479
sme giella     52.78        53.39           16,385
la perseus     49.93        51.6            18,184
ug udt         52.78        53.39           19,262
sl sst         46.72        48.77           19,473
hu szeged      66.23        68.18           20,166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having tokens between 50k and 100k:

Lang code      Morph-Feats  no Morph-Feats  # of tokens
sv lines       72.18        74.81           48,325
fr sequoia     84.36        82.17           50,543
en gum         76.44        75.34           53,686
ko gsd         73.74        72.54           56,687
eu bdt         74.55        73.32           72,974
nl lassysmall  76.7         75.8            75,134
gl ctg         79.02        79.018          79,327
lv lvtb        72.33        72.24           80,666
id gsd         75.76        73.97           97,531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having more than 100k training tokens:

Lang code  Morph-Feats  no Morph-Feats  # of tokens
fa seraji  81.18        81.12           121,064
bg btb     84.53        84.55           124,336
en ewt     75.77        75.682          204,585
ar padt    68.02        68.14           223,881
de gsd     71.59        71.32           263,804
ca ancora  85.89        85.874          417,587
es ancora  84.99        84.78           444,617
cs cac     83.57        83.63           472,608
cs pdt     81.43        82.12           1,173,282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle: transitions follow gold moves. Dynamic oracle: transitions follow predicted moves.

In both cases, the log-probability of the gold moves is maximized.

[Figure: Tree-stack LSTM overview.]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123
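The static/dynamic distinction can be sketched as a toy training step (ToyModel and ToyState are stand-ins invented for illustration; the real model scores transitions from the parser state):

```python
import random

class ToyModel:
    """Stand-in scorer that always prefers 'shift' (illustrative only)."""
    def predict(self, state): return "shift"
    def log_prob(self, state, move): return 0.0 if move == "shift" else -1.0

class ToyState:
    def __init__(self): self.moves = []
    def apply(self, move):
        self.moves.append(move); return self

def train_step(state, gold_move, model, dynamic=False, explore=1.0):
    """Static oracle follows the gold move; a dynamic oracle may follow the
    model's own prediction — but the loss always targets the gold move."""
    loss = -model.log_prob(state, gold_move)   # log p of gold maximized either way
    if dynamic and random.random() < explore:
        next_move = model.predict(state)       # may be wrong: exposes training to errors
    else:
        next_move = gold_move                  # always correct: static oracle
    return loss, state.apply(next_move)

loss, st = train_step(ToyState(), "left_arc", ToyModel(), dynamic=True)
print(st.moves)  # ['shift'] — the model's move, while the loss targets 'left_arc'
```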

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning:
1. Using very limited data to train a LM for word and context vectors, and using them to train a parser from scratch
2. Using Facebook's word vectors to train a parser [Bojanowski et al. 2017]
3. Using my own word and context vectors trained on a different language from the same language family
4. Applying transfer learning with a pre-trained parser

Language      (1)           (2)    (3)    (4)
af afribooms  not provided  75.46  77.43  78.12
kk ktb        20.19         22.31  21.96  23.86
bxr bdt       7.64          9.76   9.93   8.98
kmr mg        20.12         22.57  22.78  23.39

Table: LAS values for strategies (1), (2), (3) and (4).

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the most beneficial.

Training an LM from scratch on limited data does not yield useful word and context vectors.

Our word and context vectors are still more useful than Facebook's [Bojanowski et al. 2017].

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition Based Parser can only build projective trees 6

6Figure fromhttpstplingfiluuse sarakurser5LN455-2014lectures5LN455-F8pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123

Projective vs Non-projective

We compared our model with the best model for different projectivityratios

Language Projectiviy Best (LAS) Our (LAS)

grc perseus 907 7939 5503 (20)

eu bdt 9513 8422 7413 (17)

hu szeged 978 8266 6818 (14)

da ddt 9826 8628 7640 (17)

en gum 996 8505 7644 (15)

gl treegal 100 7425 7045 (10)

gl ctg 100 8212 7945 (14)

Table Our models performance gap decreases as the projectivity ratio increases

7

7From official results page and our projectivity tableOmer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusionWe introduced ldquoContext Word and Morph-featrdquo embeddings and showedtheir contribution in transition based dependency parsing

Our Tree-stack LSTM outperformed MLP by removing hand-craftedfeature engineering

Tree-stack LSTM performed better with low resource languages

When the training dataset size increases tree-stack LSTM losses itsadvantage

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization morphological taggingand dependency parsing performed better Some are also jointly trained alanguage model together with pre-trained embeddings

Attention Mechanism

Applying attention in between σ-LSTM states or β-LSTM orAction-LSTM may bring performance improvement

Morphological Features

Finding different way to represent morphological features

Dynamic Oracle vs Beam Training

Although I tried both of them I could not obtain performanceimprovement There may be convergence problems with our loss functionand another losses (CRF) may solve this problem

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap Erenay Dayanık and Deniz Yuret 2018 Tree-stackLSTM in Transition Based Dependency Parsing In Proceedings ofthe CoNLL 2018 Shared Task Multilingual Parsing from Raw Text toUniversal Dependencies

Omer Kırnap Berkay Furkan Onder and Deniz Yuret 2017 Parsingwith Context Embeddings In Proceedings of the CoNLL 2017 SharedTask Multilingual Parsing from Raw Text to Universal Dependencies

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann Carlos Gomez-Rodriguez and Giorgio Satta 2011Dynamic programming algorithms for transition-based dependencyparsers In Proceedings of the 49th Annual Meeting of theAssociation for Computational Linguistics Human LanguageTechnologies-Volume 1 Association for Computational Linguisticspages 673682

S Kbler R McDonald and J Nivre 2009 Dependency parsingMorgan amp Claypool US

Chris Dyer Miguel Ballesteros Wang Ling Austin Matthews andNoah A Smith 2015 Transition based dependency parsing withstack long-short term memory CoRR abs150508075

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123

  • Introduction
    • Overview of Dependency Parsing
    • Transition Based Dependency Parsing
      • Related Work
        • Linear Models and their Drawbacks
        • Neural Network Models
          • Model
            • Language Model
            • MLP Parser
            • Tree-stack LSTM Parser
              • Results
                • MLP vs Tree-stack LSTM
                • Morphological Feature Embeddings
                • Static vs Dynamic Oracle Training
                • Transfer Learning
                  • Conclusion
                  • Future Work amp Discussions
Page 41: Transition Based Dependency Parsing with Deep LearningTransition Based Dependency Parsing with Deep Learning Omer K rnap Ko˘c University okirnap@ku.edu.tr September 27, 2018 Omer

MLP Parser - Feature Extraction

Feature extractor describes current state

Figure Kırnap et al 2017Omer Kırnap (Koc University) MSc Thesis September 27 2018 41 123

MLP Parser - Decision Module

Decision module (MLP) decides the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 42 123

Experiments amp Dataset (MLP) CoNLL17

CoNLL17 Dataset

Dependency parsing of 81 treebanks in 49 languages

All treebanks use standardized annotationbull 17 universal part-of-speech tagsbull 37 universal dependency relations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 43 123

Experiments - Evaluation Metric

Labeled Attachment Score (LAS)The percentage of words correctly assigned both the correct syntactic headand the correct dependency label

Economic news hadGold Tree LAS 1

SBJATT

Economic news had

PREDOBJPred 1 LAS 0

Economic news had

OBJATTPred 2 LAS (frac12)100

Omer Kırnap (Koc University) MSc Thesis September 27 2018 44 123

Experiments (MLP)

CoNLL 2017 Results (all treebanks LAS)

Ranked 1st among transition based parsers 5

5Source CoNLL17 official results pageOmer Kırnap (Koc University) MSc Thesis September 27 2018 45 123

Contributions in CoNLL17

Omer Kırnap (Koc University) MSc Thesis September 27 2018 46 123

Context and Word Embeddings

Relative contributions of part-of-speech (p) word vector (v)context vector (c)

Feats Hungarian En-ParTUT Latvianp 636 766 559

v 735 759 63

c 722 76 635

v-c 76 79 676

p-c 78 825 706

p-v 766 808 677

p-fb 747 797 663

p-v-c 793 832 742

Omer Kırnap (Koc University) MSc Thesis September 27 2018 47 123

Context and Word Embeddings

Relative contributions of part-of-speech (p) word vector (v)context vector (c)

Feats Hungarian En-ParTUT Latvianp 636 766 559

v 735 759 63

c 722 76 635

v-c 76 79 676

p-c 78 825 706

p-v 766 808 677

p-fb 747 797 663

p-v-c 793 832 742

Context vectors provide independent contribution on top ofPOS tags

Omer Kırnap (Koc University) MSc Thesis September 27 2018 48 123

Context and Word embeddings

Feats Hungarian En-ParTUT Latvianp 636 766 559

v 735 759 63

c 722 76 635

v-c 76 79 676

p-c 78 825 706

p-v 766 808 677p-fb 747 797 663

p-v-c 793 832 742

Our BiLSTM language model word vectors perform betterthan FB vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 49 123

Context and Word embeddings

Feats Hungarian En-ParTUT Latvianp 636 766 559

v 735 759 63

c 722 76 635

v-c 76 79 676

p-c 78 825 706

p-v 766 808 677

p-fb 747 797 663

p-v-c 793 832 742

Both POS tags and context vectors have significantcontributions on top of word vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 50 123

Issues with MLP

However

Choosing correct state of parser still remains critical

We are unable to represent whole parsing history with featureextracting

Omer Kırnap (Koc University) MSc Thesis September 27 2018 51 123

Solution

Find a recurrent architecture such that it can summarize the parsinghistory as well as word sequences in a buffer and stack

Omer Kırnap (Koc University) MSc Thesis September 27 2018 52 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to UniversalDependencies

CoNLL17

bull Koc-University team with MLP Parser using Context Embeddings

CoNLL18

bull KParse team with Tree-stack LSTM Parser usingContext and Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 53 123

c Tree-stack LSTM Parser (CoNLL18)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 54 123

Related Work - Stack LSTM

Figure Stack LSTM [Dyer et al 2015]

Represent each component (σ β A) with an LSTMModifying head wordrsquos embedding with dependent embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 55 123

Problems with Stack LSTM

They only modify stackrsquos word embeddings

Hidden states of LSTMS are not updated unless reduce

Actions are not explicitly represented

They only used word2vec embeddings [Mikolov et al 2013]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 56 123

Our solution

We propose

Context embeddings should improve parsing accuracy

Dependency relations should be explicitly represented

Morphological Features of a word may enhance parsing accuracy

Omer Kırnap (Koc University) MSc Thesis September 27 2018 57 123

Tree-stack LSTM Overview

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

We propose Tree-stack LSTM model with 4 components

β-LSTMσ-LSTMAction-LSTMTree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 58 123

Tree-stack LSTM

Input Representation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 59 123

Input Representation

Action and Dependency Relation Embeddings

Every action is represented with continuous vector

Every dependency relation is represented with continuous vector

Omer Kırnap (Koc University) MSc Thesis September 27 2018 60 123

Input Representation

We do not include explicit feature extractor We initiated wordrepresentation by concatenating

Character Based LSTMrsquos word vectors

Word Based BiLSTMrsquos context vectors

Part-of-speech (POS) vectors

Morph-feat vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 61 123

Input Representation

Morp-feat Vectors

Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs IT It

Figure Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 62 123

Tree-stack LSTM

Model Components1 β-LSTM2 σ-LSTM3 Action-LSTM4 Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 63 123

β-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 64 123

β-LSTM

LSTM LSTM LSTM

wi+2wi+1wi

Figure Bufferrsquos β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 65 123

σ-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 66 123

σ-LSTM

LSTM LSTM LSTM

si si+1 si+2

Figure Stackrsquos σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 67 123

Action-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 68 123

Action-LSTM

LSTM LSTM LSTM

Figure Action-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 69 123

How do components of tree-stack LSTM are connected

Omer Kırnap (Koc University) MSc Thesis September 27 2018 70 123

Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 71 123

Tree-RNN (t-RNN)

t-RNN

Dependent word

Dependency Relation

Head word

Figure t-RNN

whead new = tanh(Wrnn lowast [whead old dl wdep] + brnn) (1)

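Equation (1) can be sketched directly in code; the dimension D and the randomly initialized parameters are placeholders.

```python
import numpy as np

rng = np.random.default_rng(2)
D = 8  # hypothetical embedding size

W_rnn = rng.normal(size=(D, 3 * D))  # maps [head; relation; dependent] -> new head
b_rnn = np.zeros(D)

def t_rnn(w_head, d_label, w_dep):
    """Equation (1): compose head, dependency-relation and dependent
    embeddings into a new head embedding."""
    x = np.concatenate([w_head, d_label, w_dep])
    return np.tanh(W_rnn @ x + b_rnn)

w_new = t_rnn(rng.normal(size=D), rng.normal(size=D), rng.normal(size=D))
print(w_new.shape)  # (8,)
```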

Tree-RNN with

1. Left Transition
2. Right Transition


Left Transition


Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: Each embedding is initialized by concatenating POS, language, and morph-feat embeddings


Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: Stack's top LSTM is reduced

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: t-RNN calculates the new head embedding


Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: β-LSTM recalculates its hidden state based on the new input


Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: Tree-stack LSTM is ready to predict the next transition


Right Transition


Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: Each embedding is initialized by concatenating POS, language, and morph-feat embeddings


Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: Stack's top LSTM is reduced


Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: t-RNN calculates the new head embedding


Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: σ-LSTM recalculates its hidden state from the new input


Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: Tree-stack LSTM is ready to predict the next transition

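A minimal sketch of the two transitions on a (stack, buffer, arcs) state, leaving out the LSTM and t-RNN updates; the word indices and labels in the toy run are hypothetical.

```python
def left(stack, buffer, arcs, d):
    """left_d(σ|s, b|β, A): stack top s becomes a dependent of the
    buffer front b with label d."""
    s = stack.pop()
    arcs.add((buffer[0], d, s))      # arcs hold (head, label, dependent)

def right(stack, buffer, arcs, d):
    """right_d(σ|s|t, β, A): stack top t becomes a dependent of the
    item s just below it."""
    t = stack.pop()
    arcs.add((stack[-1], d, t))

# Toy run on word indices; the labels are hypothetical.
stack, buffer, arcs = [0, 2], [3, 4], set()
left(stack, buffer, arcs, "nsubj")   # word 2 attaches to buffer front 3
stack.append(buffer.pop(0))          # shift word 3 onto the stack
right(stack, buffer, arcs, "obj")    # word 3 attaches to word 0 below it
print(sorted(arcs))  # [(0, 'obj', 3), (3, 'nsubj', 2)]
```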

Final overview of Tree-stack LSTM

Figure: Tree-stack LSTM overview. The β-LSTM, σ-LSTM, and Action-LSTM hidden states are concatenated and fed to an MLP; t-RNN connects the components.


Overview

1 Introduction: Overview of Dependency Parsing; Transition Based Dependency Parsing

2 Related Work: Linear Models and their Drawbacks; Neural Network Models

3 Model: Language Model; MLP Parser; Tree-stack LSTM Parser

4 Results: MLP vs Tree-stack LSTM; Morphological Feature Embeddings; Static vs Dynamic Oracle Training; Transfer Learning

5 Conclusion

6 Future Work & Discussions


4 Results & Comparisons


Results & Comparisons

Dataset

CoNLL17:
Dependency parsing of 81 treebanks in 49 languages
All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations
Koc-University ranked 7th out of 33 participants (1st among transition based parsers)

CoNLL18:
Dependency parsing of 82 treebanks in 57 languages
All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations
Koc-University ranked 16th out of 30 participants (2nd among transition based parsers)

Changes from CoNLL17 to CoNLL18: 1. Train/test split change 2. Annotation


MLP vs Tree-stack LSTM

The CoNLL 2018 committee released comparison results of CoNLL17 and CoNLL18 systems tested on the same test sets.


MLP vs Tree-stack LSTM

Two possible problems with the official comparison:

1. If the annotation of the treebank is improved, the older parser is handicapped.

2. If the training-test split has changed and the old training data are now in the test data, the old parser is favored undeservedly.


MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code        MLP    Tree-stack
ru_taiga (10k)   58.89  60.55
hu_szeged (20k)  66.21  68.18
tr_imst (50k)    56.78  58.75
ar_padt (120k)   67.83  68.14
en_ewt (205k)    74.87  75.77
cs_cac (473k)    83.39  83.57

Tree-stack LSTM outperforms MLP


Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM


MLP Parser

Figure: Initial model (MLP)


Only Action LSTM

Figure: Only action LSTM


Only β-LSTM

Figure: Only β-LSTM


Only σ-LSTM

Figure: Only σ-LSTM


Ablation Analysis Results

Lang Code  MLP    Only Action  Only-β  Only-σ
hu_szeged  66.21  66.87        66.94   67.03
sv_lines   71.12  72.05        72.17   72.45
tr_imst    57.12  56.87        57.02   57.12
ar_padt    67.83  66.67        66.89   66.92
cs_cac     83.89  82.23        83.13   83.17
en_ewt     75.54  75.43        75.56   75.67

Table: Comparison between MLP and "Only" models


Ablation of t-RNN

Figure: Tree-stack LSTM overview, t-RNN highlighted


Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang Code           without t-RNN  with t-RNN
no_nynorsklia (3k)  51.78          53.33
ru_taiga (11k)      59.13          60.55
gl_treegal (15k)    69.76          70.45
hu_szeged (20k)     66.12          68.18
sv_lines (49k)      74.04          75.46
tr_imst (50k)       58.12          58.75
ar_padt (120k)      68.04          68.14
en_ewt (204k)       74.87          75.77
cs_cac (473k)       82.89          83.57
cs_pdt (1M)         81.17          81.16

t-RNN provides a comparative advantage for low-resource languages


Ablation Analysis

Overall results of ablation analysis

Lang       MLP    Only-A  Only-β  Only-σ  w/o t-RNN  all
hu_szeged  66.21  66.87   66.94   67.03   66.12      68.18
sv_lines   71.12  72.05   72.17   74.04   72.17      75.46
tr_imst    57.12  56.87   57.02   57.12   58.12      58.75
ar_padt    67.83  66.67   66.89   66.92   68.04      68.14
cs_cac     83.89  82.23   83.13   83.17   82.89      83.57
en_ewt     75.54  75.43   75.56   75.67   74.87      75.77

Tree-stack LSTM beats other model variations


Ablation Analysis

Conclusions of Ablation Experiments

t-RNN's performance contribution increases when the training size decreases.

σ-LSTM provides more useful information, independent of dataset size.

Interconnecting the model's components with t-RNN makes tree-stack LSTM more powerful for low-resource languages (ranked 10th overall and 2nd among transition based parsers).


What does Morphological Feature Embedding provide?


Contribution of Morph-feat Embeddings

Experimental Settings: We divide the CoNLL18 UD dataset into 4 parts based on the number of training tokens for each language, to better understand our contributions:

Languages having less than 20k tokens

Languages having more than 20k less than 50k tokens

Languages having more than 50k less than 100k tokens

Languages having 100k tokens or more


Contribution of Morph-feat embeddings

Morph-feat experiments for languages having less than 20k training tokens

Lang code      Morph-Feats  no Morph-Feats  # of tokens
no_nynorsklia  51.13        53.33           3,583
ru_taiga       58.32        60.55           10,479
sme_giella     52.78        53.39           16,385
la_perseus     49.93        51.60           18,184
ug_udt         52.78        53.39           19,262
sl_sst         46.72        48.77           19,473
hu_szeged      66.23        68.18           20,166

Not useful for languages having less than 20k training tokens


Contribution of Morph-feat embeddings

Morph-feat experiments for languages having between 50k and 100k training tokens

Lang code      Morph-Feats  no Morph-Feats  # of tokens
sv_lines       72.18        74.81           48,325
fr_sequoia     84.36        82.17           50,543
en_gum         76.44        75.34           53,686
ko_gsd         73.74        72.54           56,687
eu_bdt         74.55        73.32           72,974
nl_lassysmall  76.70        75.80           75,134
gl_ctg         79.02        79.018          79,327
lv_lvtb        72.33        72.24           80,666
id_gsd         75.76        73.97           97,531

Beneficial for languages with 50k-100k training tokens


Contribution of Morph-feat embeddings

Morph-feat experiments for languages having more than 100k training tokens

Lang code  Morph-Feats  no Morph-Feats  # of tokens
fa_seraji  81.18        81.12           121,064
bg_btb     84.53        84.55           124,336
en_ewt     75.77        75.682          204,585
ar_padt    68.02        68.14           223,881
de_gsd     71.59        71.32           263,804
ca_ancora  85.89        85.874          417,587
es_ancora  84.99        84.78           444,617
cs_cac     83.57        83.63           472,608
cs_pdt     81.43        82.12           1,173,282

Neutral for languages having more than 100k training tokens


Static vs Dynamic Oracle Training

Static oracle: transitions follow gold moves. Dynamic oracle: transitions follow predicted moves.

In both cases, the log-probability of gold moves is maximized.

Figure: Tree-stack LSTM overview

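The difference can be sketched as a single training step: both variants accumulate the same loss on the gold move, and only the followed transition differs. The scores and moves below are toy values; the real model's scoring is omitted.

```python
import math

def softmax(xs):
    m = max(xs)
    es = [math.exp(x - m) for x in xs]
    z = sum(es)
    return [e / z for e in es]

def oracle_step(scores, gold_move, dynamic):
    """Both oracles add -log p(gold) to the loss; they differ only in
    which transition the parser actually follows next."""
    p = softmax(scores)
    loss = -math.log(p[gold_move])
    predicted = max(range(len(p)), key=p.__getitem__)
    return loss, (predicted if dynamic else gold_move)

scores, gold = [2.0, 0.5, 1.0], 2        # toy transition scores
loss_s, next_s = oracle_step(scores, gold, dynamic=False)
loss_d, next_d = oracle_step(scores, gold, dynamic=True)
print(next_s, next_d)  # 2 0: static follows gold, dynamic follows argmax
```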

Static vs Dynamic Oracle Training

Figure: Results are very close for training tokens less than 20k


Static vs Dynamic Oracle Training

Figure: Results are very close for training tokens between 20k and 50k


Static vs Dynamic Oracle Training

Figure: Results are very close for training tokens more than 50k


How about languages with less than 20k training tokens?


Transfer Learning

There are 4 possible types of transfer learning:

1. Using very limited data to train an LM for word and context vectors, and using them to train a parser from scratch
2. Using Facebook's word vectors to train a parser [Bojanowski et al., 2017]
3. Using my own word and context vectors trained on a different language from the same language family
4. Applying transfer learning with a pre-trained parser

Language      (1)           (2)    (3)    (4)
af_afribooms  not provided  75.46  77.43  78.12
kk_ktb        20.19         22.31  21.96  23.86
bxr_bdt       7.64          9.76   9.93   8.98
kmr_mg        20.12         22.57  22.78  23.39

Table: LAS values for strategies (1), (2), (3), and (4)


Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the most beneficial.

From-scratch LM training does not bring useful word and context vectors.

Our word and context vectors are still more useful than Facebook's [Bojanowski et al., 2017].


Projectivity

Transition-based parsers can only build projective trees. 6

6 Figure from https://stp.lingfil.uu.se/~sara/kurser/5LN455-2014/lectures/5LN455-F8.pdf

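Projectivity can be checked by testing whether any two arcs cross; this head-array sketch is my own illustration, not code from the thesis.

```python
def is_projective(heads):
    """heads[i-1] is the head of word i (0 means root), words 1-indexed.
    A dependency tree is projective iff no two arcs cross."""
    arcs = [(min(h, d), max(h, d)) for d, h in enumerate(heads, start=1)]
    for i, (l1, r1) in enumerate(arcs):
        for l2, r2 in arcs[i + 1:]:
            if l1 < l2 < r1 < r2 or l2 < l1 < r2 < r1:
                return False
    return True

print(is_projective([2, 0, 2]))      # True: all arcs nest
print(is_projective([3, 4, 0, 3]))   # False: arcs (1,3) and (2,4) cross
```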

Projective vs Non-projective

We compared our model with the best model for different projectivity ratios:

Language     Projectivity  Best (LAS)  Our (LAS)
grc_perseus  90.7%         79.39       55.03 (20)
eu_bdt       95.13%        84.22       74.13 (17)
hu_szeged    97.8%         82.66       68.18 (14)
da_ddt       98.26%        86.28       76.40 (17)
en_gum       99.6%         85.05       76.44 (15)
gl_treegal   100%          74.25       70.45 (10)
gl_ctg       100%          82.12       79.45 (14)

Table: Our model's performance gap decreases as the projectivity ratio increases. 7

7 From the official results page and our projectivity table

Conclusions


Conclusion

In conclusion: We introduced "Context Word and Morph-feat" embeddings and showed their contribution in transition based dependency parsing.

Our Tree-stack LSTM outperformed MLP by removing hand-crafted feature engineering.

Tree-stack LSTM performed better with low-resource languages.

When the training dataset size increases, tree-stack LSTM loses its advantage.


Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization, morphological tagging, and dependency parsing performed better. Some also jointly train a language model together with pre-trained embeddings.

Attention Mechanism

Applying attention between σ-LSTM, β-LSTM, or Action-LSTM states may bring a performance improvement.

Morphological Features

Finding a different way to represent morphological features.

Dynamic Oracle vs Beam Training

Although I tried both of them, I could not obtain a performance improvement. There may be convergence problems with our loss function, and other losses (e.g., CRF) may solve this problem.


Publications

Omer Kırnap, Erenay Dayanık, and Deniz Yuret. 2018. Tree-stack LSTM in Transition Based Dependency Parsing. In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap, Berkay Furkan Onder, and Deniz Yuret. 2017. Parsing with Context Embeddings. In Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.


References

Marco Kuhlmann, Carlos Gómez-Rodríguez, and Giorgio Satta. 2011. Dynamic programming algorithms for transition-based dependency parsers. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Volume 1, pages 673-682. Association for Computational Linguistics.

S. Kübler, R. McDonald, and J. Nivre. 2009. Dependency Parsing. Morgan & Claypool, US.

Chris Dyer, Miguel Ballesteros, Wang Ling, Austin Matthews, and Noah A. Smith. 2015. Transition-based dependency parsing with stack long short-term memory. CoRR abs/1505.08075.


Thank you for your attention


Questions?


  • Introduction
    • Overview of Dependency Parsing
    • Transition Based Dependency Parsing
      • Related Work
        • Linear Models and their Drawbacks
        • Neural Network Models
          • Model
            • Language Model
            • MLP Parser
            • Tree-stack LSTM Parser
              • Results
                • MLP vs Tree-stack LSTM
                • Morphological Feature Embeddings
                • Static vs Dynamic Oracle Training
                • Transfer Learning
                  • Conclusion
                  • Future Work amp Discussions
Page 42: Transition Based Dependency Parsing with Deep LearningTransition Based Dependency Parsing with Deep Learning Omer K rnap Ko˘c University okirnap@ku.edu.tr September 27, 2018 Omer

MLP Parser - Decision Module

Decision module (MLP) decides the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 42 123

Experiments amp Dataset (MLP) CoNLL17

CoNLL17 Dataset

Dependency parsing of 81 treebanks in 49 languages

All treebanks use standardized annotationbull 17 universal part-of-speech tagsbull 37 universal dependency relations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 43 123

Experiments - Evaluation Metric

Labeled Attachment Score (LAS)The percentage of words correctly assigned both the correct syntactic headand the correct dependency label

Economic news hadGold Tree LAS 1

SBJATT

Economic news had

PREDOBJPred 1 LAS 0

Economic news had

OBJATTPred 2 LAS (frac12)100

Omer Kırnap (Koc University) MSc Thesis September 27 2018 44 123

Experiments (MLP)

CoNLL 2017 Results (all treebanks LAS)

Ranked 1st among transition based parsers 5

5Source CoNLL17 official results pageOmer Kırnap (Koc University) MSc Thesis September 27 2018 45 123

Contributions in CoNLL17

Omer Kırnap (Koc University) MSc Thesis September 27 2018 46 123

Context and Word Embeddings

Relative contributions of part-of-speech (p) word vector (v)context vector (c)

Feats Hungarian En-ParTUT Latvianp 636 766 559

v 735 759 63

c 722 76 635

v-c 76 79 676

p-c 78 825 706

p-v 766 808 677

p-fb 747 797 663

p-v-c 793 832 742

Omer Kırnap (Koc University) MSc Thesis September 27 2018 47 123

Context and Word Embeddings

Relative contributions of part-of-speech (p) word vector (v)context vector (c)

Feats Hungarian En-ParTUT Latvianp 636 766 559

v 735 759 63

c 722 76 635

v-c 76 79 676

p-c 78 825 706

p-v 766 808 677

p-fb 747 797 663

p-v-c 793 832 742

Context vectors provide independent contribution on top ofPOS tags

Omer Kırnap (Koc University) MSc Thesis September 27 2018 48 123

Context and Word embeddings

Feats Hungarian En-ParTUT Latvianp 636 766 559

v 735 759 63

c 722 76 635

v-c 76 79 676

p-c 78 825 706

p-v 766 808 677p-fb 747 797 663

p-v-c 793 832 742

Our BiLSTM language model word vectors perform betterthan FB vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 49 123

Context and Word embeddings

Feats Hungarian En-ParTUT Latvianp 636 766 559

v 735 759 63

c 722 76 635

v-c 76 79 676

p-c 78 825 706

p-v 766 808 677

p-fb 747 797 663

p-v-c 793 832 742

Both POS tags and context vectors have significantcontributions on top of word vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 50 123

Issues with MLP

However

Choosing correct state of parser still remains critical

We are unable to represent whole parsing history with featureextracting

Omer Kırnap (Koc University) MSc Thesis September 27 2018 51 123

Solution

Find a recurrent architecture such that it can summarize the parsinghistory as well as word sequences in a buffer and stack

Omer Kırnap (Koc University) MSc Thesis September 27 2018 52 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to UniversalDependencies

CoNLL17

bull Koc-University team with MLP Parser using Context Embeddings

CoNLL18

bull KParse team with Tree-stack LSTM Parser usingContext and Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 53 123

c Tree-stack LSTM Parser (CoNLL18)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 54 123

Related Work - Stack LSTM

Figure Stack LSTM [Dyer et al 2015]

Represent each component (σ β A) with an LSTMModifying head wordrsquos embedding with dependent embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 55 123

Problems with Stack LSTM

They only modify stackrsquos word embeddings

Hidden states of LSTMS are not updated unless reduce

Actions are not explicitly represented

They only used word2vec embeddings [Mikolov et al 2013]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 56 123

Our solution

We propose

Context embeddings should improve parsing accuracy

Dependency relations should be explicitly represented

Morphological Features of a word may enhance parsing accuracy

Omer Kırnap (Koc University) MSc Thesis September 27 2018 57 123

Tree-stack LSTM Overview

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

We propose Tree-stack LSTM model with 4 components

β-LSTMσ-LSTMAction-LSTMTree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 58 123

Tree-stack LSTM

Input Representation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 59 123

Input Representation

Action and Dependency Relation Embeddings

Every action is represented with continuous vector

Every dependency relation is represented with continuous vector

Omer Kırnap (Koc University) MSc Thesis September 27 2018 60 123

Input Representation

We do not include explicit feature extractor We initiated wordrepresentation by concatenating

Character Based LSTMrsquos word vectors

Word Based BiLSTMrsquos context vectors

Part-of-speech (POS) vectors

Morph-feat vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 61 123

Input Representation

Morp-feat Vectors

Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs IT It

Figure Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 62 123

Tree-stack LSTM

Model Components1 β-LSTM2 σ-LSTM3 Action-LSTM4 Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 63 123

β-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 64 123

β-LSTM

LSTM LSTM LSTM

wi+2wi+1wi

Figure Bufferrsquos β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 65 123

σ-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 66 123

σ-LSTM

LSTM LSTM LSTM

si si+1 si+2

Figure Stackrsquos σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 67 123

Action-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 68 123

Action-LSTM

LSTM LSTM LSTM

Figure Action-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 69 123

How do components of tree-stack LSTM are connected

Omer Kırnap (Koc University) MSc Thesis September 27 2018 70 123

Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 71 123

Tree-RNN (t-RNN)

t-RNN

Dependent word

Dependency Relation

Head word

Figure t-RNN

whead new = tanh(Wrnn lowast [whead old dl wdep] + brnn) (1)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 72 123

Tree-RNN with

1 Left Transition2 Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 73 123

Left Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 74 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM LSTM

Left transition

t-RNN

Dependency Relation

HeadDependent

Figure Each embedding initiated by concatenating POS language andmorph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 75 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

Figure Stackrsquos top LSTM is reducedOmer Kırnap (Koc University) MSc Thesis September 27 2018 76 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 77 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure β-LSTM recalculates its hidden based on new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 78 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure Tree-stack LSTM is ready to give new transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 79 123

Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 80 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure Each embedding initiated by concatenating POS language andmorph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 81 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure Stackrsquos top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 82 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 83 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure σ-LSTM recalculates its hidden from new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 84 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure Tree-stack LSTM is ready to give new transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 85 123

Final overview of Tree-stack LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 86 123

Overview

1 IntroductionOverview of Dependency ParsingTransition Based Dependency Parsing

2 Related WorkLinear Models and their DrawbacksNeural Network Models

3 ModelLanguage ModelMLP ParserTree-stack LSTM Parser

4 ResultsMLP vs Tree-stack LSTMMorphological Feature EmbeddingsStatic vs Dynamic Oracle TrainingTransfer Learning

5 Conclusion6 Future Work amp Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 87 123

4 Results amp Comparisons

Omer Kırnap (Koc University) MSc Thesis September 27 2018 88 123

Results amp Comparisons

Dataset

Dependency parsing of 81

treebanks in 49 languages

All treebanks use standardized

annotation

17 universal

part-of-speech tags

37 universal dependency

relations

Koc-University ranked 7th out

of 33 participants (1st among

transition based parsers)

Dependency parsing of 82

treebanks in 57 languages

All treebanks use

standardized annotation

17 universal

part-of-speech tags

37 universal dependency

relations

Koc-University ranked 16th

out of 30 participants (2nd

among transition based

parsers)

CoNLL17 CoNLL181 Traintest split change 2 Annotation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 89 123

MLP vs Tree-stack LSTM

CoNLL 2018 committee released comparison results of CoNLL17 andCoNLL18 systems tested under the same test sets

Omer Kırnap (Koc University) MSc Thesis September 27 2018 90 123

MLP vs Tree-stack LSTM

2 possible problems of official comparison

1 If the annotation of the tree bank is improved the older parser ishandicapped

2 If the training-test split has changed and the old training data arenow in test data the old parser is favored undeservedly

Omer Kırnap (Koc University) MSc Thesis September 27 2018 91 123

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code MLP Tree-stackru taiga (10k) 5889 6055hu szeged (20k) 6621 6818tr imst (50k) 5678 5875ar padt (120k) 6783 6814en ewt (205k) 7487 7577cs cac (473k) 8339 8357

Tree-stack LSTM outperforms MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 92 123

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 93 123

MLP Parser

MLP

Figure Initial model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 94 123

Only Action LSTM

LSTM LSTM

Figure Only action LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 95 123

Only β-LSTM

LSTM LSTM LSTM

LSTM LSTM MLP

Figure Only β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 96 123

Only σ-LSTM

LSTM LSTM

LSTM LSTM MLP

Figure Only σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang Code MLP Only Action Only-β Only-σhu szeged 6621 6687 6694 6703sv lines 7112 7205 7217 7245tr imst 5712 5687 5702 5712ar padt 6783 6667 6689 6692

cs cac 8389 8223 8313 8317

en ewt 7554 7543 7556 7567

Table Comparison between MLP and rdquoOnlyrdquo models

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 99 123

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang Code without t-RNN with t-RNNno nynorsklia (3k) 5178 5333ru taiga (11k) 5913 6055gl treegal (15k) 6976 7045hu szeged (20k) 6612 6818sv lines (49k) 7404 7546tr imst (50k) 5812 5875

ar padt (120k) 6804 6814

en ewt (204k) 7487 7577

cs cac (473k) 8289 8357

cs pdt (1M) 8117 81164

t-RNN provides comparative advantage for low-resourcelanguages

Omer Kırnap (Koc University) MSc Thesis September 27 2018 100 123

Ablation Analysis

Overall results of ablation analysis

Lang MLP Only A Only-β Only-σ wot-RNN allhu szeged 6621 6687 6694 6703 6612 6818sv lines 7112 7205 7217 7404 7217 7546tr imst 5712 5687 5702 5712 5812 5875ar padt 6783 6667 6689 6692 6804 6814cs cac 8389 8223 8313 8317 8289 8357en ewt 7554 7543 7556 7567 7487 7577

Tree-stack LSTM beats other model variations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of Ablation Experiments

t-RNNrsquos performance contribution increases when the training sizedecreases

σ-LSTM provides more useful information independent from datasetsize

Interconnecting modelrsquos component with t-RNN makes tree-stackLSTM more powerful for low-resource languages (ranked 10th of alland 2nd among transition based parsers)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What does Morphological Feature Embedding provide

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental SettingsWe divide Conll18 UD dataset 22 into 4 parts based on number oftraining tokens for each language to better understand our contributions

Languages having less than 20k tokens

Languages having more than 20k less than 50k tokens

Languages having more than 50k less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having less than 20k training tokens

Lang code Morph-Feats no Morph-Feats of tokensno nynorsklia 5113 5333 3583

ru taiga 5832 6055 10479

sme giella 5278 5339 16385

la perseus 4993 516 18184

ug udt 5278 5339 19262

sl sst 4672 4877 19473

hu szeged 6623 6818 20166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having tokens in between 50k and100k

Lang code Morph-Feats no Morph-Feats of tokenssv lines 7218 7481 48325

fr sequoia 8436 8217 50543

en gum 7644 7534 53686

ko gsd 7374 7254 56687

eu bdt 7455 7332 72974

nl lassymal 767 758 75134

gl ctg 7902 79018 79327

lv lvtb 7233 7224 80666

id gsd 7576 7397 97531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having more than 100k training tokens

Lang code   Morph-Feats   no Morph-Feats   # of tokens
fa seraji   81.18         81.12            121064
bg btb      84.53         84.55            124336
en ewt      75.77         75.682           204585
ar padt     68.02         68.14            223881
de gsd      71.59         71.32            263804
ca ancora   85.89         85.874           417587
es ancora   84.99         84.78            444617
cs cac      83.57         83.63            472608
cs pdt      81.43         82.12            1173282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle: transitions follow gold moves
Dynamic oracle: transitions follow predicted moves

In both cases, the log-probability of the gold moves is maximized

[Figure: Tree-stack LSTM: σ-LSTM (stack), β-LSTM (buffer), and Action-LSTM outputs are concatenated into an MLP; t-RNN combines head word, dependent word, and dependency relation]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123
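The distinction above can be sketched in a few lines. This is an illustrative sketch, not the thesis implementation: `train_step` and the move names are hypothetical, and the real parser scores moves with the network described earlier. Both regimes compute the same loss on the gold move; they differ only in which move is executed to reach the next parser state.

```python
# Sketch: static vs. dynamic oracle training. In both cases the loss
# maximizes log p of the gold move; they differ in the executed move.
import math

def train_step(gold_move, scores, dynamic=False):
    """scores: dict move -> probability (assumed already normalized)."""
    loss = -math.log(scores[gold_move])         # -log p(gold move)
    if dynamic:
        executed = max(scores, key=scores.get)  # follow the model's prediction
    else:
        executed = gold_move                    # follow the gold transition
    return loss, executed

scores = {"shift": 0.2, "left": 0.7, "right": 0.1}
loss_s, move_s = train_step("shift", scores, dynamic=False)
loss_d, move_d = train_step("shift", scores, dynamic=True)
```

With a dynamic oracle the parser visits states its own (possibly wrong) predictions lead to, which is why it can help on harder, low-resource settings.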

Static vs Dynamic Oracle Training

Figure: Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure: Results are very close for training tokens between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure: Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning:
1. Using very limited data to train an LM for word and context vectors, and using them to train a parser from scratch
2. Using Facebook's word vectors to train a parser [Bojanowski et al., 2017]
3. Using my own word and context vectors, trained on a different language from the same language family
4. Applying transfer learning with a pre-trained parser

Language       (1)            (2)      (3)      (4)
af afribooms   not provided   75.46    77.43    78.12
kk ktb         20.19          22.31    21.96    23.86
bxr bdt        7.64           9.76     9.93     8.98
kmr mg         20.12          22.57    22.78    23.39

Table: LAS values for strategies (1), (2), (3), and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the most beneficial

From-scratch LM training does not bring useful word and context vectors

Our word and context vectors are still more useful than Facebook's [Bojanowski et al., 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition based parsers can only build projective trees 6

6 Figure from http://stp.lingfil.uu.se/~sara/kurser/5LN455-2014/lectures/5LN455-F8.pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123
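Projectivity can be checked directly from the head list: a tree is projective iff no two dependency arcs cross. A minimal checker is sketched below (an illustration, not the thesis code); `heads[i]` is the 1-based head of token i+1, with 0 denoting the root.

```python
# A dependency tree is projective iff no two arcs cross.
def is_projective(heads):
    # arcs as (left endpoint, right endpoint) position pairs
    arcs = [(min(h, d), max(h, d)) for d, h in enumerate(heads, start=1)]
    for (l1, r1) in arcs:
        for (l2, r2) in arcs:
            if l1 < l2 < r1 < r2:   # arcs (l1,r1) and (l2,r2) cross
                return False
    return True

ok = is_projective([2, 0, 2])       # projective 3-token tree
bad = is_projective([3, 4, 0, 3])   # arcs (1,3) and (2,4) cross
```

A transition based parser as defined here can never produce a tree for which `is_projective` is False, which explains the performance gap on treebanks with low projectivity ratios.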

Projective vs Non-projective

We compared our model with the best model for different projectivity ratios

Language      Projectivity   Best (LAS)   Our (LAS)
grc perseus   90.7           79.39        55.03 (20)
eu bdt        95.13          84.22        74.13 (17)
hu szeged     97.8           82.66        68.18 (14)
da ddt        98.26          86.28        76.40 (17)
en gum        99.6           85.05        76.44 (15)
gl treegal    100            74.25        70.45 (10)
gl ctg        100            82.12        79.45 (14)

Table: Our model's performance gap decreases as the projectivity ratio increases 7

7 From the official results page and our projectivity table
Omer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusion:
We introduced "Context, Word, and Morph-feat" embeddings and showed their contribution in transition based dependency parsing

Our Tree-stack LSTM outperformed the MLP by removing hand-crafted feature engineering

Tree-stack LSTM performed better on low-resource languages

When the training dataset size increases, tree-stack LSTM loses its advantage

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization, morphological tagging, and dependency parsing performed better. Some also jointly train a language model together with pre-trained embeddings.

Attention Mechanism

Applying attention between σ-LSTM, β-LSTM, or Action-LSTM states may bring performance improvements.

Morphological Features

Finding different ways to represent morphological features.

Dynamic Oracle vs Beam Training

Although I tried both of them, I could not obtain a performance improvement. There may be convergence problems with our loss function, and other losses (e.g. CRF) may solve this problem.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap Erenay Dayanık and Deniz Yuret 2018 Tree-stackLSTM in Transition Based Dependency Parsing In Proceedings ofthe CoNLL 2018 Shared Task Multilingual Parsing from Raw Text toUniversal Dependencies

Omer Kırnap Berkay Furkan Onder and Deniz Yuret 2017 Parsingwith Context Embeddings In Proceedings of the CoNLL 2017 SharedTask Multilingual Parsing from Raw Text to Universal Dependencies

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann, Carlos Gómez-Rodríguez, and Giorgio Satta. 2011. Dynamic programming algorithms for transition-based dependency parsers. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies - Volume 1. Association for Computational Linguistics, pages 673-682.

S. Kübler, R. McDonald, and J. Nivre. 2009. Dependency parsing. Morgan & Claypool, US.

Chris Dyer, Miguel Ballesteros, Wang Ling, Austin Matthews, and Noah A. Smith. 2015. Transition based dependency parsing with stack long-short term memory. CoRR abs/1505.08075.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123

  • Introduction
    • Overview of Dependency Parsing
    • Transition Based Dependency Parsing
      • Related Work
        • Linear Models and their Drawbacks
        • Neural Network Models
          • Model
            • Language Model
            • MLP Parser
            • Tree-stack LSTM Parser
              • Results
                • MLP vs Tree-stack LSTM
                • Morphological Feature Embeddings
                • Static vs Dynamic Oracle Training
                • Transfer Learning
                  • Conclusion
                  • Future Work amp Discussions

Experiments amp Dataset (MLP) CoNLL17

CoNLL17 Dataset

Dependency parsing of 81 treebanks in 49 languages

All treebanks use standardized annotation:
• 17 universal part-of-speech tags
• 37 universal dependency relations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 43 123

Experiments - Evaluation Metric

Labeled Attachment Score (LAS): the percentage of words correctly assigned both the correct syntactic head and the correct dependency label

[Figure: for the sentence "Economic news had", the gold tree has arcs ATT and SBJ; Pred 1 (PRED, OBJ) gets LAS 0; Pred 2 (ATT, OBJ) gets one of two arcs right, LAS = (1/2) * 100 = 50]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 44 123
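The metric above is easy to state in code. This is a minimal sketch of LAS over per-word (head, label) pairs, using the "Economic news had" example from the slide; the function name and data layout are illustrative, not the official evaluation script.

```python
# LAS: percentage of words whose predicted head AND label both match gold.
def las(gold, pred):
    """gold, pred: lists of (head_index, label) per word."""
    correct = sum(1 for g, p in zip(gold, pred) if g == p)
    return 100.0 * correct / len(gold)

# "Economic news had": gold arcs ATT(news -> Economic), SBJ(had -> news)
gold = [(2, "ATT"), (3, "SBJ")]
pred1 = [(3, "OBJ"), (3, "PRED")]   # both arcs wrong
pred2 = [(2, "ATT"), (3, "OBJ")]    # one of two arcs correct
score1 = las(gold, pred1)
score2 = las(gold, pred2)
```

Unlabeled Attachment Score (UAS) is the same computation with only the head index compared.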

Experiments (MLP)

CoNLL 2017 Results (all treebanks LAS)

Ranked 1st among transition based parsers 5

5 Source: CoNLL17 official results page
Omer Kırnap (Koc University) MSc Thesis September 27 2018 45 123

Contributions in CoNLL17

Omer Kırnap (Koc University) MSc Thesis September 27 2018 46 123

Context and Word Embeddings

Relative contributions of part-of-speech (p), word vector (v), and context vector (c):

Feats   Hungarian   En-ParTUT   Latvian
p       63.6        76.6        55.9
v       73.5        75.9        63.0
c       72.2        76.0        63.5
v-c     76.0        79.0        67.6
p-c     78.0        82.5        70.6
p-v     76.6        80.8        67.7
p-fb    74.7        79.7        66.3
p-v-c   79.3        83.2        74.2

Omer Kırnap (Koc University) MSc Thesis September 27 2018 47 123

Context and Word Embeddings

Relative contributions of part-of-speech (p), word vector (v), and context vector (c):

Feats   Hungarian   En-ParTUT   Latvian
p       63.6        76.6        55.9
v       73.5        75.9        63.0
c       72.2        76.0        63.5
v-c     76.0        79.0        67.6
p-c     78.0        82.5        70.6
p-v     76.6        80.8        67.7
p-fb    74.7        79.7        66.3
p-v-c   79.3        83.2        74.2

Context vectors provide independent contribution on top ofPOS tags

Omer Kırnap (Koc University) MSc Thesis September 27 2018 48 123

Context and Word embeddings

Feats   Hungarian   En-ParTUT   Latvian
p       63.6        76.6        55.9
v       73.5        75.9        63.0
c       72.2        76.0        63.5
v-c     76.0        79.0        67.6
p-c     78.0        82.5        70.6
p-v     76.6        80.8        67.7
p-fb    74.7        79.7        66.3
p-v-c   79.3        83.2        74.2

Our BiLSTM language model word vectors perform betterthan FB vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 49 123

Context and Word embeddings

Feats   Hungarian   En-ParTUT   Latvian
p       63.6        76.6        55.9
v       73.5        75.9        63.0
c       72.2        76.0        63.5
v-c     76.0        79.0        67.6
p-c     78.0        82.5        70.6
p-v     76.6        80.8        67.7
p-fb    74.7        79.7        66.3
p-v-c   79.3        83.2        74.2

Both POS tags and context vectors have significantcontributions on top of word vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 50 123

Issues with MLP

However

Choosing the correct state representation of the parser still remains critical

We are unable to represent the whole parsing history with feature extraction

Omer Kırnap (Koc University) MSc Thesis September 27 2018 51 123

Solution

Find a recurrent architecture that can summarize the parsing history as well as the word sequences in the buffer and stack

Omer Kırnap (Koc University) MSc Thesis September 27 2018 52 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to Universal Dependencies

CoNLL17
• Koc-University team with MLP Parser using Context Embeddings

CoNLL18
• KParse team with Tree-stack LSTM Parser using Context and Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 53 123

c Tree-stack LSTM Parser (CoNLL18)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 54 123

Related Work - Stack LSTM

Figure Stack LSTM [Dyer et al 2015]

Represent each component (σ, β, A) with an LSTM
Modify the head word's embedding with the dependent's embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 55 123

Problems with Stack LSTM

They only modify the stack's word embeddings

Hidden states of the LSTMs are not updated unless a reduce occurs

Actions are not explicitly represented

They only used word2vec embeddings [Mikolov et al., 2013]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 56 123

Our solution

We propose

Context embeddings should improve parsing accuracy

Dependency relations should be explicitly represented

Morphological Features of a word may enhance parsing accuracy

Omer Kırnap (Koc University) MSc Thesis September 27 2018 57 123

Tree-stack LSTM Overview

[Figure: Tree-stack LSTM: σ-LSTM (stack), β-LSTM (buffer), and Action-LSTM outputs are concatenated into an MLP; t-RNN combines head word, dependent word, and dependency relation]

We propose the Tree-stack LSTM model with 4 components:

β-LSTM
σ-LSTM
Action-LSTM
Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 58 123

Tree-stack LSTM

Input Representation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 59 123

Input Representation

Action and Dependency Relation Embeddings

Every action is represented with a continuous vector

Every dependency relation is represented with a continuous vector

Omer Kırnap (Koc University) MSc Thesis September 27 2018 60 123

Input Representation

We do not include an explicit feature extractor. We initialize the word representation by concatenating:

Character-based LSTM's word vectors

Word-based BiLSTM's context vectors

Part-of-speech (POS) vectors

Morph-feat vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 61 123
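The concatenation above can be sketched directly. The dimensions below are illustrative stand-ins, not the thesis hyperparameters, and the four input vectors would come from the trained char-LSTM, BiLSTM language model, and embedding tables.

```python
# Sketch: a word's input representation is the concatenation of its
# char-LSTM word vector, BiLSTM context vector, POS vector, and
# morph-feat vector. Dimensions here are illustrative only.
import numpy as np

def word_representation(char_vec, context_vec, pos_vec, feat_vec):
    return np.concatenate([char_vec, context_vec, pos_vec, feat_vec])

rng = np.random.default_rng(0)
x = word_representation(rng.normal(size=350),   # char-LSTM word vector
                        rng.normal(size=300),   # BiLSTM context vector
                        rng.normal(size=128),   # POS embedding
                        rng.normal(size=128))   # morph-feat embedding
```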

Input Representation

Morph-feat Vectors

Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs  (for the word "It")

Figure: Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 62 123
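One plausible reading of the figure above, as code: each `key=value` feature in the CoNLL-U FEATS string gets its own embedding, and the per-feature embeddings are combined into one morph-feat vector. Summing them is an assumption made here for illustration; the thesis may combine them differently.

```python
# Sketch: map a CoNLL-U FEATS string to a fixed-size morph-feat vector
# by summing per-feature embeddings (combination method is an assumption).
import numpy as np

DIM = 32
rng = np.random.default_rng(0)
feat_table = {}   # "key=value" string -> embedding, grown on demand

def feat_embedding(feats):
    """feats e.g. 'Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs'."""
    vec = np.zeros(DIM)
    for f in feats.split("|"):
        if f not in feat_table:
            feat_table[f] = rng.normal(size=DIM)
        vec += feat_table[f]
    return vec

v = feat_embedding("Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs")
```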

Tree-stack LSTM

Model Components:
1. β-LSTM
2. σ-LSTM
3. Action-LSTM
4. Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 63 123

β-LSTM

[Figure: Tree-stack LSTM overview with the β-LSTM (buffer) component highlighted]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 64 123

β-LSTM

Figure: Buffer's β-LSTM running over the buffer words w_i, w_i+1, w_i+2

Omer Kırnap (Koc University) MSc Thesis September 27 2018 65 123

σ-LSTM

[Figure: Tree-stack LSTM overview with the σ-LSTM (stack) component highlighted]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 66 123

σ-LSTM

Figure: Stack's σ-LSTM running over the stack words s_i, s_i+1, s_i+2

Omer Kırnap (Koc University) MSc Thesis September 27 2018 67 123

Action-LSTM

[Figure: Tree-stack LSTM overview with the Action-LSTM component highlighted]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 68 123

Action-LSTM

Figure: Action-LSTM running over the sequence of past transitions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 69 123

How are the components of the tree-stack LSTM connected?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 70 123

Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 71 123

Tree-RNN (t-RNN)

[Figure: t-RNN combines the dependent word, the dependency relation, and the head word]

w_head_new = tanh(W_rnn · [w_head_old ; d_l ; w_dep] + b_rnn)    (1)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 72 123
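Equation (1) in code form: the head word's vector is updated from the old head vector, the dependency-relation embedding d_l, and the dependent's vector. The weights below are randomly initialized stand-ins and the dimension is illustrative.

```python
# Sketch of the t-RNN head update, equation (1):
#   w_head_new = tanh(W_rnn . [w_head_old ; d_l ; w_dep] + b_rnn)
import numpy as np

def t_rnn(w_head, d_l, w_dep, W, b):
    x = np.concatenate([w_head, d_l, w_dep])  # [w_head_old ; d_l ; w_dep]
    return np.tanh(W @ x + b)

d = 4                                         # illustrative dimension
rng = np.random.default_rng(0)
W = rng.normal(size=(d, 3 * d))               # W_rnn
b = np.zeros(d)                               # b_rnn
new_head = t_rnn(np.ones(d), np.zeros(d), np.ones(d), W, b)
```

The tanh keeps the new head vector in the same range as the word embeddings, so it can re-enter the stack or buffer LSTMs after a reduce.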

Tree-RNN with:
1. Left Transition
2. Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 73 123

Left Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 74 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

LSTM LSTM LSTM LSTM

Left transition

t-RNN

Dependency Relation

HeadDependent

Figure: Each embedding is initialized by concatenating POS, language, and morph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 75 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

Figure: Stack's top LSTM is reduced
Omer Kırnap (Koc University) MSc Thesis September 27 2018 76 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 77 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure β-LSTM recalculates its hidden based on new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 78 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure Tree-stack LSTM is ready to give new transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 79 123

Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 80 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

LSTM LSTM LSTM LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure: Each embedding is initialized by concatenating POS, language, and morph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 81 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure: Stack's top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 82 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 83 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure σ-LSTM recalculates its hidden from new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 84 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure Tree-stack LSTM is ready to give new transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 85 123

Final overview of Tree-stack LSTM

[Figure: Tree-stack LSTM: σ-LSTM, β-LSTM, and Action-LSTM outputs are concatenated into an MLP; t-RNN combines head word, dependent word, and dependency relation]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 86 123
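The left/right transitions shown earlier can be sketched as destructive updates on a (stack, buffer, arcs) state. This is an illustrative sketch, not the thesis code: arcs are stored as (head, label, dependent) triples, following the (b, d, s) and (s, d, t) notation of the formulas, and the label strings are made up for the example.

```python
# Sketch of the left/right transitions on a parser state:
#   left_d:  (sigma|s, b|beta, A) -> (sigma, b|beta, A u {(b, d, s)})
#   right_d: (sigma|s|t, beta, A) -> (sigma|s, beta, A u {(s, d, t)})

def left(stack, buffer, arcs, d):
    s = stack.pop()                 # stack top becomes a dependent...
    arcs.add((buffer[0], d, s))     # ...of the buffer front b

def right(stack, buffer, arcs, d):
    t = stack.pop()                 # stack top becomes a dependent...
    arcs.add((stack[-1], d, t))     # ...of the next-to-top s

stack, buffer, arcs = [0, 1], [2, 3], set()
right(stack, buffer, arcs, "obj")   # adds arc (0, "obj", 1), pops 1
left(stack, buffer, arcs, "nsubj")  # adds arc (2, "nsubj", 0), pops 0
```

In the full model, each transition also triggers the t-RNN head update and a recomputation of the affected σ-LSTM or β-LSTM hidden states.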

Overview

1 Introduction
  Overview of Dependency Parsing
  Transition Based Dependency Parsing

2 Related Work
  Linear Models and their Drawbacks
  Neural Network Models

3 Model
  Language Model
  MLP Parser
  Tree-stack LSTM Parser

4 Results
  MLP vs Tree-stack LSTM
  Morphological Feature Embeddings
  Static vs Dynamic Oracle Training
  Transfer Learning

5 Conclusion

6 Future Work & Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 87 123

4 Results & Comparisons

Omer Kırnap (Koc University) MSc Thesis September 27 2018 88 123

Results & Comparisons

Dataset

CoNLL17

Dependency parsing of 81 treebanks in 49 languages

All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations

Koc-University ranked 7th out of 33 participants (1st among transition based parsers)

CoNLL18

Dependency parsing of 82 treebanks in 57 languages

All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations

Koc-University ranked 16th out of 30 participants (2nd among transition based parsers)

Differences between CoNLL17 and CoNLL18: 1. Train/test split change 2. Annotation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 89 123

MLP vs Tree-stack LSTM

The CoNLL 2018 committee released comparison results of CoNLL17 and CoNLL18 systems tested under the same test sets

Omer Kırnap (Koc University) MSc Thesis September 27 2018 90 123

MLP vs Tree-stack LSTM

2 possible problems of the official comparison:

1. If the annotation of the treebank is improved, the older parser is handicapped

2. If the training-test split has changed and the old training data are now in the test data, the old parser is favored undeservedly

Omer Kırnap (Koc University) MSc Thesis September 27 2018 91 123

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code         MLP     Tree-stack
ru taiga (10k)    58.89   60.55
hu szeged (20k)   66.21   68.18
tr imst (50k)     56.78   58.75
ar padt (120k)    67.83   68.14
en ewt (205k)     74.87   75.77
cs cac (473k)     83.39   83.57

Tree-stack LSTM outperforms MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 92 123

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 93 123

MLP Parser

MLP

Figure Initial model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 94 123

Only Action LSTM

LSTM LSTM

Figure Only action LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 95 123

Only β-LSTM

LSTM LSTM LSTM

LSTM LSTM MLP

Figure Only β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 96 123

Only σ-LSTM

LSTM LSTM

LSTM LSTM MLP

Figure Only σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang Code   MLP     Only Action   Only-β   Only-σ
hu szeged   66.21   66.87         66.94    67.03
sv lines    71.12   72.05         72.17    72.45
tr imst     57.12   56.87         57.02    57.12
ar padt     67.83   66.67         66.89    66.92
cs cac      83.89   82.23         83.13    83.17
en ewt      75.54   75.43         75.56    75.67

Table: Comparison between MLP and "Only" models

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN

[Figure: Tree-stack LSTM overview with the t-RNN component highlighted]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 99 123

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang Code            without t-RNN   with t-RNN
no nynorsklia (3k)   51.78           53.33
ru taiga (11k)       59.13           60.55
gl treegal (15k)     69.76           70.45
hu szeged (20k)      66.12           68.18
sv lines (49k)       74.04           75.46
tr imst (50k)        58.12           58.75
ar padt (120k)       68.04           68.14
en ewt (204k)        74.87           75.77
cs cac (473k)        82.89           83.57
cs pdt (1M)          81.17           81.164

t-RNN provides a comparative advantage for low-resource languages

Omer Kırnap (Koc University) MSc Thesis September 27 2018 100 123

Ablation Analysis

Overall results of ablation analysis

Lang        MLP     Only A   Only-β   Only-σ   w/o t-RNN   all
hu szeged   66.21   66.87    66.94    67.03    66.12       68.18
sv lines    71.12   72.05    72.17    74.04    72.17       75.46
tr imst     57.12   56.87    57.02    57.12    58.12       58.75
ar padt     67.83   66.67    66.89    66.92    68.04       68.14
cs cac      83.89   82.23    83.13    83.17    82.89       83.57
en ewt      75.54   75.43    75.56    75.67    74.87       75.77

Tree-stack LSTM beats the other model variations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of Ablation Experiments

t-RNNrsquos performance contribution increases when the training sizedecreases

σ-LSTM provides more useful information independent from datasetsize

Interconnecting modelrsquos component with t-RNN makes tree-stackLSTM more powerful for low-resource languages (ranked 10th of alland 2nd among transition based parsers)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What does Morphological Feature Embedding provide

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental SettingsWe divide Conll18 UD dataset 22 into 4 parts based on number oftraining tokens for each language to better understand our contributions

Languages having less than 20k tokens

Languages having more than 20k less than 50k tokens

Languages having more than 50k less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having less than 20k training tokens

Lang code Morph-Feats no Morph-Feats of tokensno nynorsklia 5113 5333 3583

ru taiga 5832 6055 10479

sme giella 5278 5339 16385

la perseus 4993 516 18184

ug udt 5278 5339 19262

sl sst 4672 4877 19473

hu szeged 6623 6818 20166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having tokens in between 50k and100k

Lang code Morph-Feats no Morph-Feats of tokenssv lines 7218 7481 48325

fr sequoia 8436 8217 50543

en gum 7644 7534 53686

ko gsd 7374 7254 56687

eu bdt 7455 7332 72974

nl lassymal 767 758 75134

gl ctg 7902 79018 79327

lv lvtb 7233 7224 80666

id gsd 7576 7397 97531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having more than 100k trainingtokens

Lang code Morph-Feats no Morph-Feats of tokensfa seraji 8118 8112 121064

bg btb 8453 8455 124336

en ewt 7577 75682 204585

ar padt 6802 6814 223881

de gsd 7159 7132 263804

ca ancora 8589 85874 417587

es ancora 8499 8478 444617

cs cac 8357 8363 472608

cs pdt 8143 8212 1173282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle transitions using gold movesDynamic oracle transitions using predicted moves

In both cases logp of gold moves maximized

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning1 Using very limited data to train LM for word and context vectors and

use them to train a parser from scratch2 Using Facebookrsquos word vectors to train a parser [Bojanowski et al

2017]3 Using my own word and context vectors trained with different

language but from the same language family4 Applying transfer learning with a pre-trained parser

Language (1) (2) (3) (4)af afribooms not provided 7546 7743 7812kk ktb 2019 2231 2196 2386bxr bdt 764 976 993 898

kmr mg 2012 2257 2278 2339

Table LAS values for strategies (1) (2) (3) and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the mostbeneficial

From scratch LM training does not bring useful word and contextvectors

Our word and context vectors are still more useful than Facebookrsquos[Bojanowski et al 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition Based Parser can only build projective trees 6

6Figure fromhttpstplingfiluuse sarakurser5LN455-2014lectures5LN455-F8pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123

Projective vs Non-projective

We compared our model with the best model for different projectivityratios

Language Projectiviy Best (LAS) Our (LAS)

grc perseus 907 7939 5503 (20)

eu bdt 9513 8422 7413 (17)

hu szeged 978 8266 6818 (14)

da ddt 9826 8628 7640 (17)

en gum 996 8505 7644 (15)

gl treegal 100 7425 7045 (10)

gl ctg 100 8212 7945 (14)

Table Our models performance gap decreases as the projectivity ratio increases

7

7From official results page and our projectivity tableOmer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusionWe introduced ldquoContext Word and Morph-featrdquo embeddings and showedtheir contribution in transition based dependency parsing

Our Tree-stack LSTM outperformed MLP by removing hand-craftedfeature engineering

Tree-stack LSTM performed better with low resource languages

When the training dataset size increases tree-stack LSTM losses itsadvantage

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization morphological taggingand dependency parsing performed better Some are also jointly trained alanguage model together with pre-trained embeddings

Attention Mechanism

Applying attention in between σ-LSTM states or β-LSTM orAction-LSTM may bring performance improvement

Morphological Features

Finding different way to represent morphological features

Dynamic Oracle vs Beam Training

Although I tried both of them I could not obtain performanceimprovement There may be convergence problems with our loss functionand another losses (CRF) may solve this problem

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap Erenay Dayanık and Deniz Yuret 2018 Tree-stackLSTM in Transition Based Dependency Parsing In Proceedings ofthe CoNLL 2018 Shared Task Multilingual Parsing from Raw Text toUniversal Dependencies

Omer Kırnap Berkay Furkan Onder and Deniz Yuret 2017 Parsingwith Context Embeddings In Proceedings of the CoNLL 2017 SharedTask Multilingual Parsing from Raw Text to Universal Dependencies

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann Carlos Gomez-Rodriguez and Giorgio Satta 2011Dynamic programming algorithms for transition-based dependencyparsers In Proceedings of the 49th Annual Meeting of theAssociation for Computational Linguistics Human LanguageTechnologies-Volume 1 Association for Computational Linguisticspages 673682

S Kbler R McDonald and J Nivre 2009 Dependency parsingMorgan amp Claypool US

Chris Dyer Miguel Ballesteros Wang Ling Austin Matthews andNoah A Smith 2015 Transition based dependency parsing withstack long-short term memory CoRR abs150508075

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123

  • Introduction
    • Overview of Dependency Parsing
    • Transition Based Dependency Parsing
      • Related Work
        • Linear Models and their Drawbacks
        • Neural Network Models
          • Model
            • Language Model
            • MLP Parser
            • Tree-stack LSTM Parser
              • Results
                • MLP vs Tree-stack LSTM
                • Morphological Feature Embeddings
                • Static vs Dynamic Oracle Training
                • Transfer Learning
                  • Conclusion
                  • Future Work amp Discussions
Page 44: Transition Based Dependency Parsing with Deep LearningTransition Based Dependency Parsing with Deep Learning Omer K rnap Ko˘c University okirnap@ku.edu.tr September 27, 2018 Omer

Experiments - Evaluation Metric

Labeled Attachment Score (LAS)The percentage of words correctly assigned both the correct syntactic headand the correct dependency label

Economic news hadGold Tree LAS 1

SBJATT

Economic news had

PREDOBJPred 1 LAS 0

Economic news had

OBJATTPred 2 LAS (frac12)100

Omer Kırnap (Koc University) MSc Thesis September 27 2018 44 123

Experiments (MLP)

CoNLL 2017 Results (all treebanks LAS)

Ranked 1st among transition based parsers 5

5Source CoNLL17 official results pageOmer Kırnap (Koc University) MSc Thesis September 27 2018 45 123

Contributions in CoNLL17

Omer Kırnap (Koc University) MSc Thesis September 27 2018 46 123

Context and Word Embeddings

Relative contributions of part-of-speech (p) word vector (v)context vector (c)

Feats Hungarian En-ParTUT Latvianp 636 766 559

v 735 759 63

c 722 76 635

v-c 76 79 676

p-c 78 825 706

p-v 766 808 677

p-fb 747 797 663

p-v-c 793 832 742

Omer Kırnap (Koc University) MSc Thesis September 27 2018 47 123

Context and Word Embeddings

Relative contributions of part-of-speech (p) word vector (v)context vector (c)

Feats Hungarian En-ParTUT Latvianp 636 766 559

v 735 759 63

c 722 76 635

v-c 76 79 676

p-c 78 825 706

p-v 766 808 677

p-fb 747 797 663

p-v-c 793 832 742

Context vectors provide independent contribution on top ofPOS tags

Omer Kırnap (Koc University) MSc Thesis September 27 2018 48 123

Context and Word embeddings

Feats Hungarian En-ParTUT Latvianp 636 766 559

v 735 759 63

c 722 76 635

v-c 76 79 676

p-c 78 825 706

p-v 766 808 677p-fb 747 797 663

p-v-c 793 832 742

Our BiLSTM language model word vectors perform betterthan FB vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 49 123

Context and Word embeddings

Feats Hungarian En-ParTUT Latvianp 636 766 559

v 735 759 63

c 722 76 635

v-c 76 79 676

p-c 78 825 706

p-v 766 808 677

p-fb 747 797 663

p-v-c 793 832 742

Both POS tags and context vectors have significantcontributions on top of word vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 50 123

Issues with MLP

However

Choosing correct state of parser still remains critical

We are unable to represent whole parsing history with featureextracting

Omer Kırnap (Koc University) MSc Thesis September 27 2018 51 123

Solution

Find a recurrent architecture such that it can summarize the parsinghistory as well as word sequences in a buffer and stack

Omer Kırnap (Koc University) MSc Thesis September 27 2018 52 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to UniversalDependencies

CoNLL17

bull Koc-University team with MLP Parser using Context Embeddings

CoNLL18

bull KParse team with Tree-stack LSTM Parser usingContext and Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 53 123

c Tree-stack LSTM Parser (CoNLL18)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 54 123

Related Work - Stack LSTM

Figure: Stack LSTM [Dyer et al., 2015]

Represent each component (σ, β, A) with an LSTM. Modify the head word's embedding with the dependent's embedding.


Problems with Stack LSTM

They only modify the stack's word embeddings

Hidden states of the LSTMs are not updated unless a reduce occurs

Actions are not explicitly represented

They only used word2vec embeddings [Mikolov et al., 2013]


Our solution

We propose

Context embeddings should improve parsing accuracy

Dependency relations should be explicitly represented

Morphological Features of a word may enhance parsing accuracy


Tree-stack LSTM Overview

[Figure: Tree-stack LSTM — the t-RNN combines head word, dependent word, and dependency relation embeddings; the σ-LSTM, β-LSTM, and Action-LSTM states are concatenated and fed to an MLP]

We propose the Tree-stack LSTM model with 4 components:

1. β-LSTM
2. σ-LSTM
3. Action-LSTM
4. Tree-RNN


Tree-stack LSTM

Input Representation


Input Representation

Action and Dependency Relation Embeddings

Every action is represented with a continuous vector

Every dependency relation is represented with a continuous vector


Input Representation

We do not include an explicit feature extractor. We initialize the word representation by concatenating:

Character Based LSTMrsquos word vectors

Word Based BiLSTMrsquos context vectors

Part-of-speech (POS) vectors

Morph-feat vectors
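The four vectors are simply concatenated into one word representation. A minimal sketch in plain Python (the vector names and toy dimensions are illustrative assumptions, not the actual sizes used in the thesis):

```python
def word_representation(char_vec, context_vec, pos_vec, morph_vec):
    """Concatenate the four inputs into one word representation.

    The arguments stand for the character-LSTM word vector, the BiLSTM
    context vector, the POS embedding, and the morph-feat embedding;
    plain lists of floats are used here instead of real model outputs."""
    return char_vec + context_vec + pos_vec + morph_vec

# Toy dimensions: 3-dim char vector, 3-dim context, 2-dim POS, 2-dim morph-feat
w = word_representation([0.1, 0.2, 0.3], [0.4, 0.5, 0.6], [1.0, 0.0], [0.0, 1.0])
```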


Input Representation

Morph-feat Vectors

Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs (morphological features of the word "It")

Figure Morph-feat Embeddings
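One way to turn a UD feature string like the one above into a single vector is to look up an embedding per Feature=Value pair and sum them. This is only a hedged sketch: the `emb` table and the summing (rather than concatenating) are assumptions for illustration.

```python
def morph_feat_vector(feat_string, emb, dim=4):
    """Build one vector from a UD morph string such as
    'Case=Nom|Gender=Neut|Number=Sing' by summing per-feature embeddings.

    Unknown Feature=Value pairs contribute zeros; '_' means no features."""
    vec = [0.0] * dim
    if feat_string == "_":
        return vec
    for pair in feat_string.split("|"):
        e = emb.get(pair, [0.0] * dim)
        vec = [a + b for a, b in zip(vec, e)]
    return vec

# Hypothetical embedding table with 4-dimensional feature embeddings
emb = {"Case=Nom": [1.0, 0.0, 0.0, 0.0], "Number=Sing": [0.0, 1.0, 0.0, 0.0]}
v = morph_feat_vector("Case=Nom|Gender=Neut|Number=Sing", emb)
```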


Tree-stack LSTM

Model Components:

1. β-LSTM
2. σ-LSTM
3. Action-LSTM
4. Tree-RNN


β-LSTM

[Figure: Tree-stack LSTM overview (β-LSTM component highlighted)]


β-LSTM

[Figure: Buffer's β-LSTM — an LSTM running over the buffer words w_i, w_i+1, w_i+2]
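As a rough sketch of the idea, the recurrence below runs a simplified, non-gated recurrent cell over the buffer words; it stands in for the actual LSTM cell, whose gates are omitted for brevity, and the toy weights are illustrative assumptions.

```python
import math

def buffer_rnn(words, W_x, W_h):
    """Run a simplified recurrent cell over the buffer words w_i, w_i+1, ...

    h_t = tanh(W_x @ x_t + W_h @ h_{t-1}); this non-gated update stands in
    for an LSTM cell (gates omitted). The final hidden state summarizes
    the remaining buffer."""
    h = [0.0] * len(W_h)
    for x in words:
        h = [math.tanh(sum(W_x[i][j] * x[j] for j in range(len(x)))
                       + sum(W_h[i][j] * h[j] for j in range(len(h))))
             for i in range(len(h))]
    return h

# Toy 2-dimensional word vectors and weights (illustrative values)
words = [[1.0, 0.0], [0.0, 1.0]]
W_x = [[0.5, 0.0], [0.0, 0.5]]
W_h = [[0.1, 0.0], [0.0, 0.1]]
h = buffer_rnn(words, W_x, W_h)
```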


σ-LSTM

[Figure: Tree-stack LSTM overview (σ-LSTM component highlighted)]


σ-LSTM

[Figure: Stack's σ-LSTM — an LSTM running over the stack words s_i, s_i+1, s_i+2]


Action-LSTM

[Figure: Tree-stack LSTM overview (Action-LSTM component highlighted)]


Action-LSTM

[Figure: Action-LSTM — an LSTM running over the past transition embeddings]


How are the components of the tree-stack LSTM connected?


Tree-RNN


Tree-RNN (t-RNN)

[Figure: t-RNN — combines the head word, dependent word, and dependency relation embeddings]

w_head_new = tanh(W_rnn · [w_head_old ; d_l ; w_dep] + b_rnn)    (1)
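Equation (1) can be written out directly in plain Python; the toy weight matrix `W` and the 2/1/2 vector dimensions below are illustrative assumptions:

```python
import math

def t_rnn(w_head, d_label, w_dep, W, b):
    """Eq. (1): w_head_new = tanh(W_rnn * [w_head ; d_label ; w_dep] + b_rnn).

    The new head embedding replaces the old one after an arc is added."""
    x = w_head + d_label + w_dep        # the concatenation [w_head; d_l; w_dep]
    return [math.tanh(sum(W[i][j] * x[j] for j in range(len(x))) + b[i])
            for i in range(len(W))]

# Toy sizes: 2-dim head, 1-dim relation label, 2-dim dependent -> 5-dim input
W = [[0.1, 0.1, 0.1, 0.1, 0.1],
     [-0.1, -0.1, -0.1, -0.1, -0.1]]
b = [0.0, 0.0]
new_head = t_rnn([0.5, 0.5], [1.0], [0.2, 0.3], W, b)
```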


Tree-RNN with

1. Left Transition
2. Right Transition


Left Transition


Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: Each embedding is initialized by concatenating POS, language, and morph-feat embeddings


Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: Stack's top LSTM is reduced

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: t-RNN calculates the new head embedding


Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: β-LSTM recalculates its hidden state based on the new input


Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: Tree-stack LSTM is ready to predict the next transition
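The left_d transition above can be mirrored on a plain parser state. The state layout — Python lists for σ and β, a set of (head, label, dependent) triples for A — is an assumption for illustration:

```python
def left_arc(stack, buffer, arcs, label):
    """left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)}).

    The buffer front b becomes the head of the stack top s; s is popped
    and the arc (head, label, dependent) is recorded."""
    s = stack.pop()          # dependent: removed from the stack
    b = buffer[0]            # head: stays at the front of the buffer
    arcs.add((b, label, s))
    return stack, buffer, arcs

# Toy state: word ids on the stack and buffer, no arcs yet
stack, buffer, arcs = [0, 2], [3, 4], set()
left_arc(stack, buffer, arcs, "nsubj")
```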


Right Transition


Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: Each embedding is initialized by concatenating POS, language, and morph-feat embeddings


Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: Stack's top LSTM is reduced


Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: t-RNN calculates the new head embedding


Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: σ-LSTM recalculates its hidden state from the new input


Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: Tree-stack LSTM is ready to predict the next transition
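Likewise, the right_d transition pops the stack top t and attaches it to the element s below it; the state layout is the same illustrative assumption as before:

```python
def right_arc(stack, buffer, arcs, label):
    """right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)}).

    The stack top t becomes a dependent of the element s right below it;
    t is popped and the arc (head, label, dependent) is recorded."""
    t = stack.pop()          # dependent: the old stack top
    s = stack[-1]            # head: the new stack top
    arcs.add((s, label, t))
    return stack, buffer, arcs

# Toy state: word ids on the stack and buffer, no arcs yet
stack, buffer, arcs = [0, 2, 5], [7], set()
right_arc(stack, buffer, arcs, "obj")
```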


Final overview of Tree-stack LSTM

[Figure: Final Tree-stack LSTM — σ-LSTM, β-LSTM, Action-LSTM, and t-RNN outputs are concatenated and fed to an MLP]
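The Concat → MLP step can be sketched as scoring transitions from the concatenated component states; the single tanh hidden layer, the softmax, and the toy dimensions below are assumptions for illustration:

```python
import math

def mlp_scores(features, W1, b1, W2, b2):
    """Score transitions from the concatenated component states:
    one tanh hidden layer followed by a softmax over transitions."""
    h = [math.tanh(sum(w * f for w, f in zip(row, features)) + bb)
         for row, bb in zip(W1, b1)]
    logits = [sum(w * v for w, v in zip(row, h)) + bb
              for row, bb in zip(W2, b2)]
    m = max(logits)
    exps = [math.exp(l - m) for l in logits]
    z = sum(exps)
    return [e / z for e in exps]

# Toy 2-dim concatenated state scored against 3 candidate transitions
W1 = [[0.1, 0.2], [0.3, -0.1]]
b1 = [0.0, 0.1]
W2 = [[1.0, -1.0], [0.5, 0.5], [0.0, 1.0]]
b2 = [0.0, 0.0, 0.0]
probs = mlp_scores([1.0, 2.0], W1, b1, W2, b2)
```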


Overview

1. Introduction: Overview of Dependency Parsing; Transition Based Dependency Parsing

2. Related Work: Linear Models and their Drawbacks; Neural Network Models

3. Model: Language Model; MLP Parser; Tree-stack LSTM Parser

4. Results: MLP vs Tree-stack LSTM; Morphological Feature Embeddings; Static vs Dynamic Oracle Training; Transfer Learning

5. Conclusion

6. Future Work & Discussions


4 Results & Comparisons


Results & Comparisons

Dataset

CoNLL17:
- Dependency parsing of 81 treebanks in 49 languages
- All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations
- Koc-University ranked 7th out of 33 participants (1st among transition based parsers)

CoNLL18:
- Dependency parsing of 82 treebanks in 57 languages
- All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations
- Koc-University ranked 16th out of 30 participants (2nd among transition based parsers)

Differences between CoNLL17 and CoNLL18: 1. Train/test split change, 2. Annotation


MLP vs Tree-stack LSTM

The CoNLL 2018 committee released comparison results of CoNLL17 and CoNLL18 systems tested under the same test sets


MLP vs Tree-stack LSTM

2 possible problems of the official comparison:

1. If the annotation of the treebank is improved, the older parser is handicapped
2. If the training-test split has changed and the old training data are now in the test data, the old parser is favored undeservedly


MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code        MLP     Tree-stack
ru_taiga (10k)   58.89   60.55
hu_szeged (20k)  66.21   68.18
tr_imst (50k)    56.78   58.75
ar_padt (120k)   67.83   68.14
en_ewt (205k)    74.87   75.77
cs_cac (473k)    83.39   83.57

Tree-stack LSTM outperforms MLP


Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM


MLP Parser

[Figure: Initial model — MLP only]


Only Action LSTM

[Figure: Only Action-LSTM]


Only β-LSTM

[Figure: Only β-LSTM]


Only σ-LSTM

[Figure: Only σ-LSTM]


Ablation Analysis Results

Lang Code   MLP     Only Action   Only-β   Only-σ
hu_szeged   66.21   66.87         66.94    67.03
sv_lines    71.12   72.05         72.17    72.45
tr_imst     57.12   56.87         57.02    57.12
ar_padt     67.83   66.67         66.89    66.92
cs_cac      83.89   82.23         83.13    83.17
en_ewt      75.54   75.43         75.56    75.67

Table: Comparison between MLP and "Only" models


Ablation of t-RNN

[Figure: Tree-stack LSTM overview (t-RNN component highlighted)]


Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang Code            without t-RNN   with t-RNN
no_nynorsklia (3k)   51.78           53.33
ru_taiga (11k)       59.13           60.55
gl_treegal (15k)     69.76           70.45
hu_szeged (20k)      66.12           68.18
sv_lines (49k)       74.04           75.46
tr_imst (50k)        58.12           58.75
ar_padt (120k)       68.04           68.14
en_ewt (204k)        74.87           75.77
cs_cac (473k)        82.89           83.57
cs_pdt (1M)          81.17           81.164

t-RNN provides a comparative advantage for low-resource languages


Ablation Analysis

Overall results of ablation analysis

Lang        MLP     Only A   Only-β   Only-σ   w/o t-RNN   all
hu_szeged   66.21   66.87    66.94    67.03    66.12       68.18
sv_lines    71.12   72.05    72.17    74.04    72.17       75.46
tr_imst     57.12   56.87    57.02    57.12    58.12       58.75
ar_padt     67.83   66.67    66.89    66.92    68.04       68.14
cs_cac      83.89   82.23    83.13    83.17    82.89       83.57
en_ewt      75.54   75.43    75.56    75.67    74.87       75.77

Tree-stack LSTM beats other model variations


Ablation Analysis

Conclusions of Ablation Experiments

t-RNN's performance contribution increases when the training size decreases

σ-LSTM provides more useful information, independent of dataset size

Interconnecting the model's components with the t-RNN makes tree-stack LSTM more powerful for low-resource languages (ranked 10th of all and 2nd among transition based parsers)


What does Morphological Feature Embedding provide?


Contribution of Morph-feat Embeddings

Experimental Settings

We divide the CoNLL18 UD dataset into 4 parts based on the number of training tokens for each language, to better understand our contributions:

Languages having less than 20k tokens

Languages having more than 20k, less than 50k tokens

Languages having more than 50k, less than 100k tokens

Languages having 100k tokens or more


Contribution of Morph-feat embeddings

Morph-feat experiments for languages having less than 20k training tokens

Lang code       Morph-Feats   no Morph-Feats   # of tokens
no_nynorsklia   51.13         53.33            3,583
ru_taiga        58.32         60.55            10,479
sme_giella      52.78         53.39            16,385
la_perseus      49.93         51.60            18,184
ug_udt          52.78         53.39            19,262
sl_sst          46.72         48.77            19,473
hu_szeged       66.23         68.18            20,166

Not useful for languages having less than 20k training tokens


Contribution of Morph-feat embeddings

Morph-feat experiments for languages having between 50k and 100k training tokens

Lang code       Morph-Feats   no Morph-Feats   # of tokens
sv_lines        72.18         74.81            48,325
fr_sequoia      84.36         82.17            50,543
en_gum          76.44         75.34            53,686
ko_gsd          73.74         72.54            56,687
eu_bdt          74.55         73.32            72,974
nl_lassysmall   76.7          75.8             75,134
gl_ctg          79.02         79.018           79,327
lv_lvtb         72.33         72.24            80,666
id_gsd          75.76         73.97            97,531

Beneficial for languages with 50k-100k training tokens


Contribution of Morph-feat embeddings

Morph-feat experiments for languages having more than 100k training tokens

Lang code   Morph-Feats   no Morph-Feats   # of tokens
fa_seraji   81.18         81.12            121,064
bg_btb      84.53         84.55            124,336
en_ewt      75.77         75.682           204,585
ar_padt     68.02         68.14            223,881
de_gsd      71.59         71.32            263,804
ca_ancora   85.89         85.874           417,587
es_ancora   84.99         84.78            444,617
cs_cac      83.57         83.63            472,608
cs_pdt      81.43         82.12            1,173,282

Neutral for languages having more than 100k training tokens


Static vs Dynamic Oracle Training

Static oracle: transitions follow the gold moves. Dynamic oracle: transitions follow the predicted moves.

In both cases, the log-probability of the gold moves is maximized
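The distinction can be sketched as a single training step: the loss always targets the gold move, but the state the parser follows next differs. The toy scores and the inlined log-softmax are illustrative assumptions:

```python
import math

def oracle_step(scores, gold, dynamic):
    """One training decision: the loss is always -log p(gold move),
    but a static oracle follows the gold move while a dynamic oracle
    follows the model's own prediction."""
    m = max(scores)
    log_z = math.log(sum(math.exp(s - m) for s in scores)) + m
    loss = log_z - scores[gold]          # -log softmax(scores)[gold]
    predicted = max(range(len(scores)), key=scores.__getitem__)
    follow = predicted if dynamic else gold
    return loss, follow

# Toy transition scores where the model disagrees with the gold move (index 1)
loss_s, move_s = oracle_step([2.0, 0.5, 0.1], gold=1, dynamic=False)
loss_d, move_d = oracle_step([2.0, 0.5, 0.1], gold=1, dynamic=True)
```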

[Figure: Tree-stack LSTM overview]


Static vs Dynamic Oracle Training

Figure: Results are very close for training tokens less than 20k


Static vs Dynamic Oracle Training

Figure: Results are very close for training tokens between 20k and 50k


Static vs Dynamic Oracle Training

Figure: Results are very close for training tokens more than 50k


How about languages with less than 20k training tokens?


Transfer Learning

There are 4 possible types of transfer learning:

1. Using very limited data to train an LM for word and context vectors, and using them to train a parser from scratch
2. Using Facebook's word vectors to train a parser [Bojanowski et al., 2017]
3. Using my own word and context vectors trained on a different language from the same language family
4. Applying transfer learning with a pre-trained parser

Language       (1)            (2)     (3)     (4)
af_afribooms   not provided   75.46   77.43   78.12
kk_ktb         20.19          22.31   21.96   23.86
bxr_bdt        7.64           9.76    9.93    8.98
kmr_mg         20.12          22.57   22.78   23.39

Table: LAS values for strategies (1), (2), (3), and (4)


Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the most beneficial

Training an LM from scratch on very limited data does not produce useful word and context vectors

Our word and context vectors are still more useful than Facebook's [Bojanowski et al., 2017]


Projectivity

Transition based parsers can only build projective trees 6

6 Figure from https://stp.lingfil.uu.se/~sara/kurser/5LN455-2014/lectures/5LN455-F8.pdf
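Projectivity can be checked directly: a tree is projective iff no two arcs cross. The head-array encoding below (heads[i] is the head of word i+1, 0 for the root) is a common convention assumed here for illustration:

```python
def is_projective(heads):
    """heads[i] is the head of word i+1 (words are 1-based, 0 is the root).

    A dependency tree is projective iff no two arcs cross when drawn
    above the sentence, i.e. no pair of arcs strictly interleaves."""
    arcs = [(min(h, d), max(h, d)) for d, h in enumerate(heads, start=1)]
    for i, (a, b) in enumerate(arcs):
        for c, d in arcs[i + 1:]:
            if a < c < b < d or c < a < d < b:   # strictly crossing spans
                return False
    return True
```

For example, arcs (1, 3) and (2, 4) interleave, so a tree containing both is non-projective.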


Projective vs Non-projective

We compared our model with the best model for different projectivity ratios

Language      Projectivity   Best (LAS)   Our (LAS)
grc_perseus   90.7           79.39        55.03 (20)
eu_bdt        95.13          84.22        74.13 (17)
hu_szeged     97.8           82.66        68.18 (14)
da_ddt        98.26          86.28        76.40 (17)
en_gum        99.6           85.05        76.44 (15)
gl_treegal    100            74.25        70.45 (10)
gl_ctg        100            82.12        79.45 (14)

Table: Our model's performance gap decreases as the projectivity ratio increases

7 From the official results page and our projectivity table

Conclusions


Conclusion

In conclusion: We introduced "Context, Word, and Morph-feat" embeddings and showed their contribution to transition based dependency parsing.

Our Tree-stack LSTM outperformed the MLP while removing hand-crafted feature engineering

Tree-stack LSTM performed better with low-resource languages

When the training dataset size increases, tree-stack LSTM loses its advantage


Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization, morphological tagging, and dependency parsing performed better. Some also jointly train a language model together with pre-trained embeddings.

Attention Mechanism

Applying attention between σ-LSTM, β-LSTM, or Action-LSTM states may bring performance improvements

Morphological Features

Finding a different way to represent morphological features

Dynamic Oracle vs Beam Training

Although I tried both of them, I could not obtain a performance improvement. There may be convergence problems with our loss function, and other losses (e.g., CRF) may solve this problem.


Publications

Omer Kırnap, Erenay Dayanık, and Deniz Yuret. 2018. Tree-stack LSTM in Transition Based Dependency Parsing. In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap, Berkay Furkan Onder, and Deniz Yuret. 2017. Parsing with Context Embeddings. In Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.


References

Marco Kuhlmann, Carlos Gómez-Rodríguez, and Giorgio Satta. 2011. Dynamic programming algorithms for transition-based dependency parsers. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies - Volume 1, Association for Computational Linguistics, pages 673-682.

S. Kübler, R. McDonald, and J. Nivre. 2009. Dependency Parsing. Morgan & Claypool, US.

Chris Dyer, Miguel Ballesteros, Wang Ling, Austin Matthews, and Noah A. Smith. 2015. Transition-based dependency parsing with stack long short-term memory. CoRR abs/1505.08075.


Thank you for your attention


Questions


  • Introduction
    • Overview of Dependency Parsing
    • Transition Based Dependency Parsing
      • Related Work
        • Linear Models and their Drawbacks
        • Neural Network Models
          • Model
            • Language Model
            • MLP Parser
            • Tree-stack LSTM Parser
              • Results
                • MLP vs Tree-stack LSTM
                • Morphological Feature Embeddings
                • Static vs Dynamic Oracle Training
                • Transfer Learning
                  • Conclusion
                  • Future Work amp Discussions
Page 45: Transition Based Dependency Parsing with Deep LearningTransition Based Dependency Parsing with Deep Learning Omer K rnap Ko˘c University okirnap@ku.edu.tr September 27, 2018 Omer

Experiments (MLP)

CoNLL 2017 Results (all treebanks LAS)

Ranked 1st among transition based parsers 5

5Source CoNLL17 official results pageOmer Kırnap (Koc University) MSc Thesis September 27 2018 45 123

Contributions in CoNLL17

Omer Kırnap (Koc University) MSc Thesis September 27 2018 46 123

Context and Word Embeddings

Relative contributions of part-of-speech (p) word vector (v)context vector (c)

Feats Hungarian En-ParTUT Latvianp 636 766 559

v 735 759 63

c 722 76 635

v-c 76 79 676

p-c 78 825 706

p-v 766 808 677

p-fb 747 797 663

p-v-c 793 832 742

Omer Kırnap (Koc University) MSc Thesis September 27 2018 47 123

Context and Word Embeddings

Relative contributions of part-of-speech (p) word vector (v)context vector (c)

Feats Hungarian En-ParTUT Latvianp 636 766 559

v 735 759 63

c 722 76 635

v-c 76 79 676

p-c 78 825 706

p-v 766 808 677

p-fb 747 797 663

p-v-c 793 832 742

Context vectors provide independent contribution on top ofPOS tags

Omer Kırnap (Koc University) MSc Thesis September 27 2018 48 123

Context and Word embeddings

Feats Hungarian En-ParTUT Latvianp 636 766 559

v 735 759 63

c 722 76 635

v-c 76 79 676

p-c 78 825 706

p-v 766 808 677p-fb 747 797 663

p-v-c 793 832 742

Our BiLSTM language model word vectors perform betterthan FB vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 49 123

Context and Word embeddings

Feats Hungarian En-ParTUT Latvianp 636 766 559

v 735 759 63

c 722 76 635

v-c 76 79 676

p-c 78 825 706

p-v 766 808 677

p-fb 747 797 663

p-v-c 793 832 742

Both POS tags and context vectors have significantcontributions on top of word vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 50 123

Issues with MLP

However

Choosing correct state of parser still remains critical

We are unable to represent whole parsing history with featureextracting

Omer Kırnap (Koc University) MSc Thesis September 27 2018 51 123

Solution

Find a recurrent architecture such that it can summarize the parsinghistory as well as word sequences in a buffer and stack

Omer Kırnap (Koc University) MSc Thesis September 27 2018 52 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to UniversalDependencies

CoNLL17

bull Koc-University team with MLP Parser using Context Embeddings

CoNLL18

bull KParse team with Tree-stack LSTM Parser usingContext and Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 53 123

c Tree-stack LSTM Parser (CoNLL18)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 54 123

Related Work - Stack LSTM

Figure Stack LSTM [Dyer et al 2015]

Represent each component (σ β A) with an LSTMModifying head wordrsquos embedding with dependent embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 55 123

Problems with Stack LSTM

They only modify stackrsquos word embeddings

Hidden states of LSTMS are not updated unless reduce

Actions are not explicitly represented

They only used word2vec embeddings [Mikolov et al 2013]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 56 123

Our solution

We propose

Context embeddings should improve parsing accuracy

Dependency relations should be explicitly represented

Morphological Features of a word may enhance parsing accuracy

Omer Kırnap (Koc University) MSc Thesis September 27 2018 57 123

Tree-stack LSTM Overview

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

We propose Tree-stack LSTM model with 4 components

β-LSTMσ-LSTMAction-LSTMTree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 58 123

Tree-stack LSTM

Input Representation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 59 123

Input Representation

Action and Dependency Relation Embeddings

Every action is represented with continuous vector

Every dependency relation is represented with continuous vector

Omer Kırnap (Koc University) MSc Thesis September 27 2018 60 123

Input Representation

We do not include explicit feature extractor We initiated wordrepresentation by concatenating

Character Based LSTMrsquos word vectors

Word Based BiLSTMrsquos context vectors

Part-of-speech (POS) vectors

Morph-feat vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 61 123

Input Representation

Morp-feat Vectors

Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs IT It

Figure Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 62 123

Tree-stack LSTM

Model Components1 β-LSTM2 σ-LSTM3 Action-LSTM4 Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 63 123

β-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 64 123

β-LSTM

LSTM LSTM LSTM

wi+2wi+1wi

Figure Bufferrsquos β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 65 123

σ-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 66 123

σ-LSTM

LSTM LSTM LSTM

si si+1 si+2

Figure Stackrsquos σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 67 123

Action-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 68 123

Action-LSTM

LSTM LSTM LSTM

Figure Action-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 69 123

How do components of tree-stack LSTM are connected

Omer Kırnap (Koc University) MSc Thesis September 27 2018 70 123

Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 71 123

Tree-RNN (t-RNN)

t-RNN

Dependent word

Dependency Relation

Head word

Figure t-RNN

whead new = tanh(Wrnn lowast [whead old dl wdep] + brnn) (1)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 72 123

Tree-RNN with

1 Left Transition2 Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 73 123

Left Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 74 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM LSTM

Left transition

t-RNN

Dependency Relation

HeadDependent

Figure Each embedding initiated by concatenating POS language andmorph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 75 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

Figure Stackrsquos top LSTM is reducedOmer Kırnap (Koc University) MSc Thesis September 27 2018 76 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 77 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure β-LSTM recalculates its hidden based on new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 78 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure Tree-stack LSTM is ready to give new transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 79 123

Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 80 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure Each embedding initiated by concatenating POS language andmorph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 81 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure Stackrsquos top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 82 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 83 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure σ-LSTM recalculates its hidden from new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 84 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure Tree-stack LSTM is ready to give new transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 85 123

Final overview of Tree-stack LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 86 123

Overview

1 IntroductionOverview of Dependency ParsingTransition Based Dependency Parsing

2 Related WorkLinear Models and their DrawbacksNeural Network Models

3 ModelLanguage ModelMLP ParserTree-stack LSTM Parser

4 ResultsMLP vs Tree-stack LSTMMorphological Feature EmbeddingsStatic vs Dynamic Oracle TrainingTransfer Learning

5 Conclusion6 Future Work amp Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 87 123

4 Results amp Comparisons

Omer Kırnap (Koc University) MSc Thesis September 27 2018 88 123

Results amp Comparisons

Dataset

Dependency parsing of 81

treebanks in 49 languages

All treebanks use standardized

annotation

17 universal

part-of-speech tags

37 universal dependency

relations

Koc-University ranked 7th out

of 33 participants (1st among

transition based parsers)

Dependency parsing of 82

treebanks in 57 languages

All treebanks use

standardized annotation

17 universal

part-of-speech tags

37 universal dependency

relations

Koc-University ranked 16th

out of 30 participants (2nd

among transition based

parsers)

CoNLL17 CoNLL181 Traintest split change 2 Annotation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 89 123

MLP vs Tree-stack LSTM

CoNLL 2018 committee released comparison results of CoNLL17 andCoNLL18 systems tested under the same test sets

Omer Kırnap (Koc University) MSc Thesis September 27 2018 90 123

MLP vs Tree-stack LSTM

2 possible problems of official comparison

1 If the annotation of the tree bank is improved the older parser ishandicapped

2 If the training-test split has changed and the old training data arenow in test data the old parser is favored undeservedly

Omer Kırnap (Koc University) MSc Thesis September 27 2018 91 123

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code MLP Tree-stackru taiga (10k) 5889 6055hu szeged (20k) 6621 6818tr imst (50k) 5678 5875ar padt (120k) 6783 6814en ewt (205k) 7487 7577cs cac (473k) 8339 8357

Tree-stack LSTM outperforms MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 92 123

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 93 123

MLP Parser

MLP

Figure Initial model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 94 123

Only Action LSTM

LSTM LSTM

Figure Only action LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 95 123

Only β-LSTM

LSTM LSTM LSTM

LSTM LSTM MLP

Figure Only β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 96 123

Only σ-LSTM

LSTM LSTM

LSTM LSTM MLP

Figure Only σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang Code MLP Only Action Only-β Only-σhu szeged 6621 6687 6694 6703sv lines 7112 7205 7217 7245tr imst 5712 5687 5702 5712ar padt 6783 6667 6689 6692

cs cac 8389 8223 8313 8317

en ewt 7554 7543 7556 7567

Table Comparison between MLP and rdquoOnlyrdquo models

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 99 123

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang Code without t-RNN with t-RNNno nynorsklia (3k) 5178 5333ru taiga (11k) 5913 6055gl treegal (15k) 6976 7045hu szeged (20k) 6612 6818sv lines (49k) 7404 7546tr imst (50k) 5812 5875

ar padt (120k) 6804 6814

en ewt (204k) 7487 7577

cs cac (473k) 8289 8357

cs pdt (1M) 8117 81164

t-RNN provides comparative advantage for low-resourcelanguages

Omer Kırnap (Koc University) MSc Thesis September 27 2018 100 123

Ablation Analysis

Overall results of ablation analysis

Lang        MLP    Only A  Only-β  Only-σ  w/o t-RNN  all
hu_szeged   66.21  66.87   66.94   67.03   66.12      68.18
sv_lines    71.12  72.05   72.17   74.04   72.17      75.46
tr_imst     57.12  56.87   57.02   57.12   58.12      58.75
ar_padt     67.83  66.67   66.89   66.92   68.04      68.14
cs_cac      83.89  82.23   83.13   83.17   82.89      83.57
en_ewt      75.54  75.43   75.56   75.67   74.87      75.77

Tree-stack LSTM beats other model variations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of Ablation Experiments

t-RNN's performance contribution increases as the training size decreases

σ-LSTM provides more useful information, independent of dataset size

Interconnecting the model's components with t-RNN makes tree-stack LSTM more powerful for low-resource languages (ranked 10th overall and 2nd among transition-based parsers)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What does the Morphological Feature Embedding provide?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental Settings

We divide the CoNLL18 UD v2.2 dataset into 4 parts, based on the number of training tokens for each language, to better understand our contributions

Languages having less than 20k tokens

Languages having more than 20k less than 50k tokens

Languages having more than 50k less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123
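The grouping above can be sketched as a small helper; the boundaries are taken from the slides, the function name is illustrative:

```python
def size_bucket(n_tokens):
    """Size group used in the morph-feat experiments
    (boundaries from the slides)."""
    if n_tokens < 20_000:
        return "<20k"
    if n_tokens < 50_000:
        return "20k-50k"
    if n_tokens < 100_000:
        return "50k-100k"
    return ">=100k"
```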

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having less than 20k training tokens

Lang code       Morph-Feats  no Morph-Feats  # of tokens
no_nynorsklia   51.13        53.33           3,583
ru_taiga        58.32        60.55           10,479
sme_giella      52.78        53.39           16,385
la_perseus      49.93        51.60           18,184
ug_udt          52.78        53.39           19,262
sl_sst          46.72        48.77           19,473
hu_szeged       66.23        68.18           20,166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having between 50k and 100k training tokens

Lang code      Morph-Feats  no Morph-Feats  # of tokens
sv_lines       72.18        74.81           48,325
fr_sequoia     84.36        82.17           50,543
en_gum         76.44        75.34           53,686
ko_gsd         73.74        72.54           56,687
eu_bdt         74.55        73.32           72,974
nl_lassysmall  76.70        75.80           75,134
gl_ctg         79.02        79.018          79,327
lv_lvtb        72.33        72.24           80,666
id_gsd         75.76        73.97           97,531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having more than 100k training tokens

Lang code   Morph-Feats  no Morph-Feats  # of tokens
fa_seraji   81.18        81.12           121,064
bg_btb      84.53        84.55           124,336
en_ewt      75.77        75.682          204,585
ar_padt     68.02        68.14           223,881
de_gsd      71.59        71.32           263,804
ca_ancora   85.89        85.874          417,587
es_ancora   84.99        84.78           444,617
cs_cac      83.57        83.63           472,608
cs_pdt      81.43        82.12           1,173,282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle: transitions follow the gold moves
Dynamic oracle: transitions follow the predicted moves

In both cases, the log-probability of the gold moves is maximized

[Figure: Tree-stack LSTM architecture — σ-, β- and Action-LSTM outputs are concatenated and fed to the MLP; the t-RNN combines head word, dependent word, and dependency relation embeddings]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123
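The difference between the two training regimes can be sketched as follows. This is a toy under assumed interfaces, not the thesis code: `steps` and `scorer` are hypothetical stand-ins for the parser states and the move scorer, and the scores stand in for log-probabilities:

```python
import random

def train_pass(steps, scorer, dynamic, explore=0.5, seed=0):
    """One pass over a sentence's parser states.

    steps: list of (state, gold_move, legal_moves).
    In BOTH regimes the (stand-in) log-prob of the gold move is
    maximized; they differ only in which move is *executed* to reach
    the next state: always the gold move (static), or with some
    probability the model's own prediction (dynamic).
    """
    rng = random.Random(seed)
    loss, executed = 0.0, []
    for state, gold, legal in steps:
        scores = scorer(state)
        loss += -scores[gold]  # stand-in for -log p(gold move)
        if dynamic and rng.random() < explore:
            executed.append(max(legal, key=lambda m: scores[m]))  # predicted move
        else:
            executed.append(gold)  # gold move
    return loss, executed

# toy run: two parser states, a fixed scorer
steps = [(0, "shift", ["shift", "left", "right"]),
         (1, "left", ["shift", "left", "right"])]
scorer = lambda s: {"shift": 0.1, "left": 0.9, "right": 0.2}
static_loss, static_moves = train_pass(steps, scorer, dynamic=False)
```

With `explore=1.0` the dynamic regime always executes the model's prediction, while the loss term stays attached to the gold move.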

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning:

1. Using very limited data to train an LM for word and context vectors, and using them to train a parser from scratch
2. Using Facebook's word vectors to train a parser [Bojanowski et al. 2017]
3. Using my own word and context vectors, trained on a different language from the same language family
4. Applying transfer learning with a pre-trained parser

Language      (1)           (2)    (3)    (4)
af_afribooms  not provided  75.46  77.43  78.12
kk_ktb        20.19         22.31  21.96  23.86
bxr_bdt       7.64          9.76   9.93   8.98
kmr_mg        20.12         22.57  22.78  23.39

Table LAS values for strategies (1) (2) (3) and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the most beneficial

Training an LM from scratch does not bring useful word and context vectors

Our word and context vectors are still more useful than Facebook's [Bojanowski et al. 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition-based parsers can only build projective trees 6

6 Figure from http://stp.lingfil.uu.se/~sara/kurser/5LN455-2014/lectures/5LN455-F8.pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123
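Projectivity can be checked by testing whether any two dependency arcs cross; a minimal sketch, assuming 1-based token indices with 0 for the root:

```python
def is_projective(heads):
    """heads[i] is the head of token i+1 (tokens 1-based, 0 = root).
    A dependency tree is projective iff no two arcs cross."""
    arcs = [(min(h, d), max(h, d)) for d, h in enumerate(heads, start=1)]
    for i, (a1, b1) in enumerate(arcs):
        for a2, b2 in arcs[i + 1:]:
            # two arcs cross iff exactly one endpoint of one lies
            # strictly inside the span of the other
            if a1 < a2 < b1 < b2 or a2 < a1 < b2 < b1:
                return False
    return True
```

For example, `[3, 0, 2, 2]` (token 1 headed by token 3, root at 2, tokens 3 and 4 headed by 2) is non-projective because the arc 3→1 spans the root attachment of token 2.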

Projective vs Non-projective

We compared our model with the best model for different projectivity ratios

Language      Projectivity %  Best (LAS)  Our (LAS)
grc_perseus   90.7            79.39       55.03 (20)
eu_bdt        95.13           84.22       74.13 (17)
hu_szeged     97.8            82.66       68.18 (14)
da_ddt        98.26           86.28       76.40 (17)
en_gum        99.6            85.05       76.44 (15)
gl_treegal    100             74.25       70.45 (10)
gl_ctg        100             82.12       79.45 (14)

Table Our model's performance gap decreases as the projectivity ratio increases

7 From the official results page and our projectivity table
Omer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusion:

We introduced "Context, Word, and Morph-feat" embeddings and showed their contribution in transition-based dependency parsing

Our Tree-stack LSTM outperformed the MLP while removing hand-crafted feature engineering

Tree-stack LSTM performed better on low-resource languages

When the training dataset size increases, tree-stack LSTM loses its advantage

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization, morphological tagging, and dependency parsing performed better. Some also jointly train a language model together with pre-trained embeddings

Attention Mechanism

Applying attention between σ-LSTM, β-LSTM, or Action-LSTM states may bring a performance improvement

Morphological Features

Finding a different way to represent morphological features

Dynamic Oracle vs Beam Training

Although I tried both of them, I could not obtain a performance improvement. There may be convergence problems with our loss function, and other losses (e.g., CRF) may solve this problem

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap, Erenay Dayanık, and Deniz Yuret. 2018. Tree-stack LSTM in Transition Based Dependency Parsing. In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap, Berkay Furkan Onder, and Deniz Yuret. 2017. Parsing with Context Embeddings. In Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann, Carlos Gómez-Rodríguez, and Giorgio Satta. 2011. Dynamic programming algorithms for transition-based dependency parsers. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies - Volume 1. Association for Computational Linguistics, pages 673-682.

S. Kübler, R. McDonald, and J. Nivre. 2009. Dependency Parsing. Morgan & Claypool, US.

Chris Dyer, Miguel Ballesteros, Wang Ling, Austin Matthews, and Noah A. Smith. 2015. Transition-based dependency parsing with stack long short-term memory. CoRR abs/1505.08075.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123


Contributions in CoNLL17

Omer Kırnap (Koc University) MSc Thesis September 27 2018 46 123

Context and Word Embeddings

Relative contributions of part-of-speech (p), word vector (v), and context vector (c)

Feats  Hungarian  En-ParTUT  Latvian
p      63.6       76.6       55.9
v      73.5       75.9       63.0
c      72.2       76.0       63.5
v-c    76.0       79.0       67.6
p-c    78.0       82.5       70.6
p-v    76.6       80.8       67.7
p-fb   74.7       79.7       66.3
p-v-c  79.3       83.2       74.2

Omer Kırnap (Koc University) MSc Thesis September 27 2018 47 123

Context and Word Embeddings

Relative contributions of part-of-speech (p), word vector (v), and context vector (c)

Feats  Hungarian  En-ParTUT  Latvian
p      63.6       76.6       55.9
v      73.5       75.9       63.0
c      72.2       76.0       63.5
v-c    76.0       79.0       67.6
p-c    78.0       82.5       70.6
p-v    76.6       80.8       67.7
p-fb   74.7       79.7       66.3
p-v-c  79.3       83.2       74.2

Context vectors provide an independent contribution on top of POS tags

Omer Kırnap (Koc University) MSc Thesis September 27 2018 48 123

Context and Word embeddings

Feats  Hungarian  En-ParTUT  Latvian
p      63.6       76.6       55.9
v      73.5       75.9       63.0
c      72.2       76.0       63.5
v-c    76.0       79.0       67.6
p-c    78.0       82.5       70.6
p-v    76.6       80.8       67.7
p-fb   74.7       79.7       66.3
p-v-c  79.3       83.2       74.2

Our BiLSTM language model word vectors perform better than FB vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 49 123

Context and Word embeddings

Feats  Hungarian  En-ParTUT  Latvian
p      63.6       76.6       55.9
v      73.5       75.9       63.0
c      72.2       76.0       63.5
v-c    76.0       79.0       67.6
p-c    78.0       82.5       70.6
p-v    76.6       80.8       67.7
p-fb   74.7       79.7       66.3
p-v-c  79.3       83.2       74.2

Both POS tags and context vectors have significant contributions on top of word vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 50 123
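The context vector (c) of a word comes from a BiLSTM language model: a forward state summarizing the left context and a backward state summarizing the right context. A deliberately tiny scalar sketch of the idea (illustrative only, with made-up weights; the thesis model uses real LSTMs and vector embeddings):

```python
import math

def rnn_states(xs, w=0.5, u=0.3):
    """Tiny scalar Elman RNN: h_t = tanh(w*x_t + u*h_{t-1})."""
    h, out = 0.0, []
    for x in xs:
        h = math.tanh(w * x + u * h)
        out.append(h)
    return out

def context_vectors(word_scalars):
    """Context vector of word i = (forward state over its left context,
    backward state over its right context), BiLSTM-LM style."""
    fwd = rnn_states(word_scalars)
    bwd = rnn_states(word_scalars[::-1])[::-1]
    n = len(word_scalars)
    return [(fwd[i - 1] if i > 0 else 0.0,       # left context state
             bwd[i + 1] if i + 1 < n else 0.0)   # right context state
            for i in range(n)]
```

Because the word itself is excluded from its own context states, c carries information complementary to the word vector v, which matches the v-c and p-v-c gains in the table.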

Issues with MLP

However

Choosing the correct parser state representation still remains critical

We are unable to represent the whole parsing history with feature extraction

Omer Kırnap (Koc University) MSc Thesis September 27 2018 51 123

Solution

Find a recurrent architecture that can summarize the parsing history as well as the word sequences in the buffer and stack

Omer Kırnap (Koc University) MSc Thesis September 27 2018 52 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to Universal Dependencies

CoNLL17

• Koc-University team with MLP Parser using Context Embeddings

CoNLL18

• KParse team with Tree-stack LSTM Parser using Context and Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 53 123

c Tree-stack LSTM Parser (CoNLL18)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 54 123

Related Work - Stack LSTM

Figure Stack LSTM [Dyer et al 2015]

Represent each component (σ, β, A) with an LSTM
Modify the head word's embedding with the dependent's embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 55 123

Problems with Stack LSTM

They only modify the stack's word embeddings

Hidden states of the LSTMs are not updated unless a reduce occurs

Actions are not explicitly represented

They only used word2vec embeddings [Mikolov et al 2013]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 56 123

Our solution

We propose

Context embeddings should improve parsing accuracy

Dependency relations should be explicitly represented

Morphological Features of a word may enhance parsing accuracy

Omer Kırnap (Koc University) MSc Thesis September 27 2018 57 123

Tree-stack LSTM Overview

[Figure: Tree-stack LSTM architecture — σ-, β- and Action-LSTM outputs are concatenated and fed to the MLP; the t-RNN combines head word, dependent word, and dependency relation embeddings]

We propose Tree-stack LSTM model with 4 components

β-LSTM, σ-LSTM, Action-LSTM, Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 58 123

Tree-stack LSTM

Input Representation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 59 123

Input Representation

Action and Dependency Relation Embeddings

Every action is represented with continuous vector

Every dependency relation is represented with continuous vector

Omer Kırnap (Koc University) MSc Thesis September 27 2018 60 123

Input Representation

We do not include an explicit feature extractor. We initialize the word representation by concatenating:

Character-based LSTM's word vectors

Word-based BiLSTM's context vectors

Part-of-speech (POS) vectors

Morph-feat vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 61 123
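The concatenation itself is trivial; a sketch with illustrative (not actual) dimensions:

```python
def word_representation(char_vec, context_vec, pos_vec, morph_vec):
    """A word enters the parser as one concatenated vector:
    char-LSTM word vector + BiLSTM context vector
    + POS embedding + morph-feat embedding."""
    return char_vec + context_vec + pos_vec + morph_vec

# illustrative dimensions: 4 + 6 + 2 + 2 = 14
rep = word_representation([0.1] * 4, [0.2] * 6, [0.3] * 2, [0.4] * 2)
```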

Input Representation

Morp-feat Vectors

Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs IT It

Figure Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 62 123
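A UD FEATS string like the one above splits into key=value pairs, each of which can be looked up in an embedding table. A sketch with summation pooling, one plausible way to handle the variable number of features (table values and pooling choice are illustrative):

```python
def parse_morph_feats(feats):
    """Split a UD FEATS string such as 'Case=Nom|Number=Sing'
    into (feature, value) pairs; '_' means no features."""
    if feats in ("", "_"):
        return []
    return [tuple(kv.split("=", 1)) for kv in feats.split("|")]

def morph_vector(feats, table, dim=2):
    """Sum the embeddings of the individual pairs to pool a variable
    number of features into one fixed-size morph-feat vector."""
    vec = [0.0] * dim
    for pair in parse_morph_feats(feats):
        for i, v in enumerate(table.get(pair, [0.0] * dim)):
            vec[i] += v
    return vec

# toy embedding table (illustrative values)
table = {("Case", "Nom"): [1.0, 0.0], ("Number", "Sing"): [0.0, 1.0]}
```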

Tree-stack LSTM

Model Components: 1. β-LSTM  2. σ-LSTM  3. Action-LSTM  4. Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 63 123

β-LSTM

[Figure: Tree-stack LSTM architecture with the β-LSTM component highlighted]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 64 123

β-LSTM

LSTM LSTM LSTM

w_i   w_{i+1}   w_{i+2}

Figure Buffer's β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 65 123

σ-LSTM

[Figure: Tree-stack LSTM architecture with the σ-LSTM component highlighted]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 66 123

σ-LSTM

LSTM LSTM LSTM

si si+1 si+2

Figure Stack's σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 67 123

Action-LSTM

[Figure: Tree-stack LSTM architecture with the Action-LSTM component highlighted]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 68 123

Action-LSTM

LSTM LSTM LSTM

Figure Action-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 69 123

How are the components of tree-stack LSTM connected?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 70 123

Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 71 123

Tree-RNN (t-RNN)

t-RNN

Dependent word

Dependency Relation

Head word

Figure t-RNN

w_head^new = tanh(W_rnn * [w_head^old ; d_l ; w_dep] + b_rnn)   (1)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 72 123
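Eq. (1) in executable form, with a pure-Python matrix-vector product and illustrative dimensions (head 2, relation 1, dependent 2):

```python
import math

def t_rnn(w_head, d_rel, w_dep, W, b):
    """Eq. (1): new head = tanh(W_rnn @ [head; relation; dependent] + b_rnn)."""
    x = w_head + d_rel + w_dep  # concatenation of the three embeddings
    return [math.tanh(sum(wij * xj for wij, xj in zip(row, x)) + bi)
            for row, bi in zip(W, b)]

# toy parameters: 2x5 weight matrix picking one coordinate per row
W = [[1.0, 0.0, 0.0, 0.0, 0.0],
     [0.0, 0.0, 0.0, 0.0, 1.0]]
b = [0.0, 0.0]
new_head = t_rnn([0.5, 0.0], [0.0], [0.0, 0.5], W, b)
```

The updated head embedding then replaces the old one in the stack or buffer, so later transitions see the subtree built so far.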

Tree-RNN with

1. Left Transition  2. Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 73 123

Left Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 74 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

LSTM LSTM LSTM LSTM

Left transition

t-RNN

Dependency Relation

Head   Dependent

Figure Each embedding is initialized by concatenating POS, language, and morph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 75 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

Figure Stack's top LSTM is reduced
Omer Kırnap (Koc University) MSc Thesis September 27 2018 76 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 77 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure β-LSTM recalculates its hidden state based on the new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 78 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure Tree-stack LSTM is ready to give new transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 79 123

Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 80 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

LSTM LSTM LSTM LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure Each embedding is initialized by concatenating POS, language, and morph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 81 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure Stack's top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 82 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 83 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure σ-LSTM recalculates its hidden state from the new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 84 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure Tree-stack LSTM is ready to give new transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 85 123
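The two transitions above, in executable form on toy integer tokens (a sketch of the arc rules as written on the slides; `d` is the dependency label, and token 0 stands for the root):

```python
def shift(stack, buffer):
    """shift: move the buffer front onto the stack."""
    stack.append(buffer.pop(0))

def left(stack, buffer, arcs, d):
    """left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)}):
    pop s; the buffer front b becomes its head with label d."""
    s = stack.pop()
    arcs.add((buffer[0], d, s))

def right(stack, buffer, arcs, d):
    """right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)}):
    pop t; the word s below it on the stack becomes its head."""
    t = stack.pop()
    arcs.add((stack[-1], d, t))

# toy parse of a 2-word sentence: root -> 1 -> 2
stack, buffer, arcs = [0], [1, 2], set()
shift(stack, buffer)
shift(stack, buffer)
right(stack, buffer, arcs, "obj")   # 2 attaches to 1
right(stack, buffer, arcs, "root")  # 1 attaches to the root
```

In the full parser each new arc additionally triggers the t-RNN update of the head's embedding and a recomputation of the affected LSTM state.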

Final overview of Tree-stack LSTM

[Figure: Tree-stack LSTM architecture — σ-, β- and Action-LSTM outputs are concatenated and fed to the MLP; the t-RNN combines head word, dependent word, and dependency relation embeddings]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 86 123

Overview

1. Introduction: Overview of Dependency Parsing; Transition Based Dependency Parsing

2. Related Work: Linear Models and their Drawbacks; Neural Network Models

3. Model: Language Model; MLP Parser; Tree-stack LSTM Parser

4. Results: MLP vs Tree-stack LSTM; Morphological Feature Embeddings; Static vs Dynamic Oracle Training; Transfer Learning

5. Conclusion
6. Future Work & Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 87 123

4. Results & Comparisons

Omer Kırnap (Koc University) MSc Thesis September 27 2018 88 123

Results & Comparisons

Dataset

CoNLL17:
• Dependency parsing of 81 treebanks in 49 languages
• All treebanks use standardized annotation: 17 universal part-of-speech tags and 37 universal dependency relations
• Koc-University ranked 7th out of 33 participants (1st among transition-based parsers)

CoNLL18:
• Dependency parsing of 82 treebanks in 57 languages
• All treebanks use standardized annotation: 17 universal part-of-speech tags and 37 universal dependency relations
• Koc-University ranked 16th out of 30 participants (2nd among transition-based parsers)

Changes from CoNLL17 to CoNLL18: 1. Train/test split change  2. Annotation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 89 123

MLP vs Tree-stack LSTM

The CoNLL 2018 committee released comparison results of CoNLL17 and CoNLL18 systems tested under the same test sets

Omer Kırnap (Koc University) MSc Thesis September 27 2018 90 123

MLP vs Tree-stack LSTM

2 possible problems of official comparison

1. If the annotation of the treebank is improved, the older parser is handicapped

2. If the training-test split has changed and the old training data are now in the test data, the old parser is favored undeservedly

Omer Kırnap (Koc University) MSc Thesis September 27 2018 91 123



Context and Word Embeddings

Relative contributions of part-of-speech (p) word vector (v)context vector (c)

Feats Hungarian En-ParTUT Latvianp 636 766 559

v 735 759 63

c 722 76 635

v-c 76 79 676

p-c 78 825 706

p-v 766 808 677

p-fb 747 797 663

p-v-c 793 832 742

Omer Kırnap (Koc University) MSc Thesis September 27 2018 47 123

Context and Word Embeddings

Relative contributions of part-of-speech (p) word vector (v)context vector (c)

Feats Hungarian En-ParTUT Latvianp 636 766 559

v 735 759 63

c 722 76 635

v-c 76 79 676

p-c 78 825 706

p-v 766 808 677

p-fb 747 797 663

p-v-c 793 832 742

Context vectors provide independent contribution on top ofPOS tags

Omer Kırnap (Koc University) MSc Thesis September 27 2018 48 123

Context and Word embeddings

Feats Hungarian En-ParTUT Latvianp 636 766 559

v 735 759 63

c 722 76 635

v-c 76 79 676

p-c 78 825 706

p-v 766 808 677p-fb 747 797 663

p-v-c 793 832 742

Our BiLSTM language model word vectors perform betterthan FB vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 49 123

Context and Word embeddings

Feats   Hungarian  En-ParTUT  Latvian
p       63.6       76.6       55.9
v       73.5       75.9       63.0
c       72.2       76.0       63.5
v-c     76.0       79.0       67.6
p-c     78.0       82.5       70.6
p-v     76.6       80.8       67.7
p-fb    74.7       79.7       66.3
p-v-c   79.3       83.2       74.2

Both POS tags and context vectors have significant contributions on top of word vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 50 123

Issues with MLP

However

Choosing the correct parser-state features still remains critical

We are unable to represent the whole parsing history with feature extraction

Omer Kırnap (Koc University) MSc Thesis September 27 2018 51 123

Solution

Find a recurrent architecture that can summarize the parsing history as well as the word sequences in the buffer and stack

Omer Kırnap (Koc University) MSc Thesis September 27 2018 52 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to Universal Dependencies

CoNLL17

• Koc-University team with MLP Parser using Context Embeddings

CoNLL18

• KParse team with Tree-stack LSTM Parser using Context and Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 53 123

c Tree-stack LSTM Parser (CoNLL18)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 54 123

Related Work - Stack LSTM

Figure Stack LSTM [Dyer et al 2015]

Represent each component (σ, β, A) with an LSTM
Modify the head word's embedding with the dependent's embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 55 123

Problems with Stack LSTM

They only modify the stack's word embeddings

Hidden states of LSTMs are not updated unless a reduce occurs

Actions are not explicitly represented

They only used word2vec embeddings [Mikolov et al 2013]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 56 123

Our solution

We propose

Context embeddings should improve parsing accuracy

Dependency relations should be explicitly represented

Morphological Features of a word may enhance parsing accuracy

Omer Kırnap (Koc University) MSc Thesis September 27 2018 57 123

Tree-stack LSTM Overview

Figure Tree-stack LSTM architecture: t-RNN (head word, dependent word, dependency relation) together with the β-, σ-, and Action-LSTM outputs, concatenated and fed to an MLP

We propose the Tree-stack LSTM model with 4 components:

1 β-LSTM
2 σ-LSTM
3 Action-LSTM
4 Tree-RNN
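As a rough sketch of how the four component summaries combine, the β-, σ-, and Action-LSTM hidden states and the t-RNN output are concatenated and scored by an MLP. The toy dimensions, weight shapes, and function names below are my assumptions, not the thesis implementation:

```python
import math
import random

random.seed(0)

def score_transitions(components, hidden_W, out_W):
    """Concatenate component summary vectors (β-, σ-, Action-LSTM, t-RNN output),
    pass them through one tanh hidden layer, and return a score per transition."""
    x = [v for comp in components for v in comp]  # vector concatenation
    hidden = [math.tanh(sum(w * xi for w, xi in zip(row, x))) for row in hidden_W]
    return [sum(w * h for w, h in zip(row, hidden)) for row in out_W]

# toy sizes: 4 components of dim 3, hidden layer of 5, 3 transitions (shift/left/right)
dim, hid, n_trans = 3, 5, 3
hidden_W = [[random.uniform(-1, 1) for _ in range(4 * dim)] for _ in range(hid)]
out_W = [[random.uniform(-1, 1) for _ in range(hid)] for _ in range(n_trans)]
scores = score_transitions([[0.1] * dim] * 4, hidden_W, out_W)
```

The highest-scoring transition would then be applied to the parser state.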

Omer Kırnap (Koc University) MSc Thesis September 27 2018 58 123

Tree-stack LSTM

Input Representation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 59 123

Input Representation

Action and Dependency Relation Embeddings

Every action is represented with a continuous vector

Every dependency relation is represented with a continuous vector

Omer Kırnap (Koc University) MSc Thesis September 27 2018 60 123

Input Representation

We do not include an explicit feature extractor. We initialize the word representation by concatenating:

Character-based LSTM's word vectors

Word-based BiLSTM's context vectors

Part-of-speech (POS) vectors

Morph-feat vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 61 123

Input Representation

Morph-feat Vectors

Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs IT It

Figure Morph-feat Embeddings
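The FEATS string in the figure can be turned into an embedding in the spirit of the slide; a minimal sketch, where the embedding table and pooling-by-sum are illustrative assumptions:

```python
def parse_feats(feats):
    """Split a CoNLL-U FEATS string such as the slide's example into (feature, value) pairs."""
    if feats == "_":  # CoNLL-U convention for "no features"
        return []
    return [tuple(pair.split("=", 1)) for pair in feats.split("|")]

def morph_feat_vector(feats, emb, dim=3):
    """Sum the embedding of every feature=value pair (zeros for unseen pairs)."""
    vec = [0.0] * dim
    for pair in parse_feats(feats):
        for i, x in enumerate(emb.get(pair, [0.0] * dim)):
            vec[i] += x
    return vec

# hypothetical 3-dimensional embedding table
emb = {("Case", "Nom"): [1.0, 0.0, 0.0], ("Number", "Sing"): [0.0, 1.0, 0.0]}
vec = morph_feat_vector("Case=Nom|Number=Sing", emb)
```

In practice the embeddings would be trained jointly with the parser rather than fixed.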

Omer Kırnap (Koc University) MSc Thesis September 27 2018 62 123

Tree-stack LSTM

Model Components
1 β-LSTM
2 σ-LSTM
3 Action-LSTM
4 Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 63 123

β-LSTM

Figure Tree-stack LSTM architecture (β-LSTM highlighted)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 64 123

β-LSTM

Figure Buffer's β-LSTM over the upcoming words wi, wi+1, wi+2

Omer Kırnap (Koc University) MSc Thesis September 27 2018 65 123

σ-LSTM

Figure Tree-stack LSTM architecture (σ-LSTM highlighted)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 66 123

σ-LSTM

Figure Stack's σ-LSTM over the stack items si, si+1, si+2

Omer Kırnap (Koc University) MSc Thesis September 27 2018 67 123

Action-LSTM

Figure Tree-stack LSTM architecture (Action-LSTM highlighted)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 68 123

Action-LSTM

Figure Action-LSTM over the transition history

Omer Kırnap (Koc University) MSc Thesis September 27 2018 69 123

How are the components of tree-stack LSTM connected?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 70 123

Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 71 123

Tree-RNN (t-RNN)

Figure t-RNN combining the head word, dependent word, and dependency relation

w_head_new = tanh(W_rnn * [w_head_old; d_l; w_dep] + b_rnn)    (1)
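Eq. (1) can be written out directly; a small sketch with toy dimensions (the weight values are placeholders):

```python
import math

def trnn_compose(w_head, d_l, w_dep, W, b):
    """Eq. (1): new head embedding = tanh(W_rnn * [w_head_old; d_l; w_dep] + b_rnn)."""
    x = w_head + d_l + w_dep  # concatenation [head; relation; dependent]
    return [math.tanh(sum(wi * xi for wi, xi in zip(row, x)) + bi)
            for row, bi in zip(W, b)]

# toy sizes: word dim 2, relation dim 1 -> input dim 5, output dim 2
W = [[0.1] * 5, [-0.1] * 5]
b = [0.0, 0.0]
new_head = trnn_compose([0.5, -0.5], [0.0], [0.25, 0.25], W, b)
```

The tanh keeps the composed head embedding in the same range as the original embeddings, so the composition can be applied repeatedly as the tree grows.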

Omer Kırnap (Koc University) MSc Thesis September 27 2018 72 123

Tree-RNN with

1 Left Transition
2 Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 73 123

Left Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 74 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure Left transition: each embedding is initialized by concatenating POS, language, and morph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 75 123
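The left_d and right_d definitions amount to simple list operations on the parser state (σ, β, A); a sketch, with arcs stored as (head, label, dependent) triples and all names mine:

```python
def left_arc(sigma, beta, arcs, d):
    """left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)}):
    the stack top s becomes a dependent of the buffer front b."""
    s = sigma.pop()
    arcs.add((beta[0], d, s))  # head is the buffer front
    return sigma, beta, arcs

def right_arc(sigma, beta, arcs, d):
    """right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)}):
    the stack top t becomes a dependent of the item s below it."""
    t = sigma.pop()
    arcs.add((sigma[-1], d, t))  # head is the new stack top
    return sigma, beta, arcs

# tiny example on word indices
s1, b1, a1 = left_arc([0, 1], [2, 3], set(), "nsubj")
s2, b2, a2 = right_arc([0, 1, 2], [3], set(), "obj")
```

In the full model, each such transition also triggers the t-RNN update of the head's embedding and a recomputation of the affected LSTM hidden states, as the following slides illustrate.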

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure Stack's top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 76 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})


Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 77 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})


Figure β-LSTM recalculates its hidden state based on the new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 78 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})


Figure Tree-stack LSTM is ready to predict the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 79 123

Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 80 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})


Figure Right transition: each embedding is initialized by concatenating POS, language, and morph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 81 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})


Figure Stack's top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 82 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})


Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 83 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})


Figure σ-LSTM recalculates its hidden state from the new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 84 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})


Figure Tree-stack LSTM is ready to predict the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 85 123

Final overview of Tree-stack LSTM

Figure Tree-stack LSTM architecture: t-RNN (head word, dependent word, dependency relation) together with the β-, σ-, and Action-LSTM outputs, concatenated and fed to an MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 86 123

Overview

1 Introduction
  Overview of Dependency Parsing
  Transition Based Dependency Parsing

2 Related Work
  Linear Models and their Drawbacks
  Neural Network Models

3 Model
  Language Model
  MLP Parser
  Tree-stack LSTM Parser

4 Results
  MLP vs Tree-stack LSTM
  Morphological Feature Embeddings
  Static vs Dynamic Oracle Training
  Transfer Learning

5 Conclusion
6 Future Work & Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 87 123

4 Results & Comparisons

Omer Kırnap (Koc University) MSc Thesis September 27 2018 88 123

Results & Comparisons

Dataset

CoNLL17
• Dependency parsing of 81 treebanks in 49 languages
• All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations
• Koc-University ranked 7th out of 33 participants (1st among transition based parsers)

CoNLL18
• Dependency parsing of 82 treebanks in 57 languages
• All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations
• Koc-University ranked 16th out of 30 participants (2nd among transition based parsers)

Differences from CoNLL17 to CoNLL18: 1 Train/test split change 2 Annotation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 89 123

MLP vs Tree-stack LSTM

The CoNLL 2018 committee released comparison results of CoNLL17 and CoNLL18 systems tested on the same test sets

Omer Kırnap (Koc University) MSc Thesis September 27 2018 90 123

MLP vs Tree-stack LSTM

Two possible problems of the official comparison:

1 If the annotation of the treebank is improved, the older parser is handicapped

2 If the training-test split has changed and the old training data are now in the test data, the old parser is favored undeservedly

Omer Kırnap (Koc University) MSc Thesis September 27 2018 91 123

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code        MLP    Tree-stack
ru_taiga (10k)   58.89  60.55
hu_szeged (20k)  66.21  68.18
tr_imst (50k)    56.78  58.75
ar_padt (120k)   67.83  68.14
en_ewt (205k)    74.87  75.77
cs_cac (473k)    83.39  83.57

Tree-stack LSTM outperforms MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 92 123

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 93 123

MLP Parser


Figure Initial model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 94 123

Only Action LSTM


Figure Only action LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 95 123

Only β-LSTM


Figure Only β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 96 123

Only σ-LSTM


Figure Only σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang Code   MLP    Only Action  Only-β  Only-σ
hu_szeged   66.21  66.87        66.94   67.03
sv_lines    71.12  72.05        72.17   72.45
tr_imst     57.12  56.87        57.02   57.12
ar_padt     67.83  66.67        66.89   66.92
cs_cac      83.89  82.23        83.13   83.17
en_ewt      75.54  75.43        75.56   75.67

Table Comparison between MLP and "Only" models

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN

Figure Tree-stack LSTM architecture (t-RNN highlighted)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 99 123

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang Code           without t-RNN  with t-RNN
no_nynorsklia (3k)  51.78          53.33
ru_taiga (11k)      59.13          60.55
gl_treegal (15k)    69.76          70.45
hu_szeged (20k)     66.12          68.18
sv_lines (49k)      74.04          75.46
tr_imst (50k)       58.12          58.75
ar_padt (120k)      68.04          68.14
en_ewt (204k)       74.87          75.77
cs_cac (473k)       82.89          83.57
cs_pdt (1M)         81.17          81.164

t-RNN provides a comparative advantage for low-resource languages

Omer Kırnap (Koc University) MSc Thesis September 27 2018 100 123

Ablation Analysis

Overall results of ablation analysis

Lang       MLP    Only A  Only-β  Only-σ  w/o t-RNN  all
hu_szeged  66.21  66.87   66.94   67.03   66.12      68.18
sv_lines   71.12  72.05   72.17   74.04   72.17      75.46
tr_imst    57.12  56.87   57.02   57.12   58.12      58.75
ar_padt    67.83  66.67   66.89   66.92   68.04      68.14
cs_cac     83.89  82.23   83.13   83.17   82.89      83.57
en_ewt     75.54  75.43   75.56   75.67   74.87      75.77

Tree-stack LSTM beats other model variations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of Ablation Experiments

t-RNN's performance contribution increases when the training size decreases

σ-LSTM provides more useful information, independent of dataset size

Interconnecting the model's components with t-RNN makes tree-stack LSTM more powerful for low-resource languages (ranked 10th of all and 2nd among transition based parsers)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What does Morphological Feature Embedding provide?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental Settings: We divide the CoNLL18 UD dataset (v2.2) into 4 parts based on the number of training tokens for each language, to better understand our contributions

Languages having less than 20k tokens

Languages having more than 20k, less than 50k tokens

Languages having more than 50k, less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having less than 20k training tokens

Lang code      Morph-Feats  no Morph-Feats  # of tokens
no_nynorsklia  51.13        53.33           3583
ru_taiga       58.32        60.55           10479
sme_giella     52.78        53.39           16385
la_perseus     49.93        51.6            18184
ug_udt         52.78        53.39           19262
sl_sst         46.72        48.77           19473
hu_szeged      66.23        68.18           20166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having between 50k and 100k tokens

Lang code      Morph-Feats  no Morph-Feats  # of tokens
sv_lines       72.18        74.81           48325
fr_sequoia     84.36        82.17           50543
en_gum         76.44        75.34           53686
ko_gsd         73.74        72.54           56687
eu_bdt         74.55        73.32           72974
nl_lassysmall  76.7         75.8            75134
gl_ctg         79.02        79.018          79327
lv_lvtb        72.33        72.24           80666
id_gsd         75.76        73.97           97531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having more than 100k training tokens

Lang code  Morph-Feats  no Morph-Feats  # of tokens
fa_seraji  81.18        81.12           121064
bg_btb     84.53        84.55           124336
en_ewt     75.77        75.682          204585
ar_padt    68.02        68.14           223881
de_gsd     71.59        71.32           263804
ca_ancora  85.89        85.874          417587
es_ancora  84.99        84.78           444617
cs_cac     83.57        83.63           472608
cs_pdt     81.43        82.12           1173282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle: transitions follow the gold moves
Dynamic oracle: transitions follow the predicted moves

In both cases, the log-probability of the gold moves is maximized
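The difference can be sketched as a tiny training-loop skeleton (the function names are hypothetical stand-ins, not the thesis code): the oracle choice only changes which move drives the parser to its next state, while the training pair always records the gold move.

```python
def train_epoch(state, gold_move, predict, step, n_steps, dynamic):
    """Collect (state, gold move) training pairs along one pass.
    Static training follows the gold move; dynamic training follows the
    model's predicted move. The supervision target is the gold move either way."""
    pairs = []
    for _ in range(n_steps):
        g = gold_move(state)
        pairs.append((state, g))
        move = predict(state) if dynamic else g  # the only difference between regimes
        state = step(state, move)
    return pairs

# toy example: states are integers, the gold move is +1, the model predicts -1
static_states = [s for s, _ in train_epoch(0, lambda s: 1, lambda s: -1,
                                           lambda s, m: s + m, 3, dynamic=False)]
dynamic_states = [s for s, _ in train_epoch(0, lambda s: 1, lambda s: -1,
                                            lambda s, m: s + m, 3, dynamic=True)]
```

The toy run makes the contrast visible: the static regime only ever sees gold states, while the dynamic regime trains on the states its own (wrong) predictions lead to.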

Figure Tree-stack LSTM architecture: t-RNN (head word, dependent word, dependency relation) together with the β-, σ-, and Action-LSTM outputs, concatenated and fed to an MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning:
1 Using very limited data to train a LM for word and context vectors, and use them to train a parser from scratch
2 Using Facebook's word vectors to train a parser [Bojanowski et al 2017]
3 Using my own word and context vectors trained on a different language but from the same language family
4 Applying transfer learning with a pre-trained parser

Language      (1)           (2)    (3)    (4)
af_afribooms  not provided  75.46  77.43  78.12
kk_ktb        20.19         22.31  21.96  23.86
bxr_bdt       7.64          9.76   9.93   8.98
kmr_mg        20.12         22.57  22.78  23.39

Table LAS values for strategies (1) (2) (3) and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the most beneficial

From-scratch LM training does not bring useful word and context vectors

Our word and context vectors are still more useful than Facebook's [Bojanowski et al 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

A transition based parser can only build projective trees 6

6 Figure from https://stp.lingfil.uu.se/~sara/kurser/5LN455-2014/lectures/5LN455-F8.pdf
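Projectivity here means no two dependency arcs cross; assuming a CoNLL-U-style heads array (index 0 is the artificial root), a simple O(n²) check looks like:

```python
def is_projective(heads):
    """heads[i] is the head of word i (words are 1..n; heads[0] is a placeholder;
    head 0 means the artificial root). A tree is projective iff no two arcs cross."""
    arcs = [(min(h, d), max(h, d)) for d, h in enumerate(heads[1:], start=1)]
    for i, (l1, r1) in enumerate(arcs):
        for l2, r2 in arcs[i + 1:]:
            if l1 < l2 < r1 < r2 or l2 < l1 < r2 < r1:  # properly interleaved spans
                return False
    return True

projective = is_projective([0, 2, 0, 2])        # 1->2, 2->root, 3->2: projective
nonprojective = is_projective([0, 0, 0, 1, 2])  # arcs (1,3) and (2,4) cross
```

Non-projective trees would require extra transitions (e.g. swap) or post-processing, which is why projectivity ratios matter in the comparison below.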

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123

Projective vs Non-projective

We compared our model with the best model for different projectivity ratios

Language     Projectivity  Best (LAS)  Our (LAS)
grc_perseus  90.7          79.39       55.03 (20)
eu_bdt       95.13         84.22       74.13 (17)
hu_szeged    97.8          82.66       68.18 (14)
da_ddt       98.26         86.28       76.40 (17)
en_gum       99.6          85.05       76.44 (15)
gl_treegal   100           74.25       70.45 (10)
gl_ctg       100           82.12       79.45 (14)

Table Our model's performance gap decreases as the projectivity ratio increases

7

7 From the official results page and our projectivity table
Omer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusion:
We introduced "Context, Word and Morph-feat" embeddings and showed their contribution in transition based dependency parsing

Our Tree-stack LSTM outperformed MLP by removing hand-crafted feature engineering

Tree-stack LSTM performed better with low-resource languages

When the training dataset size increases, tree-stack LSTM loses its advantage

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization, morphological tagging, and dependency parsing performed better. Some also jointly train a language model together with pre-trained embeddings.

Attention Mechanism

Applying attention between the σ-LSTM, β-LSTM, or Action-LSTM states may bring a performance improvement

Morphological Features

Finding a different way to represent morphological features

Dynamic Oracle vs Beam Training

Although I tried both of them, I could not obtain a performance improvement. There may be convergence problems with our loss function, and other losses (e.g. CRF) may solve this problem.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Ömer Kırnap, Erenay Dayanık, and Deniz Yuret. 2018. Tree-stack LSTM in Transition Based Dependency Parsing. In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Ömer Kırnap, Berkay Furkan Önder, and Deniz Yuret. 2017. Parsing with Context Embeddings. In Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann, Carlos Gómez-Rodríguez, and Giorgio Satta. 2011. Dynamic programming algorithms for transition-based dependency parsers. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies - Volume 1. Association for Computational Linguistics, pages 673–682.

S. Kübler, R. McDonald, and J. Nivre. 2009. Dependency Parsing. Morgan & Claypool, US.

Chris Dyer, Miguel Ballesteros, Wang Ling, Austin Matthews, and Noah A. Smith. 2015. Transition-based dependency parsing with stack long short-term memory. CoRR abs/1505.08075.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123


Lang code Morph-Feats no Morph-Feats of tokensno nynorsklia 5113 5333 3583

ru taiga 5832 6055 10479

sme giella 5278 5339 16385

la perseus 4993 516 18184

ug udt 5278 5339 19262

sl sst 4672 4877 19473

hu szeged 6623 6818 20166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having tokens in between 50k and100k

Lang code Morph-Feats no Morph-Feats of tokenssv lines 7218 7481 48325

fr sequoia 8436 8217 50543

en gum 7644 7534 53686

ko gsd 7374 7254 56687

eu bdt 7455 7332 72974

nl lassymal 767 758 75134

gl ctg 7902 79018 79327

lv lvtb 7233 7224 80666

id gsd 7576 7397 97531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having more than 100k trainingtokens

Lang code Morph-Feats no Morph-Feats of tokensfa seraji 8118 8112 121064

bg btb 8453 8455 124336

en ewt 7577 75682 204585

ar padt 6802 6814 223881

de gsd 7159 7132 263804

ca ancora 8589 85874 417587

es ancora 8499 8478 444617

cs cac 8357 8363 472608

cs pdt 8143 8212 1173282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle transitions using gold movesDynamic oracle transitions using predicted moves

In both cases logp of gold moves maximized

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning1 Using very limited data to train LM for word and context vectors and

use them to train a parser from scratch2 Using Facebookrsquos word vectors to train a parser [Bojanowski et al

2017]3 Using my own word and context vectors trained with different

language but from the same language family4 Applying transfer learning with a pre-trained parser

Language (1) (2) (3) (4)af afribooms not provided 7546 7743 7812kk ktb 2019 2231 2196 2386bxr bdt 764 976 993 898

kmr mg 2012 2257 2278 2339

Table LAS values for strategies (1) (2) (3) and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the mostbeneficial

From scratch LM training does not bring useful word and contextvectors

Our word and context vectors are still more useful than Facebookrsquos[Bojanowski et al 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition Based Parser can only build projective trees 6

6Figure fromhttpstplingfiluuse sarakurser5LN455-2014lectures5LN455-F8pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123

Projective vs Non-projective

We compared our model with the best model for different projectivityratios

Language Projectiviy Best (LAS) Our (LAS)

grc perseus 907 7939 5503 (20)

eu bdt 9513 8422 7413 (17)

hu szeged 978 8266 6818 (14)

da ddt 9826 8628 7640 (17)

en gum 996 8505 7644 (15)

gl treegal 100 7425 7045 (10)

gl ctg 100 8212 7945 (14)

Table Our models performance gap decreases as the projectivity ratio increases

7

7From official results page and our projectivity tableOmer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusionWe introduced ldquoContext Word and Morph-featrdquo embeddings and showedtheir contribution in transition based dependency parsing

Our Tree-stack LSTM outperformed MLP by removing hand-craftedfeature engineering

Tree-stack LSTM performed better with low resource languages

When the training dataset size increases tree-stack LSTM losses itsadvantage

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization morphological taggingand dependency parsing performed better Some are also jointly trained alanguage model together with pre-trained embeddings

Attention Mechanism

Applying attention in between σ-LSTM states or β-LSTM orAction-LSTM may bring performance improvement

Morphological Features

Finding different way to represent morphological features

Dynamic Oracle vs Beam Training

Although I tried both of them I could not obtain performanceimprovement There may be convergence problems with our loss functionand another losses (CRF) may solve this problem

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap Erenay Dayanık and Deniz Yuret 2018 Tree-stackLSTM in Transition Based Dependency Parsing In Proceedings ofthe CoNLL 2018 Shared Task Multilingual Parsing from Raw Text toUniversal Dependencies

Omer Kırnap Berkay Furkan Onder and Deniz Yuret 2017 Parsingwith Context Embeddings In Proceedings of the CoNLL 2017 SharedTask Multilingual Parsing from Raw Text to Universal Dependencies

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann Carlos Gomez-Rodriguez and Giorgio Satta 2011Dynamic programming algorithms for transition-based dependencyparsers In Proceedings of the 49th Annual Meeting of theAssociation for Computational Linguistics Human LanguageTechnologies-Volume 1 Association for Computational Linguisticspages 673682

S Kbler R McDonald and J Nivre 2009 Dependency parsingMorgan amp Claypool US

Chris Dyer Miguel Ballesteros Wang Ling Austin Matthews andNoah A Smith 2015 Transition based dependency parsing withstack long-short term memory CoRR abs150508075

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123

  • Introduction
    • Overview of Dependency Parsing
    • Transition Based Dependency Parsing
      • Related Work
        • Linear Models and their Drawbacks
        • Neural Network Models
          • Model
            • Language Model
            • MLP Parser
            • Tree-stack LSTM Parser
              • Results
                • MLP vs Tree-stack LSTM
                • Morphological Feature Embeddings
                • Static vs Dynamic Oracle Training
                • Transfer Learning
                  • Conclusion
                  • Future Work amp Discussions
Page 49: Transition Based Dependency Parsing with Deep LearningTransition Based Dependency Parsing with Deep Learning Omer K rnap Ko˘c University okirnap@ku.edu.tr September 27, 2018 Omer

Context and Word embeddings

Feats   Hungarian  En-ParTUT  Latvian
p       63.6       76.6       55.9
v       73.5       75.9       63
c       72.2       76         63.5
v-c     76         79         67.6
p-c     78         82.5       70.6
p-v     76.6       80.8       67.7
p-fb    74.7       79.7       66.3
p-v-c   79.3       83.2       74.2

Our BiLSTM language model word vectors perform better than FB vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 49 123

Context and Word embeddings

Feats   Hungarian  En-ParTUT  Latvian
p       63.6       76.6       55.9
v       73.5       75.9       63
c       72.2       76         63.5
v-c     76         79         67.6
p-c     78         82.5       70.6
p-v     76.6       80.8       67.7
p-fb    74.7       79.7       66.3
p-v-c   79.3       83.2       74.2

Both POS tags and context vectors have significant contributions on top of word vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 50 123

Issues with MLP

However

Choosing the correct parser state still remains critical

We are unable to represent the whole parsing history with feature extraction

Omer Kırnap (Koc University) MSc Thesis September 27 2018 51 123

Solution

Find a recurrent architecture that can summarize the parsing history as well as the word sequences in the buffer and stack

Omer Kırnap (Koc University) MSc Thesis September 27 2018 52 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to Universal Dependencies

CoNLL17

• Koc-University team with MLP Parser using Context Embeddings

CoNLL18

• KParse team with Tree-stack LSTM Parser using Context and Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 53 123

c. Tree-stack LSTM Parser (CoNLL18)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 54 123

Related Work - Stack LSTM

Figure Stack LSTM [Dyer et al 2015]

Represent each component (σ, β, A) with an LSTM

Modify the head word's embedding with the dependent embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 55 123

Problems with Stack LSTM

They only modify the stack's word embeddings

Hidden states of the LSTMs are not updated unless a reduce occurs

Actions are not explicitly represented

They only used word2vec embeddings [Mikolov et al 2013]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 56 123

Our solution

We propose

Context embeddings should improve parsing accuracy

Dependency relations should be explicitly represented

Morphological Features of a word may enhance parsing accuracy

Omer Kırnap (Koc University) MSc Thesis September 27 2018 57 123

Tree-stack LSTM Overview

Figure: Tree-stack LSTM overview (σ-LSTM, β-LSTM and Action-LSTM outputs are concatenated and fed to an MLP; t-RNN composes head word, dependent word and dependency relation)

We propose the Tree-stack LSTM model with 4 components:

1. β-LSTM
2. σ-LSTM
3. Action-LSTM
4. Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 58 123

Tree-stack LSTM

Input Representation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 59 123

Input Representation

Action and Dependency Relation Embeddings

Every action is represented with a continuous vector

Every dependency relation is represented with a continuous vector

Omer Kırnap (Koc University) MSc Thesis September 27 2018 60 123

Input Representation

We do not include an explicit feature extractor. We initialize each word representation by concatenating:

Character-based LSTM's word vectors

Word-based BiLSTM's context vectors

Part-of-speech (POS) vectors

Morph-feat vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 61 123
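As a concrete illustration, the concatenation above can be sketched as follows; the dimensions are made-up placeholders, not the settings used in the thesis.

```python
import numpy as np

# Hypothetical dimensions, for illustration only.
D_WORD, D_CONTEXT, D_POS, D_FEAT = 64, 64, 16, 16

def word_input(char_word_vec, context_vec, pos_vec, feat_vec):
    # One parser input per word: char-LSTM word vector + BiLSTM context
    # vector + POS embedding + morph-feat embedding, concatenated.
    return np.concatenate([char_word_vec, context_vec, pos_vec, feat_vec])

x = word_input(np.zeros(D_WORD), np.zeros(D_CONTEXT), np.zeros(D_POS), np.zeros(D_FEAT))
assert x.shape == (D_WORD + D_CONTEXT + D_POS + D_FEAT,)
```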

Input Representation

Morph-feat Vectors

Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs IT It

Figure Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 62 123
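A FEATS string like the one in the figure can be split into individual features, each of which is then looked up in its own embedding table; this is a minimal sketch of the splitting step only.

```python
def parse_morph_feats(feats):
    # Split a CoNLL-U FEATS string such as
    # "Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs"
    # into a {feature: value} dict; "_" means no features.
    if feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))

fs = parse_morph_feats("Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs")
assert fs["Case"] == "Nom" and len(fs) == 5
```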

Tree-stack LSTM

Model Components

1. β-LSTM
2. σ-LSTM
3. Action-LSTM
4. Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 63 123

β-LSTM

Figure: Tree-stack LSTM with the β-LSTM highlighted

Omer Kırnap (Koc University) MSc Thesis September 27 2018 64 123

β-LSTM

Figure: Buffer's β-LSTM runs over the upcoming words w_i, w_{i+1}, w_{i+2}

Omer Kırnap (Koc University) MSc Thesis September 27 2018 65 123

σ-LSTM

Figure: Tree-stack LSTM with the σ-LSTM highlighted

Omer Kırnap (Koc University) MSc Thesis September 27 2018 66 123

σ-LSTM

Figure: Stack's σ-LSTM runs over the stack items s_i, s_{i+1}, s_{i+2}

Omer Kırnap (Koc University) MSc Thesis September 27 2018 67 123

Action-LSTM

Figure: Tree-stack LSTM with the Action-LSTM highlighted

Omer Kırnap (Koc University) MSc Thesis September 27 2018 68 123

Action-LSTM

Figure: Action-LSTM runs over the transition history

Omer Kırnap (Koc University) MSc Thesis September 27 2018 69 123

How are the components of tree-stack LSTM connected?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 70 123

Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 71 123

Tree-RNN (t-RNN)

Figure: t-RNN combines the head word, dependent word and dependency relation vectors

w_head_new = tanh(W_rnn · [w_head_old ; d_l ; w_dep] + b_rnn)    (1)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 72 123
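Equation (1) can be sketched numerically as below; the toy dimension and random initialization are illustrative assumptions, not the thesis settings.

```python
import numpy as np

rng = np.random.default_rng(0)
D = 8                                   # toy embedding size, for illustration
W_rnn = 0.1 * rng.standard_normal((D, 3 * D))
b_rnn = np.zeros(D)

def t_rnn(w_head, d_label, w_dep):
    # Eq. (1): the updated head vector is a tanh-squashed linear map of
    # [old head ; dependency-label ; dependent] concatenated.
    return np.tanh(W_rnn @ np.concatenate([w_head, d_label, w_dep]) + b_rnn)

new_head = t_rnn(rng.standard_normal(D), rng.standard_normal(D), rng.standard_normal(D))
assert new_head.shape == (D,) and np.all(np.abs(new_head) <= 1.0)
```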

Tree-RNN with

1. Left Transition
2. Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 73 123

Left Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 74 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: Each embedding is initialized by concatenating POS, language and morph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 75 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: Stack's top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 76 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: t-RNN calculates the new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 77 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: β-LSTM recalculates its hidden state based on the new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 78 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: Tree-stack LSTM is ready for the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 79 123

Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 80 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: Each embedding is initialized by concatenating POS, language and morph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 81 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: Stack's top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 82 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: t-RNN calculates the new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 83 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: σ-LSTM recalculates its hidden state from the new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 84 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: Tree-stack LSTM is ready for the next transition

Figure Tree-stack LSTM is ready to give new transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 85 123
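The two transitions above can be sketched as pure stack/set operations, assuming the arc-hybrid-style definitions on the preceding slides; the t-RNN head update and the LSTM recomputations shown in the figures are omitted here.

```python
def left_arc(stack, buffer, arcs, d):
    # left_d(sigma|s, b|beta, A) = (sigma, b|beta, A u {(b, d, s)}):
    # the buffer front b becomes the head of the stack top s; s is popped.
    s, b = stack[-1], buffer[0]
    return stack[:-1], buffer, arcs | {(b, d, s)}

def right_arc(stack, buffer, arcs, d):
    # right_d(sigma|s|t, beta, A) = (sigma|s, beta, A u {(s, d, t)}):
    # s, just below the top, becomes the head of the stack top t; t is popped.
    t, s = stack[-1], stack[-2]
    return stack[:-1], buffer, arcs | {(s, d, t)}

# "news had ...": attach "news" as nsubj of the upcoming head "had"
stack, buffer, arcs = left_arc(["ROOT", "news"], ["had"], set(), "nsubj")
assert stack == ["ROOT"] and arcs == {("had", "nsubj", "news")}
```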

Final overview of Tree-stack LSTM

Figure: Final overview of Tree-stack LSTM (β-LSTM, σ-LSTM, Action-LSTM and t-RNN feeding a concatenation layer and an MLP)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 86 123

Overview

1. Introduction
   Overview of Dependency Parsing
   Transition Based Dependency Parsing

2. Related Work
   Linear Models and their Drawbacks
   Neural Network Models

3. Model
   Language Model
   MLP Parser
   Tree-stack LSTM Parser

4. Results
   MLP vs Tree-stack LSTM
   Morphological Feature Embeddings
   Static vs Dynamic Oracle Training
   Transfer Learning

5. Conclusion

6. Future Work & Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 87 123

4 Results & Comparisons

Omer Kırnap (Koc University) MSc Thesis September 27 2018 88 123

Results & Comparisons

Dataset

CoNLL17:
• Dependency parsing of 81 treebanks in 49 languages
• All treebanks use standardized annotation (17 universal part-of-speech tags, 37 universal dependency relations)
• Koc-University ranked 7th out of 33 participants (1st among transition based parsers)

CoNLL18:
• Dependency parsing of 82 treebanks in 57 languages
• All treebanks use standardized annotation (17 universal part-of-speech tags, 37 universal dependency relations)
• Koc-University ranked 16th out of 30 participants (2nd among transition based parsers)

Changes from CoNLL17 to CoNLL18: 1. Train/test split change 2. Annotation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 89 123

MLP vs Tree-stack LSTM

The CoNLL 2018 committee released comparison results of CoNLL17 and CoNLL18 systems tested under the same test sets

Omer Kırnap (Koc University) MSc Thesis September 27 2018 90 123

MLP vs Tree-stack LSTM

Two possible problems of the official comparison:

1. If the annotation of the treebank has improved, the older parser is handicapped

2. If the training-test split has changed and the old training data are now in the test data, the old parser is favored undeservedly

Omer Kırnap (Koc University) MSc Thesis September 27 2018 91 123

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code        MLP    Tree-stack
ru taiga (10k)   58.89  60.55
hu szeged (20k)  66.21  68.18
tr imst (50k)    56.78  58.75
ar padt (120k)   67.83  68.14
en ewt (205k)    74.87  75.77
cs cac (473k)    83.39  83.57

Tree-stack LSTM outperforms MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 92 123

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 93 123

MLP Parser


Figure Initial model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 94 123

Only Action LSTM


Figure Only action LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 95 123

Only β-LSTM


Figure Only β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 96 123

Only σ-LSTM


Figure Only σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang Code   MLP    Only Action  Only-β  Only-σ
hu szeged   66.21  66.87        66.94   67.03
sv lines    71.12  72.05        72.17   72.45
tr imst     57.12  56.87        57.02   57.12
ar padt     67.83  66.67        66.89   66.92
cs cac      83.89  82.23        83.13   83.17
en ewt      75.54  75.43        75.56   75.67

Table: Comparison between MLP and "Only" models

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN

Figure: Tree-stack LSTM with the t-RNN highlighted

Omer Kırnap (Koc University) MSc Thesis September 27 2018 99 123

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang Code            without t-RNN  with t-RNN
no nynorsklia (3k)   51.78          53.33
ru taiga (11k)       59.13          60.55
gl treegal (15k)     69.76          70.45
hu szeged (20k)      66.12          68.18
sv lines (49k)       74.04          75.46
tr imst (50k)        58.12          58.75
ar padt (120k)       68.04          68.14
en ewt (204k)        74.87          75.77
cs cac (473k)        82.89          83.57
cs pdt (1M)          81.17          81.164

t-RNN provides a comparative advantage for low-resource languages

Omer Kırnap (Koc University) MSc Thesis September 27 2018 100 123

Ablation Analysis

Overall results of ablation analysis

Lang       MLP    Only A  Only-β  Only-σ  w/o t-RNN  all
hu szeged  66.21  66.87   66.94   67.03   66.12      68.18
sv lines   71.12  72.05   72.17   74.04   72.17      75.46
tr imst    57.12  56.87   57.02   57.12   58.12      58.75
ar padt    67.83  66.67   66.89   66.92   68.04      68.14
cs cac     83.89  82.23   83.13   83.17   82.89      83.57
en ewt     75.54  75.43   75.56   75.67   74.87      75.77

Tree-stack LSTM beats other model variations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of Ablation Experiments

t-RNN's performance contribution increases when the training size decreases

σ-LSTM provides more useful information independent of dataset size

Interconnecting the model's components with t-RNN makes tree-stack LSTM more powerful for low-resource languages (ranked 10th of all and 2nd among transition based parsers)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What does Morphological Feature Embedding provide?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental Settings

We divide the CoNLL18 UD 2.2 dataset into 4 parts based on the number of training tokens for each language, to better understand our contributions:

Languages having less than 20k tokens

Languages having more than 20k, less than 50k tokens

Languages having more than 50k, less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123
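The four buckets above correspond to a simple threshold function on the training-token count (a few treebanks in the later tables sit near the boundaries):

```python
def size_bucket(n_tokens):
    # Bucket a treebank by its number of training tokens,
    # following the four groups listed above.
    if n_tokens < 20_000:
        return "<20k"
    if n_tokens < 50_000:
        return "20k-50k"
    if n_tokens < 100_000:
        return "50k-100k"
    return ">=100k"

assert size_bucket(10_479) == "<20k"      # e.g. ru taiga
assert size_bucket(204_585) == ">=100k"   # e.g. en ewt
```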

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having less than 20k training tokens

Lang code      Morph-Feats  no Morph-Feats  # of tokens
no nynorsklia  51.13        53.33           3583
ru taiga       58.32        60.55           10479
sme giella     52.78        53.39           16385
la perseus     49.93        51.6            18184
ug udt         52.78        53.39           19262
sl sst         46.72        48.77           19473
hu szeged      66.23        68.18           20166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having tokens in between 50k and 100k

Lang code      Morph-Feats  no Morph-Feats  # of tokens
sv lines       72.18        74.81           48325
fr sequoia     84.36        82.17           50543
en gum         76.44        75.34           53686
ko gsd         73.74        72.54           56687
eu bdt         74.55        73.32           72974
nl lassysmall  76.7         75.8            75134
gl ctg         79.02        79.018          79327
lv lvtb        72.33        72.24           80666
id gsd         75.76        73.97           97531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having more than 100k training tokens

Lang code  Morph-Feats  no Morph-Feats  # of tokens
fa seraji  81.18        81.12           121064
bg btb     84.53        84.55           124336
en ewt     75.77        75.682          204585
ar padt    68.02        68.14           223881
de gsd     71.59        71.32           263804
ca ancora  85.89        85.874          417587
es ancora  84.99        84.78           444617
cs cac     83.57        83.63           472608
cs pdt     81.43        82.12           1173282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle: transitions using gold moves
Dynamic oracle: transitions using predicted moves

In both cases, the log probability of the gold moves is maximized

Figure: Tree-stack LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123
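The difference between the two training regimes can be sketched as follows: the loss always targets the gold move, but the parser state is advanced with either the gold move (static) or the model's own prediction (dynamic). This is a toy sketch over raw scores, not the thesis implementation.

```python
import math

def softmax(scores):
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    z = sum(exps)
    return [e / z for e in exps]

def training_step(scores, gold, dynamic):
    # Loss is always -log p(gold move); the oracles differ only in which
    # move is used to advance the parser state during training.
    loss = -math.log(softmax(scores)[gold])
    advance = max(range(len(scores)), key=lambda a: scores[a]) if dynamic else gold
    return loss, advance

_, move = training_step([2.0, 0.5, 0.1], gold=1, dynamic=False)
assert move == 1          # static: follow the gold move
_, move = training_step([2.0, 0.5, 0.1], gold=1, dynamic=True)
assert move == 0          # dynamic: follow the model's (wrong) prediction
```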

Static vs Dynamic Oracle Training

Figure: Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure: Results are very close for training tokens in between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure: Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning:

1. Using very limited data to train an LM for word and context vectors, and using them to train a parser from scratch
2. Using Facebook's word vectors to train a parser [Bojanowski et al 2017]
3. Using my own word and context vectors trained on a different language from the same language family
4. Applying transfer learning with a pre-trained parser

Language      (1)           (2)    (3)    (4)
af afribooms  not provided  75.46  77.43  78.12
kk ktb        20.19         22.31  21.96  23.86
bxr bdt       7.64          9.76   9.93   8.98
kmr mg        20.12         22.57  22.78  23.39

Table: LAS values for strategies (1), (2), (3) and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the most beneficial

From-scratch LM training does not bring useful word and context vectors

Our word and context vectors are still more useful than Facebook's [Bojanowski et al 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition based parsers can only build projective trees [6]

[6] Figure from https://stp.lingfil.uu.se/~sara/kurser/5LN455-2014/lectures/5LN455-F8.pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123
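Projectivity as stated above can be checked directly: a tree is projective iff no two dependency arcs cross. A minimal check over head indices (1-based tokens, 0 = root):

```python
def is_projective(heads):
    # heads[i-1] is the head of token i; an arc spans (min(h, i), max(h, i)).
    # The tree is non-projective iff two arc spans partially overlap.
    arcs = [(min(h, d), max(h, d)) for d, h in enumerate(heads, start=1)]
    for a1, b1 in arcs:
        for a2, b2 in arcs:
            if a1 < a2 < b1 < b2:   # crossing arcs
                return False
    return True

assert is_projective([2, 0, 2, 3, 2])       # a projective 5-token tree
assert not is_projective([2, 0, 5, 2, 2])   # arcs (2,4) and (3,5) cross
```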

Projective vs Non-projective

We compared our model with the best model for different projectivity ratios

Language     Projectivity  Best (LAS)  Our (LAS)
grc perseus  90.7          79.39       55.03 (20)
eu bdt       95.13         84.22       74.13 (17)
hu szeged    97.8          82.66       68.18 (14)
da ddt       98.26         86.28       76.40 (17)
en gum       99.6          85.05       76.44 (15)
gl treegal   100           74.25       70.45 (10)
gl ctg       100           82.12       79.45 (14)

Table: Our model's performance gap decreases as the projectivity ratio increases [7]

[7] From the official results page and our projectivity table

Omer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusion:

We introduced "Context, Word and Morph-feat" embeddings and showed their contribution in transition based dependency parsing

Our Tree-stack LSTM outperformed MLP by removing hand-crafted feature engineering

Tree-stack LSTM performed better with low-resource languages

When the training dataset size increases, tree-stack LSTM loses its advantage

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization, morphological tagging and dependency parsing performed better. Some also jointly train a language model together with pre-trained embeddings.

Attention Mechanism

Applying attention between σ-LSTM, β-LSTM or Action-LSTM states may bring a performance improvement.

Morphological Features

Finding different ways to represent morphological features.

Dynamic Oracle vs Beam Training

Although I tried both of them, I could not obtain a performance improvement. There may be convergence problems with our loss function, and other losses (e.g. CRF) may solve this problem.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap, Erenay Dayanık and Deniz Yuret. 2018. Tree-stack LSTM in Transition Based Dependency Parsing. In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap, Berkay Furkan Onder and Deniz Yuret. 2017. Parsing with Context Embeddings. In Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann, Carlos Gomez-Rodriguez and Giorgio Satta. 2011. Dynamic programming algorithms for transition-based dependency parsers. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies - Volume 1, Association for Computational Linguistics, pages 673-682.

S. Kübler, R. McDonald and J. Nivre. 2009. Dependency Parsing. Morgan & Claypool, US.

Chris Dyer, Miguel Ballesteros, Wang Ling, Austin Matthews and Noah A. Smith. 2015. Transition-based dependency parsing with stack long short-term memory. CoRR abs/1505.08075.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123

  • Introduction
    • Overview of Dependency Parsing
    • Transition Based Dependency Parsing
      • Related Work
        • Linear Models and their Drawbacks
        • Neural Network Models
          • Model
            • Language Model
            • MLP Parser
            • Tree-stack LSTM Parser
              • Results
                • MLP vs Tree-stack LSTM
                • Morphological Feature Embeddings
                • Static vs Dynamic Oracle Training
                • Transfer Learning
                  • Conclusion
                  • Future Work amp Discussions
Page 50: Transition Based Dependency Parsing with Deep LearningTransition Based Dependency Parsing with Deep Learning Omer K rnap Ko˘c University okirnap@ku.edu.tr September 27, 2018 Omer

Context and Word embeddings

Feats Hungarian En-ParTUT Latvianp 636 766 559

v 735 759 63

c 722 76 635

v-c 76 79 676

p-c 78 825 706

p-v 766 808 677

p-fb 747 797 663

p-v-c 793 832 742

Both POS tags and context vectors have significantcontributions on top of word vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 50 123

Issues with MLP

However

Choosing correct state of parser still remains critical

We are unable to represent whole parsing history with featureextracting

Omer Kırnap (Koc University) MSc Thesis September 27 2018 51 123

Solution

Find a recurrent architecture such that it can summarize the parsinghistory as well as word sequences in a buffer and stack

Omer Kırnap (Koc University) MSc Thesis September 27 2018 52 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to UniversalDependencies

CoNLL17

bull Koc-University team with MLP Parser using Context Embeddings

CoNLL18

bull KParse team with Tree-stack LSTM Parser usingContext and Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 53 123

c Tree-stack LSTM Parser (CoNLL18)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 54 123

Related Work - Stack LSTM

Figure Stack LSTM [Dyer et al 2015]

Represent each component (σ β A) with an LSTMModifying head wordrsquos embedding with dependent embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 55 123

Problems with Stack LSTM

They only modify stackrsquos word embeddings

Hidden states of LSTMS are not updated unless reduce

Actions are not explicitly represented

They only used word2vec embeddings [Mikolov et al 2013]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 56 123

Our solution

We propose

Context embeddings should improve parsing accuracy

Dependency relations should be explicitly represented

Morphological Features of a word may enhance parsing accuracy

Omer Kırnap (Koc University) MSc Thesis September 27 2018 57 123

Tree-stack LSTM Overview

[Figure: Tree-stack LSTM architecture: hidden states of the σ-LSTM, β-LSTM, and Action-LSTM, together with t-RNN head-word updates, are concatenated and fed to an MLP]

We propose Tree-stack LSTM model with 4 components

β-LSTM
σ-LSTM
Action-LSTM
Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 58 123

Tree-stack LSTM

Input Representation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 59 123

Input Representation

Action and Dependency Relation Embeddings

Every action is represented with continuous vector

Every dependency relation is represented with continuous vector

Omer Kırnap (Koc University) MSc Thesis September 27 2018 60 123

Input Representation

We do not include an explicit feature extractor. We initiate the word representation by concatenating:

Character Based LSTMrsquos word vectors

Word Based BiLSTMrsquos context vectors

Part-of-speech (POS) vectors

Morph-feat vectors
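The concatenation above can be sketched as follows; the Python code and all dimensions are illustrative assumptions, not the thesis implementation:

```python
import numpy as np

# Hypothetical sketch: a word's input vector is the concatenation of the
# four embeddings listed on this slide. Dimensions are illustrative only.
def word_representation(char_vec, context_vec, pos_vec, feat_vec):
    # char_vec:    character-based LSTM word vector
    # context_vec: word-based BiLSTM context vector
    # pos_vec:     part-of-speech embedding
    # feat_vec:    morphological-feature embedding
    return np.concatenate([char_vec, context_vec, pos_vec, feat_vec])

x = word_representation(np.zeros(100), np.zeros(200), np.zeros(16), np.zeros(32))
assert x.shape == (348,)
```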

Omer Kırnap (Koc University) MSc Thesis September 27 2018 61 123

Input Representation

Morph-feat Vectors

"It": Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs

Figure Morph-feat Embeddings
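One plausible way to turn such a feature string into a single vector is to sum one embedding per key=value pair; this is only a sketch with random vectors, and the thesis may combine the pairs differently:

```python
import numpy as np

# Illustrative sketch (not the thesis implementation): embed a UD FEATS
# string by summing one vector per key=value pair, growing the table on demand.
rng = np.random.default_rng(0)
DIM = 32
table = {}  # key=value -> embedding

def morph_feat_vector(feats):
    vec = np.zeros(DIM)
    for pair in feats.split("|"):
        if pair not in table:
            table[pair] = rng.normal(size=DIM)
        vec += table[pair]
    return vec

v = morph_feat_vector("Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs")
assert v.shape == (DIM,) and len(table) == 5
```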

Omer Kırnap (Koc University) MSc Thesis September 27 2018 62 123

Tree-stack LSTM

Model Components
1 β-LSTM
2 σ-LSTM
3 Action-LSTM
4 Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 63 123

β-LSTM

[Figure: Tree-stack LSTM architecture, with the β-LSTM component highlighted]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 64 123

β-LSTM

Figure: Buffer's β-LSTM running over the upcoming words w_i, w_i+1, w_i+2

Omer Kırnap (Koc University) MSc Thesis September 27 2018 65 123

σ-LSTM

[Figure: Tree-stack LSTM architecture, with the σ-LSTM component highlighted]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 66 123

σ-LSTM

Figure: Stack's σ-LSTM running over the stack items s_i, s_i+1, s_i+2

Omer Kırnap (Koc University) MSc Thesis September 27 2018 67 123

Action-LSTM

[Figure: Tree-stack LSTM architecture, with the Action-LSTM component highlighted]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 68 123

Action-LSTM

Figure: Action-LSTM running over the sequence of past transitions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 69 123

How are the components of the tree-stack LSTM connected?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 70 123

Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 71 123

Tree-RNN (t-RNN)

[Figure: t-RNN combining the head word, dependency relation, and dependent word embeddings]

w_head_new = tanh(W_rnn * [w_head_old; d_l; w_dep] + b_rnn)    (1)
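Eq. (1) can be transcribed directly; the dimensions and random parameters below are illustrative assumptions:

```python
import numpy as np

# Direct transcription of Eq. (1): the new head embedding is a tanh of an
# affine map over [head; relation; dependent]. Sizes are assumptions.
D = 64            # word-vector size (assumed)
R = 16            # dependency-relation embedding size (assumed)
rng = np.random.default_rng(0)
W_rnn = rng.normal(size=(D, 2 * D + R))
b_rnn = np.zeros(D)

def trnn(w_head_old, d_l, w_dep):
    x = np.concatenate([w_head_old, d_l, w_dep])
    return np.tanh(W_rnn @ x + b_rnn)

w_new = trnn(rng.normal(size=D), rng.normal(size=R), rng.normal(size=D))
assert w_new.shape == (D,) and np.all(np.abs(w_new) <= 1.0)
```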

Omer Kırnap (Koc University) MSc Thesis September 27 2018 72 123

Tree-RNN with

1 Left Transition2 Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 73 123

Left Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 74 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: Each embedding is initiated by concatenating POS, language, and morph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 75 123
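The left-transition rule can be sketched as a state update; the (stack, buffer, arc-set) representation and the Python code are illustrative assumptions, not the thesis implementation:

```python
# Sketch of left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)}):
# the top of the stack s becomes a d-dependent of the buffer front b.
def left(stack, buffer, arcs, d):
    s = stack.pop()          # s leaves the stack...
    b = buffer[0]            # ...and attaches to the buffer front b
    arcs.add((b, d, s))      # arc (head=b, label=d, dependent=s)
    return stack, buffer, arcs

stack, buffer, arcs = [0, 1], [2, 3], set()
left(stack, buffer, arcs, "nsubj")
assert stack == [0] and arcs == {(2, "nsubj", 1)}
```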

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: Stack's top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 76 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: t-RNN calculates the new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 77 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: β-LSTM recalculates its hidden state based on the new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 78 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: Tree-stack LSTM is ready to predict the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 79 123

Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 80 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: Each embedding is initiated by concatenating POS, language, and morph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 81 123
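The right-transition rule can be sketched the same way; the state representation is an illustrative assumption:

```python
# Sketch of right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)}):
# the stack top t becomes a d-dependent of the element s below it.
def right(stack, buffer, arcs, d):
    t = stack.pop()
    s = stack[-1]
    arcs.add((s, d, t))      # arc (head=s, label=d, dependent=t)
    return stack, buffer, arcs

stack, buffer, arcs = [0, 1, 2], [3], set()
right(stack, buffer, arcs, "obj")
assert stack == [0, 1] and arcs == {(1, "obj", 2)}
```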

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: Stack's top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 82 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: t-RNN calculates the new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 83 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: σ-LSTM recalculates its hidden state from the new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 84 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: Tree-stack LSTM is ready to predict the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 85 123

Final overview of Tree-stack LSTM

[Figure: Tree-stack LSTM architecture: hidden states of the σ-LSTM, β-LSTM, and Action-LSTM, together with t-RNN head-word updates, are concatenated and fed to an MLP]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 86 123

Overview

1 IntroductionOverview of Dependency ParsingTransition Based Dependency Parsing

2 Related WorkLinear Models and their DrawbacksNeural Network Models

3 ModelLanguage ModelMLP ParserTree-stack LSTM Parser

4 ResultsMLP vs Tree-stack LSTMMorphological Feature EmbeddingsStatic vs Dynamic Oracle TrainingTransfer Learning

5 Conclusion6 Future Work amp Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 87 123

4 Results amp Comparisons

Omer Kırnap (Koc University) MSc Thesis September 27 2018 88 123

Results amp Comparisons

Dataset

CoNLL17: Dependency parsing of 81 treebanks in 49 languages. All treebanks use standardized annotation: 17 universal part-of-speech tags and 37 universal dependency relations. Koc-University ranked 7th out of 33 participants (1st among transition based parsers).

CoNLL18: Dependency parsing of 82 treebanks in 57 languages. All treebanks use standardized annotation: 17 universal part-of-speech tags and 37 universal dependency relations. Koc-University ranked 16th out of 30 participants (2nd among transition based parsers).

Changes from CoNLL17 to CoNLL18: 1. Train/test split change 2. Annotation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 89 123

MLP vs Tree-stack LSTM

The CoNLL 2018 committee released comparison results of CoNLL17 and CoNLL18 systems tested on the same test sets

Omer Kırnap (Koc University) MSc Thesis September 27 2018 90 123

MLP vs Tree-stack LSTM

Two possible problems with the official comparison:

1 If the annotation of the treebank has improved, the older parser is handicapped

2 If the training-test split has changed and the old training data are now in the test data, the old parser is favored undeservedly

Omer Kırnap (Koc University) MSc Thesis September 27 2018 91 123

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code        MLP    Tree-stack
ru taiga (10k)   58.89  60.55
hu szeged (20k)  66.21  68.18
tr imst (50k)    56.78  58.75
ar padt (120k)   67.83  68.14
en ewt (205k)    74.87  75.77
cs cac (473k)    83.39  83.57

Tree-stack LSTM outperforms MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 92 123

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 93 123

MLP Parser

MLP

Figure Initial model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 94 123

Only Action LSTM

LSTM LSTM

Figure Only action LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 95 123

Only β-LSTM

LSTM LSTM LSTM

LSTM LSTM MLP

Figure Only β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 96 123

Only σ-LSTM

LSTM LSTM

LSTM LSTM MLP

Figure Only σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang Code   MLP    Only Action  Only-β  Only-σ
hu szeged   66.21  66.87        66.94   67.03
sv lines    71.12  72.05        72.17   72.45
tr imst     57.12  56.87        57.02   57.12
ar padt     67.83  66.67        66.89   66.92
cs cac      83.89  82.23        83.13   83.17
en ewt      75.54  75.43        75.56   75.67

Table: Comparison between MLP and "Only" models

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN

[Figure: Tree-stack LSTM architecture, with the t-RNN component highlighted]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 99 123

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang Code           without t-RNN  with t-RNN
no nynorsklia (3k)  51.78          53.33
ru taiga (11k)      59.13          60.55
gl treegal (15k)    69.76          70.45
hu szeged (20k)     66.12          68.18
sv lines (49k)      74.04          75.46
tr imst (50k)       58.12          58.75
ar padt (120k)      68.04          68.14
en ewt (204k)       74.87          75.77
cs cac (473k)       82.89          83.57
cs pdt (1M)         81.17          81.16

t-RNN provides a comparative advantage for low-resource languages

Omer Kırnap (Koc University) MSc Thesis September 27 2018 100 123

Ablation Analysis

Overall results of ablation analysis

Lang       MLP    Only A  Only-β  Only-σ  w/o t-RNN  all
hu szeged  66.21  66.87   66.94   67.03   66.12      68.18
sv lines   71.12  72.05   72.17   74.04   72.17      75.46
tr imst    57.12  56.87   57.02   57.12   58.12      58.75
ar padt    67.83  66.67   66.89   66.92   68.04      68.14
cs cac     83.89  82.23   83.13   83.17   82.89      83.57
en ewt     75.54  75.43   75.56   75.67   74.87      75.77

Tree-stack LSTM beats other model variations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of Ablation Experiments

t-RNN's performance contribution increases when the training size decreases

σ-LSTM provides more useful information independent of dataset size

Interconnecting the model's components with t-RNN makes tree-stack LSTM more powerful for low-resource languages (ranked 10th overall and 2nd among transition based parsers)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What does Morphological Feature Embedding provide

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental Settings: We divide the CoNLL18 UD dataset into 4 parts based on the number of training tokens for each language, to better understand our contributions

Languages having less than 20k tokens

Languages having more than 20k but less than 50k tokens

Languages having more than 50k but less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having less than 20k training tokens

Lang code      Morph-Feats  no Morph-Feats  # of tokens
no nynorsklia  51.13        53.33             3583
ru taiga       58.32        60.55            10479
sme giella     52.78        53.39            16385
la perseus     49.93        51.6             18184
ug udt         52.78        53.39            19262
sl sst         46.72        48.77            19473
hu szeged      66.23        68.18            20166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having between 50k and 100k tokens

Lang code    Morph-Feats  no Morph-Feats  # of tokens
sv lines     72.18        74.81            48325
fr sequoia   84.36        82.17            50543
en gum       76.44        75.34            53686
ko gsd       73.74        72.54            56687
eu bdt       74.55        73.32            72974
nl lassymal  76.7         75.8             75134
gl ctg       79.02        79.018           79327
lv lvtb      72.33        72.24            80666
id gsd       75.76        73.97            97531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having more than 100k training tokens

Lang code  Morph-Feats  no Morph-Feats  # of tokens
fa seraji  81.18        81.12            121064
bg btb     84.53        84.55            124336
en ewt     75.77        75.682           204585
ar padt    68.02        68.14            223881
de gsd     71.59        71.32            263804
ca ancora  85.89        85.874           417587
es ancora  84.99        84.78            444617
cs cac     83.57        83.63            472608
cs pdt     81.43        82.12           1173282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle: transitions follow the gold moves. Dynamic oracle: transitions follow the predicted moves.

In both cases, the log-probability of the gold moves is maximized
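The distinction can be sketched as a single training step: the loss always targets the gold move, but the oracle type decides which move the parser actually executes next. The score function and all names here are illustrative stand-ins for the actual network:

```python
import numpy as np

rng = np.random.default_rng(0)

def scores(state):                 # stand-in for the parser network
    return rng.normal(size=3)      # one score per candidate transition

def log_softmax(z):
    z = z - z.max()
    return z - np.log(np.exp(z).sum())

def training_move(state, gold_move, dynamic):
    logp = log_softmax(scores(state))
    loss = -logp[gold_move]        # both regimes maximize log p(gold move)
    # static oracle: advance with the gold move; dynamic: with the model's choice
    next_move = int(np.argmax(logp)) if dynamic else gold_move
    return loss, next_move

loss, mv = training_move(None, gold_move=1, dynamic=False)
assert mv == 1 and loss >= 0
```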

[Figure: Tree-stack LSTM architecture: hidden states of the σ-LSTM, β-LSTM, and Action-LSTM, together with t-RNN head-word updates, are concatenated and fed to an MLP]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning:
1 Using very limited data to train an LM for word and context vectors, and using them to train a parser from scratch
2 Using Facebook's word vectors to train a parser [Bojanowski et al., 2017]
3 Using my own word and context vectors trained on a different language from the same language family
4 Applying transfer learning with a pre-trained parser

Language      (1)           (2)    (3)    (4)
af afribooms  not provided  75.46  77.43  78.12
kk ktb        20.19         22.31  21.96  23.86
bxr bdt        7.64          9.76   9.93   8.98
kmr mg        20.12         22.57  22.78  23.39

Table: LAS values for strategies (1), (2), (3), and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the mostbeneficial

Training the LM from scratch does not yield useful word and context vectors

Our word and context vectors are still more useful than Facebook's [Bojanowski et al., 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition based parsers can only build projective trees

Figure from https://stp.lingfil.uu.se/~sara/kurser/5LN455-2014/lectures/5LN455-F8.pdf
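A projectivity check can be sketched as a crossing-arcs test over token positions; the (head, dependent) arc format is an assumption for illustration:

```python
# A dependency tree is projective iff no two arcs cross when drawn above
# the sentence. Sketch: compare the position spans of every pair of arcs.
def is_projective(arcs):
    spans = [(min(h, d), max(h, d)) for h, d in arcs]
    for i, (l1, r1) in enumerate(spans):
        for l2, r2 in spans[i + 1:]:
            if l1 < l2 < r1 < r2 or l2 < l1 < r2 < r1:
                return False     # the two arcs cross
    return True

assert is_projective([(2, 1), (0, 2), (2, 3)])   # nested arcs: projective
assert not is_projective([(1, 3), (2, 4)])       # crossing arcs
```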

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123

Projective vs Non-projective

We compared our model with the best model for different projectivity ratios

Language     Projectivity  Best (LAS)  Our (LAS)
grc perseus   90.7          79.39       55.03 (20)
eu bdt        95.13         84.22       74.13 (17)
hu szeged     97.8          82.66       68.18 (14)
da ddt        98.26         86.28       76.40 (17)
en gum        99.6          85.05       76.44 (15)
gl treegal   100            74.25       70.45 (10)
gl ctg       100            82.12       79.45 (14)

Table: Our model's performance gap decreases as the projectivity ratio increases

From the official results page and our projectivity table

Omer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusion: We introduced "Context, Word, and Morph-feat" embeddings and showed their contribution to transition based dependency parsing

Our Tree-stack LSTM outperformed the MLP while removing hand-crafted feature engineering

Tree-stack LSTM performed better on low-resource languages

When the training dataset size increases, tree-stack LSTM loses its advantage

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization, morphological tagging, and dependency parsing performed better. Some also jointly train a language model together with pre-trained embeddings.

Attention Mechanism

Applying attention between σ-LSTM, β-LSTM, or Action-LSTM states may bring a performance improvement

Morphological Features

Finding a different way to represent morphological features

Dynamic Oracle vs Beam Training

Although I tried both of them, I could not obtain a performance improvement. There may be convergence problems with our loss function, and other losses (e.g., CRF) may solve this problem.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap, Erenay Dayanık, and Deniz Yuret. 2018. Tree-stack LSTM in Transition Based Dependency Parsing. In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap, Berkay Furkan Onder, and Deniz Yuret. 2017. Parsing with Context Embeddings. In Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann, Carlos Gomez-Rodriguez, and Giorgio Satta. 2011. Dynamic programming algorithms for transition-based dependency parsers. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Volume 1. Association for Computational Linguistics, pages 673-682.

S. Kübler, R. McDonald, and J. Nivre. 2009. Dependency Parsing. Morgan & Claypool, US.

Chris Dyer, Miguel Ballesteros, Wang Ling, Austin Matthews, and Noah A. Smith. 2015. Transition-based dependency parsing with stack long short-term memory. CoRR abs/1505.08075.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123

  • Introduction
    • Overview of Dependency Parsing
    • Transition Based Dependency Parsing
      • Related Work
        • Linear Models and their Drawbacks
        • Neural Network Models
          • Model
            • Language Model
            • MLP Parser
            • Tree-stack LSTM Parser
              • Results
                • MLP vs Tree-stack LSTM
                • Morphological Feature Embeddings
                • Static vs Dynamic Oracle Training
                • Transfer Learning
                  • Conclusion
                  • Future Work amp Discussions
Page 51: Transition Based Dependency Parsing with Deep LearningTransition Based Dependency Parsing with Deep Learning Omer K rnap Ko˘c University okirnap@ku.edu.tr September 27, 2018 Omer

Issues with MLP

However

Choosing correct state of parser still remains critical

We are unable to represent whole parsing history with featureextracting

Omer Kırnap (Koc University) MSc Thesis September 27 2018 51 123

Solution

Find a recurrent architecture such that it can summarize the parsinghistory as well as word sequences in a buffer and stack

Omer Kırnap (Koc University) MSc Thesis September 27 2018 52 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to UniversalDependencies

CoNLL17

bull Koc-University team with MLP Parser using Context Embeddings

CoNLL18

bull KParse team with Tree-stack LSTM Parser usingContext and Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 53 123

c Tree-stack LSTM Parser (CoNLL18)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 54 123

Related Work - Stack LSTM

Figure Stack LSTM [Dyer et al 2015]

Represent each component (σ β A) with an LSTMModifying head wordrsquos embedding with dependent embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 55 123

Problems with Stack LSTM

They only modify stackrsquos word embeddings

Hidden states of LSTMS are not updated unless reduce

Actions are not explicitly represented

They only used word2vec embeddings [Mikolov et al 2013]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 56 123

Our solution

We propose

Context embeddings should improve parsing accuracy

Dependency relations should be explicitly represented

Morphological Features of a word may enhance parsing accuracy

Omer Kırnap (Koc University) MSc Thesis September 27 2018 57 123

Tree-stack LSTM Overview

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

We propose Tree-stack LSTM model with 4 components

β-LSTMσ-LSTMAction-LSTMTree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 58 123

Tree-stack LSTM

Input Representation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 59 123

Input Representation

Action and Dependency Relation Embeddings

Every action is represented with continuous vector

Every dependency relation is represented with continuous vector

Omer Kırnap (Koc University) MSc Thesis September 27 2018 60 123

Input Representation

We do not include explicit feature extractor We initiated wordrepresentation by concatenating

Character Based LSTMrsquos word vectors

Word Based BiLSTMrsquos context vectors

Part-of-speech (POS) vectors

Morph-feat vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 61 123

Input Representation

Morp-feat Vectors

Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs IT It

Figure Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 62 123

Tree-stack LSTM

Model Components1 β-LSTM2 σ-LSTM3 Action-LSTM4 Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 63 123

β-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 64 123

β-LSTM

LSTM LSTM LSTM

wi+2wi+1wi

Figure Bufferrsquos β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 65 123

σ-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 66 123

σ-LSTM

LSTM LSTM LSTM

si si+1 si+2

Figure Stackrsquos σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 67 123

Action-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 68 123

Action-LSTM

LSTM LSTM LSTM

Figure Action-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 69 123

How do components of tree-stack LSTM are connected

Omer Kırnap (Koc University) MSc Thesis September 27 2018 70 123

Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 71 123

Tree-RNN (t-RNN)

t-RNN

Dependent word

Dependency Relation

Head word

Figure t-RNN

whead new = tanh(Wrnn lowast [whead old dl wdep] + brnn) (1)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 72 123

Tree-RNN with

1 Left Transition2 Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 73 123

Left Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 74 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM LSTM

Left transition

t-RNN

Dependency Relation

HeadDependent

Figure Each embedding initiated by concatenating POS language andmorph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 75 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

Figure Stackrsquos top LSTM is reducedOmer Kırnap (Koc University) MSc Thesis September 27 2018 76 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 77 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure β-LSTM recalculates its hidden based on new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 78 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure Tree-stack LSTM is ready to give new transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 79 123

Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 80 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure Each embedding initiated by concatenating POS language andmorph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 81 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure Stackrsquos top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 82 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 83 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure σ-LSTM recalculates its hidden from new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 84 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure Tree-stack LSTM is ready to give new transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 85 123

Final overview of Tree-stack LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 86 123

Overview

1 IntroductionOverview of Dependency ParsingTransition Based Dependency Parsing

2 Related WorkLinear Models and their DrawbacksNeural Network Models

3 ModelLanguage ModelMLP ParserTree-stack LSTM Parser

4 ResultsMLP vs Tree-stack LSTMMorphological Feature EmbeddingsStatic vs Dynamic Oracle TrainingTransfer Learning

5 Conclusion6 Future Work amp Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 87 123

4 Results amp Comparisons

Omer Kırnap (Koc University) MSc Thesis September 27 2018 88 123

Results amp Comparisons

Dataset

Dependency parsing of 81

treebanks in 49 languages

All treebanks use standardized

annotation

17 universal

part-of-speech tags

37 universal dependency

relations

Koc-University ranked 7th out

of 33 participants (1st among

transition based parsers)

Dependency parsing of 82

treebanks in 57 languages

All treebanks use

standardized annotation

17 universal

part-of-speech tags

37 universal dependency

relations

Koc-University ranked 16th

out of 30 participants (2nd

among transition based

parsers)

CoNLL17 CoNLL181 Traintest split change 2 Annotation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 89 123

MLP vs Tree-stack LSTM

CoNLL 2018 committee released comparison results of CoNLL17 andCoNLL18 systems tested under the same test sets

Omer Kırnap (Koc University) MSc Thesis September 27 2018 90 123

MLP vs Tree-stack LSTM

2 possible problems of official comparison

1 If the annotation of the tree bank is improved the older parser ishandicapped

2 If the training-test split has changed and the old training data arenow in test data the old parser is favored undeservedly

Omer Kırnap (Koc University) MSc Thesis September 27 2018 91 123

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code MLP Tree-stackru taiga (10k) 5889 6055hu szeged (20k) 6621 6818tr imst (50k) 5678 5875ar padt (120k) 6783 6814en ewt (205k) 7487 7577cs cac (473k) 8339 8357

Tree-stack LSTM outperforms MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 92 123

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 93 123

MLP Parser

MLP

Figure Initial model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 94 123

Only Action LSTM

LSTM LSTM

Figure Only action LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 95 123

Only β-LSTM

LSTM LSTM LSTM

LSTM LSTM MLP

Figure Only β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 96 123

Only σ-LSTM

LSTM LSTM

LSTM LSTM MLP

Figure Only σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang Code MLP Only Action Only-β Only-σhu szeged 6621 6687 6694 6703sv lines 7112 7205 7217 7245tr imst 5712 5687 5702 5712ar padt 6783 6667 6689 6692

cs cac 8389 8223 8313 8317

en ewt 7554 7543 7556 7567

Table Comparison between MLP and rdquoOnlyrdquo models

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 99 123

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang Code without t-RNN with t-RNNno nynorsklia (3k) 5178 5333ru taiga (11k) 5913 6055gl treegal (15k) 6976 7045hu szeged (20k) 6612 6818sv lines (49k) 7404 7546tr imst (50k) 5812 5875

ar padt (120k) 6804 6814

en ewt (204k) 7487 7577

cs cac (473k) 8289 8357

cs pdt (1M) 8117 81164

t-RNN provides comparative advantage for low-resourcelanguages

Omer Kırnap (Koc University) MSc Thesis September 27 2018 100 123

Ablation Analysis

Overall results of ablation analysis

Lang MLP Only A Only-β Only-σ wot-RNN allhu szeged 6621 6687 6694 6703 6612 6818sv lines 7112 7205 7217 7404 7217 7546tr imst 5712 5687 5702 5712 5812 5875ar padt 6783 6667 6689 6692 6804 6814cs cac 8389 8223 8313 8317 8289 8357en ewt 7554 7543 7556 7567 7487 7577

Tree-stack LSTM beats other model variations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of Ablation Experiments

t-RNNrsquos performance contribution increases when the training sizedecreases

σ-LSTM provides more useful information independent from datasetsize

Interconnecting modelrsquos component with t-RNN makes tree-stackLSTM more powerful for low-resource languages (ranked 10th of alland 2nd among transition based parsers)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What does Morphological Feature Embedding provide

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental SettingsWe divide Conll18 UD dataset 22 into 4 parts based on number oftraining tokens for each language to better understand our contributions

Languages having less than 20k tokens

Languages having more than 20k less than 50k tokens

Languages having more than 50k less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having less than 20k training tokens

Lang code Morph-Feats no Morph-Feats of tokensno nynorsklia 5113 5333 3583

ru taiga 5832 6055 10479

sme giella 5278 5339 16385

la perseus 4993 516 18184

ug udt 5278 5339 19262

sl sst 4672 4877 19473

hu szeged 6623 6818 20166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having tokens in between 50k and100k

Lang code       Morph-Feats  no Morph-Feats  # of tokens
sv_lines        72.18        74.81            48325
fr_sequoia      84.36        82.17            50543
en_gum          76.44        75.34            53686
ko_gsd          73.74        72.54            56687
eu_bdt          74.55        73.32            72974
nl_lassysmall   76.7         75.8             75134
gl_ctg          79.02        79.018           79327
lv_lvtb         72.33        72.24            80666
id_gsd          75.76        73.97            97531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having more than 100k training tokens

Lang code   Morph-Feats  no Morph-Feats  # of tokens
fa_seraji   81.18        81.12             121064
bg_btb      84.53        84.55             124336
en_ewt      75.77        75.682            204585
ar_padt     68.02        68.14             223881
de_gsd      71.59        71.32             263804
ca_ancora   85.89        85.874            417587
es_ancora   84.99        84.78             444617
cs_cac      83.57        83.63             472608
cs_pdt      81.43        82.12            1173282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle: transitions using gold moves
Dynamic oracle: transitions using predicted moves

In both cases, the log probability (log p) of the gold moves is maximized

[Figure: Tree-stack LSTM architecture diagram]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123
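The two training regimes above can be sketched in a few lines. Everything here (the toy state, the scorer, and the oracle) is illustrative, not the thesis implementation: a static oracle always follows gold transitions, while a dynamic oracle sometimes follows the model's own predictions, but the loss always maximizes log p of a gold move.

```python
import random
random.seed(0)

# Toy setup: a "state" is just how many transitions were taken, and a
# parse ends after N_MOVES moves; all interfaces here are illustrative.
N_MOVES = 5

def gold_oracle(state):
    return ["shift"]                       # pretend the gold move is always shift

def model_scores(state):
    return {"shift": -0.1, "left": -2.0, "right": -3.0}   # log-probabilities

def run_episode(dynamic, explore=0.5):
    state, loss = 0, 0.0
    while state < N_MOVES:
        scores = model_scores(state)
        gold = gold_oracle(state)
        loss -= max(scores[m] for m in gold)    # maximize log p of gold moves
        if dynamic and random.random() < explore:
            move = max(scores, key=scores.get)  # dynamic: follow the prediction
        else:
            move = gold[0]                      # static: follow the gold move
        state += 1                              # "apply" the chosen move
    return loss

print(round(run_episode(dynamic=False), 2))  # 0.5
```

In this toy the loss is identical either way; the regimes differ only in which states the parser visits during training, which is exactly what lets a dynamic oracle teach the model to recover from its own mistakes.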

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning:
1 Using very limited data to train an LM for word and context vectors, and using them to train a parser from scratch
2 Using Facebook's word vectors to train a parser [Bojanowski et al. 2017]
3 Using my own word and context vectors trained on a different language from the same language family
4 Applying transfer learning with a pre-trained parser

Language       (1)            (2)     (3)     (4)
af_afribooms   not provided   75.46   77.43   78.12
kk_ktb         20.19          22.31   21.96   23.86
bxr_bdt         7.64           9.76    9.93    8.98
kmr_mg         20.12          22.57   22.78   23.39

Table LAS values for strategies (1) (2) (3) and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the most beneficial

From-scratch LM training does not yield useful word and context vectors

Our word and context vectors are still more useful than Facebook's [Bojanowski et al. 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition Based Parser can only build projective trees 6

6Figure fromhttpstplingfiluuse sarakurser5LN455-2014lectures5LN455-F8pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123
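Projectivity can be checked by testing whether any two dependency arcs cross. A minimal sketch (a hypothetical helper, not the thesis code), where heads[i-1] is the head of token i and 0 is the artificial root:

```python
def is_projective(heads):
    """True if no two dependency arcs cross (tokens numbered 1..n, head 0 = root)."""
    arcs = [(min(h, d), max(h, d)) for d, h in enumerate(heads, start=1)]
    for l1, r1 in arcs:
        for l2, r2 in arcs:
            # Two arcs cross when exactly one endpoint of the second
            # lies strictly inside the span of the first.
            if l1 < l2 < r1 < r2:
                return False
    return True

print(is_projective([2, 3, 0, 5, 3]))  # True: a projective tree
print(is_projective([3, 4, 0, 3]))     # False: arcs (1,3) and (2,4) cross
```

A transition based parser of the kind described here can only produce trees for which this check succeeds.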

Projective vs Non-projective

We compared our model with the best model for different projectivity ratios

Language      Projectivity  Best (LAS)  Our (LAS)
grc_perseus    90.7         79.39       55.03 (20)
eu_bdt         95.13        84.22       74.13 (17)
hu_szeged      97.8         82.66       68.18 (14)
da_ddt         98.26        86.28       76.40 (17)
en_gum         99.6         85.05       76.44 (15)
gl_treegal    100           74.25       70.45 (10)
gl_ctg        100           82.12       79.45 (14)

Table: Our model's performance gap decreases as the projectivity ratio increases

7 From the official results page and our projectivity table.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusion: We introduced "Context, Word and Morph-feat" embeddings and showed their contribution in transition based dependency parsing

Our Tree-stack LSTM outperformed MLP by removing hand-crafted feature engineering

Tree-stack LSTM performed better with low resource languages

When the training dataset size increases, tree-stack LSTM loses its advantage

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization, morphological tagging and dependency parsing performed better. Some also jointly train a language model together with pre-trained embeddings.

Attention Mechanism

Applying attention between σ-LSTM, β-LSTM, or Action-LSTM states may bring performance improvements

Morphological Features

Finding a different way to represent morphological features

Dynamic Oracle vs Beam Training

Although I tried both of them, I could not obtain a performance improvement. There may be convergence problems with our loss function, and other losses (e.g. CRF) may solve this problem.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap, Erenay Dayanık and Deniz Yuret. 2018. Tree-stack LSTM in Transition Based Dependency Parsing. In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap, Berkay Furkan Onder and Deniz Yuret. 2017. Parsing with Context Embeddings. In Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann, Carlos Gómez-Rodríguez and Giorgio Satta. 2011. Dynamic programming algorithms for transition-based dependency parsers. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Volume 1. Association for Computational Linguistics, pages 673-682.

S. Kübler, R. McDonald and J. Nivre. 2009. Dependency Parsing. Morgan & Claypool, US.

Chris Dyer, Miguel Ballesteros, Wang Ling, Austin Matthews and Noah A. Smith. 2015. Transition-based dependency parsing with stack long short-term memory. CoRR abs/1505.08075.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123

• Introduction
  • Overview of Dependency Parsing
  • Transition Based Dependency Parsing
• Related Work
  • Linear Models and their Drawbacks
  • Neural Network Models
• Model
  • Language Model
  • MLP Parser
  • Tree-stack LSTM Parser
• Results
  • MLP vs Tree-stack LSTM
  • Morphological Feature Embeddings
  • Static vs Dynamic Oracle Training
  • Transfer Learning
• Conclusion
• Future Work & Discussions
Page 52: Transition Based Dependency Parsing with Deep Learning, Omer Kırnap (Koc University), okirnap@ku.edu.tr, September 27, 2018

Solution

Find a recurrent architecture such that it can summarize the parsing history as well as the word sequences in the buffer and stack

Omer Kırnap (Koc University) MSc Thesis September 27 2018 52 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to Universal Dependencies

CoNLL17

• Koc-University team with MLP Parser using Context Embeddings

CoNLL18

• KParse team with Tree-stack LSTM Parser using Context and Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 53 123

c Tree-stack LSTM Parser (CoNLL18)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 54 123

Related Work - Stack LSTM

Figure Stack LSTM [Dyer et al 2015]

Represent each component (σ, β, A) with an LSTM. Modify the head word's embedding with the dependent's embedding.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 55 123

Problems with Stack LSTM

They only modify the stack's word embeddings

Hidden states of the LSTMs are not updated unless a reduce occurs

Actions are not explicitly represented

They only used word2vec embeddings [Mikolov et al 2013]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 56 123

Our solution

We propose

Context embeddings should improve parsing accuracy

Dependency relations should be explicitly represented

Morphological Features of a word may enhance parsing accuracy

Omer Kırnap (Koc University) MSc Thesis September 27 2018 57 123

Tree-stack LSTM Overview

[Figure: Tree-stack LSTM architecture diagram]

We propose the Tree-stack LSTM model with 4 components:

1 β-LSTM
2 σ-LSTM
3 Action-LSTM
4 Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 58 123

Tree-stack LSTM

Input Representation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 59 123

Input Representation

Action and Dependency Relation Embeddings

Every action is represented with a continuous vector

Every dependency relation is represented with a continuous vector

Omer Kırnap (Koc University) MSc Thesis September 27 2018 60 123

Input Representation

We do not include an explicit feature extractor. We initiate the word representation by concatenating:

Character-based LSTM's word vectors

Word-based BiLSTM's context vectors

Part-of-speech (POS) vectors

Morph-feat vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 61 123
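The four pieces are simply concatenated into a single input vector per word; a minimal sketch with made-up dimensions (the thesis's actual vector sizes may differ):

```python
# Illustrative dimensions, not the thesis's hyper-parameters.
d_char, d_ctx, d_pos, d_morph = 6, 8, 3, 4

def word_input(char_vec, context_vec, pos_vec, morph_vec):
    """Word representation = [char-LSTM word vec; BiLSTM context vec; POS vec; morph-feat vec]."""
    return char_vec + context_vec + pos_vec + morph_vec  # plain list concatenation

x = word_input([0.0] * d_char, [0.0] * d_ctx, [0.0] * d_pos, [0.0] * d_morph)
print(len(x))  # 21
```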

Input Representation

Morph-feat Vectors

Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs IT It

Figure Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 62 123
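A UD morph-feat string like the one in the figure can be embedded by giving each Feature=Value pair its own vector and combining them. One plausible encoding is sketched below; the averaging step is an assumption for illustration, and the thesis may compose the per-feature vectors differently:

```python
import random
random.seed(1)

DIM = 8
feat_table = {}   # "Feature=Value" -> embedding, grown on demand

def morph_feat_vector(feats):
    """Embed a UD morph-feat string such as 'Case=Nom|Number=Sing'.

    Each Feature=Value pair gets its own embedding and the pairs are
    averaged; this composition is illustrative, not the thesis's exact method.
    """
    if feats == "_":                      # UD marks "no features" with _
        return [0.0] * DIM
    vecs = []
    for pair in feats.split("|"):
        if pair not in feat_table:
            feat_table[pair] = [random.gauss(0, 1) for _ in range(DIM)]
        vecs.append(feat_table[pair])
    return [sum(col) / len(vecs) for col in zip(*vecs)]

v = morph_feat_vector("Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs")
print(len(v), len(feat_table))  # 8 5
```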

Tree-stack LSTM

Model Components1 β-LSTM2 σ-LSTM3 Action-LSTM4 Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 63 123

β-LSTM

[Figure: Tree-stack LSTM architecture diagram]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 64 123

β-LSTM

Figure: Buffer's β-LSTM, an LSTM running over the buffer words w_i, w_i+1, w_i+2

Omer Kırnap (Koc University) MSc Thesis September 27 2018 65 123

σ-LSTM

[Figure: Tree-stack LSTM architecture diagram]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 66 123

σ-LSTM

Figure: Stack's σ-LSTM, an LSTM running over the stack items s_i, s_i+1, s_i+2

Omer Kırnap (Koc University) MSc Thesis September 27 2018 67 123

Action-LSTM

[Figure: Tree-stack LSTM architecture diagram]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 68 123

Action-LSTM

Figure: Action-LSTM, an LSTM running over the transition history

Omer Kırnap (Koc University) MSc Thesis September 27 2018 69 123

How are the components of tree-stack LSTM connected?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 70 123

Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 71 123

Tree-RNN (t-RNN)

Figure: t-RNN composes the head word, dependent word, and dependency relation into a new head representation

w_head_new = tanh(W_rnn · [w_head_old; d_l; w_dep] + b_rnn)   (1)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 72 123
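Equation (1) can be sketched in plain Python; the sizes and the random initialization below are illustrative stand-ins, not the thesis's trained parameters:

```python
import math
import random
random.seed(0)

D_WORD, D_REL = 4, 2     # toy sizes, not the thesis's dimensions
D_IN = 2 * D_WORD + D_REL

# Randomly initialized parameters standing in for the learned W_rnn, b_rnn.
W_rnn = [[random.gauss(0, 0.5) for _ in range(D_IN)] for _ in range(D_WORD)]
b_rnn = [0.0] * D_WORD

def t_rnn(w_head_old, d_l, w_dep):
    """w_head_new = tanh(W_rnn @ [w_head_old; d_l; w_dep] + b_rnn)  (Eq. 1)."""
    x = w_head_old + d_l + w_dep                      # concatenation
    return [math.tanh(sum(w * xi for w, xi in zip(row, x)) + b)
            for row, b in zip(W_rnn, b_rnn)]

new_head = t_rnn([0.1] * D_WORD, [0.2] * D_REL, [0.3] * D_WORD)
print(len(new_head))  # 4
```

Note that the output has the same dimensionality as a word vector, so it can replace the head's embedding in the stack after a left or right transition.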

Tree-RNN with

1 Left Transition
2 Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 73 123

Left Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 74 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: Each embedding is initiated by concatenating POS, language, and morph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 75 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: Stack's top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 76 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: t-RNN calculates the new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 77 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: β-LSTM recalculates its hidden state based on the new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 78 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: Tree-stack LSTM is ready for the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 79 123

Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 80 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: Each embedding is initiated by concatenating POS, language, and morph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 81 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: Stack's top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 82 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: t-RNN calculates the new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 83 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: σ-LSTM recalculates its hidden state from the new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 84 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: Tree-stack LSTM is ready for the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 85 123
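The left and right transitions of the preceding slides can be sketched as plain stack/buffer operations. This is an illustrative toy (word ids instead of embeddings, no t-RNN update), not the thesis code:

```python
def shift(stack, buffer):
    stack.append(buffer.pop(0))

def left_arc(stack, buffer, arcs, d):
    # left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)}):
    # the stack top s is attached as a dependent of the buffer front b.
    s = stack.pop()
    arcs.add((buffer[0], d, s))

def right_arc(stack, buffer, arcs, d):
    # right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)}):
    # the stack top t is attached as a dependent of s, the item below it.
    t = stack.pop()
    arcs.add((stack[-1], d, t))

# Tokens are word ids; 0 is the artificial root.
stack, buffer, arcs = [0], [1, 2, 3], set()
shift(stack, buffer)                   # σ = [0, 1], β = [2, 3]
left_arc(stack, buffer, arcs, "det")   # 1 becomes a dependent of 2
shift(stack, buffer)                   # σ = [0, 2], β = [3]
shift(stack, buffer)                   # σ = [0, 2, 3], β = []
right_arc(stack, buffer, arcs, "obj")  # 3 becomes a dependent of 2
print(sorted(arcs))  # [(2, 'det', 1), (2, 'obj', 3)]
```

In the full model, each arc operation additionally runs the t-RNN of Equation (1) to refresh the head's embedding before parsing continues.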

Final overview of Tree-stack LSTM

[Figure: Final Tree-stack LSTM architecture: β-LSTM, σ-LSTM, Action-LSTM and t-RNN outputs are concatenated and fed to an MLP]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 86 123

Overview

1 Introduction: Overview of Dependency Parsing; Transition Based Dependency Parsing

2 Related Work: Linear Models and their Drawbacks; Neural Network Models

3 Model: Language Model; MLP Parser; Tree-stack LSTM Parser

4 Results: MLP vs Tree-stack LSTM; Morphological Feature Embeddings; Static vs Dynamic Oracle Training; Transfer Learning

5 Conclusion
6 Future Work & Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 87 123

4 Results & Comparisons

Omer Kırnap (Koc University) MSc Thesis September 27 2018 88 123

Results amp Comparisons

Dataset

CoNLL17:
• Dependency parsing of 81 treebanks in 49 languages
• All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations
• Koc-University ranked 7th out of 33 participants (1st among transition based parsers)

CoNLL18:
• Dependency parsing of 82 treebanks in 57 languages
• All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations
• Koc-University ranked 16th out of 30 participants (2nd among transition based parsers)

Differences between the two tasks: 1 Train/test split change, 2 Annotation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 89 123

MLP vs Tree-stack LSTM

The CoNLL 2018 committee released comparison results of CoNLL17 and CoNLL18 systems tested on the same test sets

Omer Kırnap (Koc University) MSc Thesis September 27 2018 90 123

MLP vs Tree-stack LSTM

2 possible problems of official comparison

1 If the annotation of the treebank is improved, the older parser is handicapped

2 If the training-test split has changed and the old training data are now in the test data, the old parser is favored undeservedly

Omer Kırnap (Koc University) MSc Thesis September 27 2018 91 123

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code        MLP    Tree-stack
ru_taiga (10k)   58.89  60.55
hu_szeged (20k)  66.21  68.18
tr_imst (50k)    56.78  58.75
ar_padt (120k)   67.83  68.14
en_ewt (205k)    74.87  75.77
cs_cac (473k)    83.39  83.57

Tree-stack LSTM outperforms MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 92 123

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 93 123

MLP Parser


Figure Initial model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 94 123

Only Action LSTM


Figure Only action LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 95 123

Only β-LSTM


Figure Only β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 96 123

Only σ-LSTM


Figure Only σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang Code   MLP    Only Action  Only-β  Only-σ
hu_szeged   66.21  66.87        66.94   67.03
sv_lines    71.12  72.05        72.17   72.45
tr_imst     57.12  56.87        57.02   57.12
ar_padt     67.83  66.67        66.89   66.92
cs_cac      83.89  82.23        83.13   83.17
en_ewt      75.54  75.43        75.56   75.67

Table: Comparison between MLP and "Only" models

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN

[Figure: Tree-stack LSTM architecture diagram]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 99 123

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang Code            without t-RNN  with t-RNN
no_nynorsklia (3k)   51.78          53.33
ru_taiga (11k)       59.13          60.55
gl_treegal (15k)     69.76          70.45
hu_szeged (20k)      66.12          68.18
sv_lines (49k)       74.04          75.46
tr_imst (50k)        58.12          58.75
ar_padt (120k)       68.04          68.14
en_ewt (204k)        74.87          75.77
cs_cac (473k)        82.89          83.57
cs_pdt (1M)          81.17          81.164

t-RNN provides a comparative advantage for low-resource languages

Omer Kırnap (Koc University) MSc Thesis September 27 2018 100 123

Page 53: Transition Based Dependency Parsing with Deep LearningTransition Based Dependency Parsing with Deep Learning Omer K rnap Ko˘c University okirnap@ku.edu.tr September 27, 2018 Omer

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to UniversalDependencies

CoNLL17

bull Koc-University team with MLP Parser using Context Embeddings

CoNLL18

bull KParse team with Tree-stack LSTM Parser usingContext and Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 53 123

c Tree-stack LSTM Parser (CoNLL18)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 54 123

Related Work - Stack LSTM

Figure Stack LSTM [Dyer et al 2015]

Represent each component (σ β A) with an LSTMModifying head wordrsquos embedding with dependent embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 55 123

Problems with Stack LSTM

They only modify stackrsquos word embeddings

Hidden states of LSTMS are not updated unless reduce

Actions are not explicitly represented

They only used word2vec embeddings [Mikolov et al 2013]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 56 123

Our solution

We propose

Context embeddings should improve parsing accuracy

Dependency relations should be explicitly represented

Morphological Features of a word may enhance parsing accuracy

Omer Kırnap (Koc University) MSc Thesis September 27 2018 57 123

Tree-stack LSTM Overview

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

We propose Tree-stack LSTM model with 4 components

β-LSTMσ-LSTMAction-LSTMTree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 58 123

Tree-stack LSTM

Input Representation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 59 123

Input Representation

Action and Dependency Relation Embeddings

Every action is represented with continuous vector

Every dependency relation is represented with continuous vector

Omer Kırnap (Koc University) MSc Thesis September 27 2018 60 123

Input Representation

We do not include explicit feature extractor We initiated wordrepresentation by concatenating

Character Based LSTMrsquos word vectors

Word Based BiLSTMrsquos context vectors

Part-of-speech (POS) vectors

Morph-feat vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 61 123

Input Representation

Morp-feat Vectors

Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs IT It

Figure Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 62 123

Tree-stack LSTM

Model Components1 β-LSTM2 σ-LSTM3 Action-LSTM4 Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 63 123

β-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 64 123

β-LSTM

LSTM LSTM LSTM

wi+2wi+1wi

Figure Bufferrsquos β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 65 123

σ-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 66 123

σ-LSTM

LSTM LSTM LSTM

si si+1 si+2

Figure Stackrsquos σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 67 123

Action-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 68 123

Action-LSTM

LSTM LSTM LSTM

Figure Action-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 69 123

How do components of tree-stack LSTM are connected

Omer Kırnap (Koc University) MSc Thesis September 27 2018 70 123

Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 71 123

Tree-RNN (t-RNN)

t-RNN

Dependent word

Dependency Relation

Head word

Figure t-RNN

whead new = tanh(Wrnn lowast [whead old dl wdep] + brnn) (1)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 72 123

Tree-RNN with

1 Left Transition2 Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 73 123

Left Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 74 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM LSTM

Left transition

t-RNN

Dependency Relation

HeadDependent

Figure Each embedding initiated by concatenating POS language andmorph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 75 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

Figure Stackrsquos top LSTM is reducedOmer Kırnap (Koc University) MSc Thesis September 27 2018 76 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 77 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure β-LSTM recalculates its hidden based on new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 78 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})


Figure Tree-stack LSTM is ready to predict the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 79 123

Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 80 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})


Figure Each embedding is initialized by concatenating POS, language and morph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 81 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})


Figure Stack's top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 82 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})


Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 83 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})


Figure σ-LSTM recalculates its hidden state from the new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 84 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})


Figure Tree-stack LSTM is ready to predict the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 85 123
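The left and right transitions above can be sketched directly from their definitions. The state representation, words and labels here are illustrative toy values, not the thesis implementation:

```python
# A parser state is (stack, buffer, arcs); an arc (h, d, t) connects
# head h to dependent t with dependency label d.

def left(state, d):
    """left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})"""
    stack, buf, arcs = state
    s, b = stack[-1], buf[0]          # stack top s, buffer front b
    return stack[:-1], buf, arcs | {(b, d, s)}

def right(state, d):
    """right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})"""
    stack, buf, arcs = state
    s, t = stack[-2], stack[-1]       # two topmost stack items
    return stack[:-1], buf, arcs | {(s, d, t)}

state = (["ROOT", "news"], ["had", "effect"], set())
state = left(state, "nsubj")          # adds arc (had, nsubj, news), pops "news"
state2 = right((["ROOT", "had"], [], set()), "root")  # adds arc (ROOT, root, had)
print(state)
print(state2)
```

In both transitions the dependent is removed from the stack, which is why the top LSTM of the stack is reduced and the t-RNN recomputes the head's embedding.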

Final overview of Tree-stack LSTM

Figure Tree-stack LSTM overview: t-RNN (head word, dependent word, dependency relation), σ-LSTM, β-LSTM and Action-LSTM outputs are concatenated and fed to an MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 86 123

Overview

1 Introduction: Overview of Dependency Parsing; Transition Based Dependency Parsing

2 Related Work: Linear Models and their Drawbacks; Neural Network Models

3 Model: Language Model; MLP Parser; Tree-stack LSTM Parser

4 Results: MLP vs Tree-stack LSTM; Morphological Feature Embeddings; Static vs Dynamic Oracle Training; Transfer Learning

5 Conclusion

6 Future Work & Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 87 123

4 Results & Comparisons

Omer Kırnap (Koc University) MSc Thesis September 27 2018 88 123

Results & Comparisons

Dataset

CoNLL17:
Dependency parsing of 81 treebanks in 49 languages
All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations
Koc-University ranked 7th out of 33 participants (1st among transition based parsers)

CoNLL18:
Dependency parsing of 82 treebanks in 57 languages
All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations
Koc-University ranked 16th out of 30 participants (2nd among transition based parsers)

Differences between the shared tasks: 1 Train/test split change 2 Annotation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 89 123

MLP vs Tree-stack LSTM

The CoNLL 2018 committee released comparison results of CoNLL17 and CoNLL18 systems tested under the same test sets

Omer Kırnap (Koc University) MSc Thesis September 27 2018 90 123

MLP vs Tree-stack LSTM

Two possible problems with the official comparison:

1 If the annotation of the treebank is improved, the older parser is handicapped

2 If the training-test split has changed and the old training data are now in the test data, the old parser is favored undeservedly

Omer Kırnap (Koc University) MSc Thesis September 27 2018 91 123

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code         MLP    Tree-stack
ru taiga (10k)    58.89  60.55
hu szeged (20k)   66.21  68.18
tr imst (50k)     56.78  58.75
ar padt (120k)    67.83  68.14
en ewt (205k)     74.87  75.77
cs cac (473k)     83.39  83.57

Tree-stack LSTM outperforms MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 92 123

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 93 123

MLP Parser


Figure Initial model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 94 123

Only Action LSTM


Figure Only action LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 95 123

Only β-LSTM


Figure Only β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 96 123

Only σ-LSTM


Figure Only σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang Code   MLP    Only Action  Only-β  Only-σ
hu szeged   66.21  66.87        66.94   67.03
sv lines    71.12  72.05        72.17   72.45
tr imst     57.12  56.87        57.02   57.12
ar padt     67.83  66.67        66.89   66.92
cs cac      83.89  82.23        83.13   83.17
en ewt      75.54  75.43        75.56   75.67

Table Comparison between MLP and "Only" models

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN

Figure Tree-stack LSTM overview: t-RNN (head word, dependent word, dependency relation), σ-LSTM, β-LSTM and Action-LSTM outputs are concatenated and fed to an MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 99 123

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang Code           without t-RNN  with t-RNN
no nynorsklia (3k)  51.78          53.33
ru taiga (11k)      59.13          60.55
gl treegal (15k)    69.76          70.45
hu szeged (20k)     66.12          68.18
sv lines (49k)      74.04          75.46
tr imst (50k)       58.12          58.75
ar padt (120k)      68.04          68.14
en ewt (204k)       74.87          75.77
cs cac (473k)       82.89          83.57
cs pdt (1M)         81.17          81.16

t-RNN provides a comparative advantage for low-resource languages

Omer Kırnap (Koc University) MSc Thesis September 27 2018 100 123

Ablation Analysis

Overall results of ablation analysis

Lang       MLP    Only A  Only-β  Only-σ  w/o t-RNN  all
hu szeged  66.21  66.87   66.94   67.03   66.12      68.18
sv lines   71.12  72.05   72.17   74.04   72.17      75.46
tr imst    57.12  56.87   57.02   57.12   58.12      58.75
ar padt    67.83  66.67   66.89   66.92   68.04      68.14
cs cac     83.89  82.23   83.13   83.17   82.89      83.57
en ewt     75.54  75.43   75.56   75.67   74.87      75.77

Tree-stack LSTM beats other model variations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of Ablation Experiments

t-RNN's performance contribution increases when the training size decreases

σ-LSTM provides more useful information, independent of dataset size

Interconnecting the model's components with t-RNN makes tree-stack LSTM more powerful for low-resource languages (ranked 10th overall and 2nd among transition based parsers)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What does Morphological Feature Embedding provide

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental Settings: We divide the CoNLL18 UD 2.2 dataset into 4 parts, based on the number of training tokens for each language, to better understand our contributions

Languages having less than 20k tokens

Languages having more than 20k but less than 50k tokens

Languages having more than 50k but less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having less than 20k training tokens

Lang code      Morph-Feats  no Morph-Feats  # of tokens
no nynorsklia  51.13        53.33             3583
ru taiga       58.32        60.55            10479
sme giella     52.78        53.39            16385
la perseus     49.93        51.60            18184
ug udt         52.78        53.39            19262
sl sst         46.72        48.77            19473
hu szeged      66.23        68.18            20166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having between 50k and 100k tokens

Lang code      Morph-Feats  no Morph-Feats  # of tokens
sv lines       72.18        74.81            48325
fr sequoia     84.36        82.17            50543
en gum         76.44        75.34            53686
ko gsd         73.74        72.54            56687
eu bdt         74.55        73.32            72974
nl lassysmall  76.7         75.8             75134
gl ctg         79.02        79.018           79327
lv lvtb        72.33        72.24            80666
id gsd         75.76        73.97            97531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having more than 100k training tokens

Lang code   Morph-Feats  no Morph-Feats  # of tokens
fa seraji   81.18        81.12             121064
bg btb      84.53        84.55             124336
en ewt      75.77        75.682            204585
ar padt     68.02        68.14             223881
de gsd      71.59        71.32             263804
ca ancora   85.89        85.874            417587
es ancora   84.99        84.78             444617
cs cac      83.57        83.63             472608
cs pdt      81.43        82.12            1173282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle: transitions using gold moves. Dynamic oracle: transitions using predicted moves.

In both cases, the log-probability (log p) of the gold moves is maximized.

Figure Tree-stack LSTM overview: t-RNN (head word, dependent word, dependency relation), σ-LSTM, β-LSTM and Action-LSTM outputs are concatenated and fed to an MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123
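The difference between the two training regimes can be sketched as a toy training step. The scorer, move set and state here are placeholders, not the thesis model:

```python
import math

def apply_move(state, move):
    # Toy state update: just record the moves taken
    # (a real parser would update stack/buffer/arcs here).
    return state + [move]

def uniform_scores(state):
    # Toy scorer: uniform distribution over two moves.
    return {"SHIFT": 0.5, "REDUCE": 0.5}

def train_step(state, gold_moves, score_fn, dynamic=False):
    """Accumulate log p(gold move) along one transition sequence.
    Static oracle: advance the state with the gold move.
    Dynamic oracle: advance the state with the model's predicted move."""
    logp = 0.0
    for gold in gold_moves:
        probs = score_fn(state)
        logp += math.log(probs[gold])           # gold move is scored either way
        move = max(probs, key=probs.get) if dynamic else gold
        state = apply_move(state, move)
    return logp

print(train_step([], ["SHIFT", "SHIFT", "REDUCE"], uniform_scores))  # 3 * log(0.5)
```

Under the dynamic oracle the model visits states produced by its own (possibly wrong) predictions, which is what exposes it to error states seen at test time.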

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning:
1 Using very limited data to train a LM for word and context vectors, and using them to train a parser from scratch
2 Using Facebook's word vectors to train a parser [Bojanowski et al 2017]
3 Using my own word and context vectors trained on a different language from the same language family
4 Applying transfer learning with a pre-trained parser

Language      (1)           (2)    (3)    (4)
af afribooms  not provided  75.46  77.43  78.12
kk ktb        20.19         22.31  21.96  23.86
bxr bdt        7.64          9.76   9.93   8.98
kmr mg        20.12         22.57  22.78  23.39

Table LAS values for strategies (1), (2), (3) and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the most beneficial

From-scratch LM training does not yield useful word and context vectors

Our word and context vectors are still more useful than Facebook's [Bojanowski et al 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition-based parsers can only build projective trees

6 Figure from http://stp.lingfil.uu.se/~sara/kurser/5LN455-2014/lectures/5LN455-F8.pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123
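A projective tree is one whose arcs, drawn above the sentence, never cross. This can be checked with a small sketch (heads are given as 1-based word indices, 0 denoting the root; the example trees are illustrative):

```python
def is_projective(heads):
    """heads[i-1] is the head index of word i (0 = root, words 1-based).
    A dependency tree is projective iff no two arcs cross."""
    # Normalize each arc to an (earlier, later) index pair.
    arcs = [(min(h, d), max(h, d)) for d, h in enumerate(heads, start=1)]
    for (i, j) in arcs:
        for (k, l) in arcs:
            if i < k < j < l:      # arcs (i, j) and (k, l) cross
                return False
    return True

print(is_projective([2, 0, 2]))     # simple chain, no crossing -> True
print(is_projective([0, 4, 1, 1]))  # arc 2-4 crosses arc 1-3    -> False
```

Non-projective arcs are exactly the cases a transition-based parser of this kind cannot recover, which is why the performance gap below shrinks as the projectivity ratio grows.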

Projective vs Non-projective

We compared our model with the best model for different projectivity ratios

Language     Projectivity %  Best (LAS)  Our (LAS)
grc perseus   90.7           79.39       55.03 (20)
eu bdt        95.13          84.22       74.13 (17)
hu szeged     97.8           82.66       68.18 (14)
da ddt        98.26          86.28       76.40 (17)
en gum        99.6           85.05       76.44 (15)
gl treegal   100             74.25       70.45 (10)
gl ctg       100             82.12       79.45 (14)

Table Our model's performance gap decreases as the projectivity ratio increases

7 From the official results page and our projectivity table
Omer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusion: We introduced "Context, Word and Morph-feat" embeddings and showed their contribution in transition based dependency parsing

Our Tree-stack LSTM outperformed the MLP while removing hand-crafted feature engineering

Tree-stack LSTM performed better with low-resource languages

When the training dataset size increases, tree-stack LSTM loses its advantage

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization, morphological tagging and dependency parsing performed better. Some also jointly train a language model together with pre-trained embeddings.

Attention Mechanism

Applying attention between σ-LSTM, β-LSTM or Action-LSTM states may bring a performance improvement

Morphological Features

Finding different ways to represent morphological features

Dynamic Oracle vs Beam Training

Although I tried both of them, I could not obtain a performance improvement. There may be convergence problems with our loss function, and other losses (e.g. CRF) may solve this problem.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap, Erenay Dayanık and Deniz Yuret. 2018. Tree-stack LSTM in Transition Based Dependency Parsing. In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap, Berkay Furkan Onder and Deniz Yuret. 2017. Parsing with Context Embeddings. In Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann, Carlos Gomez-Rodriguez and Giorgio Satta. 2011. Dynamic programming algorithms for transition-based dependency parsers. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies - Volume 1, Association for Computational Linguistics, pages 673-682.

S. Kübler, R. McDonald and J. Nivre. 2009. Dependency Parsing. Morgan & Claypool, US.

Chris Dyer, Miguel Ballesteros, Wang Ling, Austin Matthews and Noah A. Smith. 2015. Transition-based dependency parsing with stack long short-term memory. CoRR abs/1505.08075.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123

  • Introduction
    • Overview of Dependency Parsing
    • Transition Based Dependency Parsing
      • Related Work
        • Linear Models and their Drawbacks
        • Neural Network Models
          • Model
            • Language Model
            • MLP Parser
            • Tree-stack LSTM Parser
              • Results
                • MLP vs Tree-stack LSTM
                • Morphological Feature Embeddings
                • Static vs Dynamic Oracle Training
                • Transfer Learning
                  • Conclusion
                  • Future Work amp Discussions
Page 54: Transition Based Dependency Parsing with Deep LearningTransition Based Dependency Parsing with Deep Learning Omer K rnap Ko˘c University okirnap@ku.edu.tr September 27, 2018 Omer

c Tree-stack LSTM Parser (CoNLL18)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 54 123

Related Work - Stack LSTM

Figure Stack LSTM [Dyer et al 2015]

Represent each component (σ β A) with an LSTMModifying head wordrsquos embedding with dependent embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 55 123

Problems with Stack LSTM

They only modify stackrsquos word embeddings

Hidden states of LSTMS are not updated unless reduce

Actions are not explicitly represented

They only used word2vec embeddings [Mikolov et al 2013]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 56 123

Our solution

We propose

Context embeddings should improve parsing accuracy

Dependency relations should be explicitly represented

Morphological Features of a word may enhance parsing accuracy

Omer Kırnap (Koc University) MSc Thesis September 27 2018 57 123

Tree-stack LSTM Overview

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

We propose Tree-stack LSTM model with 4 components

β-LSTMσ-LSTMAction-LSTMTree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 58 123

Tree-stack LSTM

Input Representation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 59 123

Input Representation

Action and Dependency Relation Embeddings

Every action is represented with continuous vector

Every dependency relation is represented with continuous vector

Omer Kırnap (Koc University) MSc Thesis September 27 2018 60 123

Input Representation

We do not include explicit feature extractor We initiated wordrepresentation by concatenating

Character Based LSTMrsquos word vectors

Word Based BiLSTMrsquos context vectors

Part-of-speech (POS) vectors

Morph-feat vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 61 123

Input Representation

Morp-feat Vectors

Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs IT It

Figure Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 62 123

Tree-stack LSTM

Model Components1 β-LSTM2 σ-LSTM3 Action-LSTM4 Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 63 123

β-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 64 123

β-LSTM

LSTM LSTM LSTM

wi+2wi+1wi

Figure Bufferrsquos β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 65 123

σ-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 66 123

σ-LSTM

LSTM LSTM LSTM

si si+1 si+2

Figure Stackrsquos σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 67 123

Action-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 68 123

Action-LSTM

LSTM LSTM LSTM

Figure Action-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 69 123

How do components of tree-stack LSTM are connected

Omer Kırnap (Koc University) MSc Thesis September 27 2018 70 123

Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 71 123

Tree-RNN (t-RNN)

t-RNN

Dependent word

Dependency Relation

Head word

Figure t-RNN

whead new = tanh(Wrnn lowast [whead old dl wdep] + brnn) (1)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 72 123

Tree-RNN with

1 Left Transition2 Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 73 123

Left Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 74 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM LSTM

Left transition

t-RNN

Dependency Relation

HeadDependent

Figure Each embedding initiated by concatenating POS language andmorph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 75 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

Figure Stackrsquos top LSTM is reducedOmer Kırnap (Koc University) MSc Thesis September 27 2018 76 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 77 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure β-LSTM recalculates its hidden based on new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 78 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure Tree-stack LSTM is ready to give new transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 79 123

Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 80 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure Each embedding initiated by concatenating POS language andmorph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 81 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure Stackrsquos top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 82 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 83 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure σ-LSTM recalculates its hidden from new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 84 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure Tree-stack LSTM is ready to give new transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 85 123

Final overview of Tree-stack LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 86 123

Overview

1 IntroductionOverview of Dependency ParsingTransition Based Dependency Parsing

2 Related WorkLinear Models and their DrawbacksNeural Network Models

3 ModelLanguage ModelMLP ParserTree-stack LSTM Parser

4 ResultsMLP vs Tree-stack LSTMMorphological Feature EmbeddingsStatic vs Dynamic Oracle TrainingTransfer Learning

5 Conclusion6 Future Work amp Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 87 123

4 Results amp Comparisons

Omer Kırnap (Koc University) MSc Thesis September 27 2018 88 123

Results amp Comparisons

Dataset

Dependency parsing of 81

treebanks in 49 languages

All treebanks use standardized

annotation

17 universal

part-of-speech tags

37 universal dependency

relations

Koc-University ranked 7th out

of 33 participants (1st among

transition based parsers)

Dependency parsing of 82

treebanks in 57 languages

All treebanks use

standardized annotation

17 universal

part-of-speech tags

37 universal dependency

relations

Koc-University ranked 16th

out of 30 participants (2nd

among transition based

parsers)

CoNLL17 CoNLL181 Traintest split change 2 Annotation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 89 123

MLP vs Tree-stack LSTM

CoNLL 2018 committee released comparison results of CoNLL17 andCoNLL18 systems tested under the same test sets

Omer Kırnap (Koc University) MSc Thesis September 27 2018 90 123

MLP vs Tree-stack LSTM

2 possible problems of official comparison

1 If the annotation of the tree bank is improved the older parser ishandicapped

2 If the training-test split has changed and the old training data arenow in test data the old parser is favored undeservedly

Omer Kırnap (Koc University) MSc Thesis September 27 2018 91 123

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code MLP Tree-stackru taiga (10k) 5889 6055hu szeged (20k) 6621 6818tr imst (50k) 5678 5875ar padt (120k) 6783 6814en ewt (205k) 7487 7577cs cac (473k) 8339 8357

Tree-stack LSTM outperforms MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 92 123

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 93 123

MLP Parser

MLP

Figure Initial model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 94 123

Only Action LSTM

LSTM LSTM

Figure Only action LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 95 123

Only β-LSTM

LSTM LSTM LSTM

LSTM LSTM MLP

Figure Only β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 96 123

Only σ-LSTM

LSTM LSTM

LSTM LSTM MLP

Figure Only σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang Code MLP Only Action Only-β Only-σhu szeged 6621 6687 6694 6703sv lines 7112 7205 7217 7245tr imst 5712 5687 5702 5712ar padt 6783 6667 6689 6692

cs cac 8389 8223 8313 8317

en ewt 7554 7543 7556 7567

Table Comparison between MLP and rdquoOnlyrdquo models

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 99 123

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang Code without t-RNN with t-RNNno nynorsklia (3k) 5178 5333ru taiga (11k) 5913 6055gl treegal (15k) 6976 7045hu szeged (20k) 6612 6818sv lines (49k) 7404 7546tr imst (50k) 5812 5875

ar padt (120k) 6804 6814

en ewt (204k) 7487 7577

cs cac (473k) 8289 8357

cs pdt (1M) 8117 81164

t-RNN provides comparative advantage for low-resourcelanguages

Omer Kırnap (Koc University) MSc Thesis September 27 2018 100 123

Ablation Analysis

Overall results of ablation analysis

Lang MLP Only A Only-β Only-σ wot-RNN allhu szeged 6621 6687 6694 6703 6612 6818sv lines 7112 7205 7217 7404 7217 7546tr imst 5712 5687 5702 5712 5812 5875ar padt 6783 6667 6689 6692 6804 6814cs cac 8389 8223 8313 8317 8289 8357en ewt 7554 7543 7556 7567 7487 7577

Tree-stack LSTM beats other model variations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of Ablation Experiments

t-RNNrsquos performance contribution increases when the training sizedecreases

σ-LSTM provides more useful information independent from datasetsize

Interconnecting modelrsquos component with t-RNN makes tree-stackLSTM more powerful for low-resource languages (ranked 10th of alland 2nd among transition based parsers)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What does Morphological Feature Embedding provide

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental SettingsWe divide Conll18 UD dataset 22 into 4 parts based on number oftraining tokens for each language to better understand our contributions

Languages having less than 20k tokens

Languages having more than 20k less than 50k tokens

Languages having more than 50k less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having less than 20k training tokens

Lang code Morph-Feats no Morph-Feats of tokensno nynorsklia 5113 5333 3583

ru taiga 5832 6055 10479

sme giella 5278 5339 16385

la perseus 4993 516 18184

ug udt 5278 5339 19262

sl sst 4672 4877 19473

hu szeged 6623 6818 20166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having tokens in between 50k and100k

Lang code Morph-Feats no Morph-Feats of tokenssv lines 7218 7481 48325

fr sequoia 8436 8217 50543

en gum 7644 7534 53686

ko gsd 7374 7254 56687

eu bdt 7455 7332 72974

nl lassymal 767 758 75134

gl ctg 7902 79018 79327

lv lvtb 7233 7224 80666

id gsd 7576 7397 97531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having more than 100k trainingtokens

Lang code Morph-Feats no Morph-Feats of tokensfa seraji 8118 8112 121064

bg btb 8453 8455 124336

en ewt 7577 75682 204585

ar padt 6802 6814 223881

de gsd 7159 7132 263804

ca ancora 8589 85874 417587

es ancora 8499 8478 444617

cs cac 8357 8363 472608

cs pdt 8143 8212 1173282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle transitions using gold movesDynamic oracle transitions using predicted moves

In both cases logp of gold moves maximized

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning1 Using very limited data to train LM for word and context vectors and

use them to train a parser from scratch2 Using Facebookrsquos word vectors to train a parser [Bojanowski et al

2017]3 Using my own word and context vectors trained with different

language but from the same language family4 Applying transfer learning with a pre-trained parser

Language (1) (2) (3) (4)af afribooms not provided 7546 7743 7812kk ktb 2019 2231 2196 2386bxr bdt 764 976 993 898

kmr mg 2012 2257 2278 2339

Table LAS values for strategies (1) (2) (3) and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the mostbeneficial

From scratch LM training does not bring useful word and contextvectors

Our word and context vectors are still more useful than Facebookrsquos[Bojanowski et al 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition Based Parser can only build projective trees 6

6Figure fromhttpstplingfiluuse sarakurser5LN455-2014lectures5LN455-F8pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123

Projective vs Non-projective

We compared our model with the best model for different projectivityratios

Language Projectiviy Best (LAS) Our (LAS)

grc perseus 907 7939 5503 (20)

eu bdt 9513 8422 7413 (17)

hu szeged 978 8266 6818 (14)

da ddt 9826 8628 7640 (17)

en gum 996 8505 7644 (15)

gl treegal 100 7425 7045 (10)

gl ctg 100 8212 7945 (14)

Table Our models performance gap decreases as the projectivity ratio increases

7

7From official results page and our projectivity tableOmer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusionWe introduced ldquoContext Word and Morph-featrdquo embeddings and showedtheir contribution in transition based dependency parsing

Our Tree-stack LSTM outperformed MLP by removing hand-craftedfeature engineering

Tree-stack LSTM performed better with low resource languages

When the training dataset size increases tree-stack LSTM losses itsadvantage

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization morphological taggingand dependency parsing performed better Some are also jointly trained alanguage model together with pre-trained embeddings

Attention Mechanism

Applying attention in between σ-LSTM states or β-LSTM orAction-LSTM may bring performance improvement

Morphological Features

Finding different way to represent morphological features

Dynamic Oracle vs Beam Training

Although I tried both of them I could not obtain performanceimprovement There may be convergence problems with our loss functionand another losses (CRF) may solve this problem

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap, Erenay Dayanık, and Deniz Yuret. 2018. Tree-stack LSTM in Transition Based Dependency Parsing. In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap, Berkay Furkan Onder, and Deniz Yuret. 2017. Parsing with Context Embeddings. In Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.


References

Marco Kuhlmann, Carlos Gomez-Rodriguez, and Giorgio Satta. 2011. Dynamic programming algorithms for transition-based dependency parsers. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Volume 1. Association for Computational Linguistics, pages 673-682.

S. Kübler, R. McDonald, and J. Nivre. 2009. Dependency Parsing. Morgan & Claypool, US.

Chris Dyer, Miguel Ballesteros, Wang Ling, Austin Matthews, and Noah A. Smith. 2015. Transition-based dependency parsing with stack long short-term memory. CoRR abs/1505.08075.


Thank you for your attention


Questions



Related Work - Stack LSTM

Figure: Stack LSTM [Dyer et al., 2015]

Represent each component (σ, β, A) with an LSTM. Modify the head word's embedding with the dependent's embedding.


Problems with Stack LSTM

They only modify the stack's word embeddings

Hidden states of the LSTMs are not updated unless a reduce occurs

Actions are not explicitly represented

They only used word2vec embeddings [Mikolov et al., 2013]


Our solution

We propose

Context embeddings should improve parsing accuracy

Dependency relations should be explicitly represented

Morphological Features of a word may enhance parsing accuracy


Tree-stack LSTM Overview

Figure: Tree-stack LSTM overview (σ-LSTM over the stack, β-LSTM over the buffer, an Action-LSTM over past transitions, and a t-RNN combining head word, dependent word and dependency relation; the components' outputs are concatenated and fed to an MLP).

We propose the Tree-stack LSTM model with 4 components:

1. β-LSTM
2. σ-LSTM
3. Action-LSTM
4. Tree-RNN


Tree-stack LSTM

Input Representation


Input Representation

Action and Dependency Relation Embeddings

Every action is represented with a continuous vector

Every dependency relation is represented with a continuous vector


Input Representation

We do not include an explicit feature extractor. We initialize the word representation by concatenating:

Character-based LSTM's word vectors

Word-based BiLSTM's context vectors

Part-of-speech (POS) vectors

Morph-feat vectors

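As a minimal sketch of this concatenation (the dimensions here are illustrative assumptions, not the thesis' actual sizes):

```python
import numpy as np

# Illustrative sketch of the word representation; all sizes are assumptions.
char_vec    = np.zeros(100)  # character-based LSTM word vector
context_vec = np.zeros(200)  # word-based BiLSTM context vector
pos_vec     = np.zeros(32)   # POS embedding
morph_vec   = np.zeros(32)   # morph-feat embedding

# The parser's input for one word is the concatenation of the four pieces.
word_repr = np.concatenate([char_vec, context_vec, pos_vec, morph_vec])
print(word_repr.shape)  # (364,)
```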

Input Representation

Morph-feat Vectors

Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs IT It

Figure: Morph-feat Embeddings

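One hedged sketch of how such a feats string could be turned into a single morph-feat vector; splitting on `|` follows the UD feats format, while `DIM`, the random initialization and the averaging are illustrative assumptions (the thesis' exact composition may differ):

```python
import numpy as np

rng = np.random.default_rng(0)
DIM = 8
feat_emb = {}  # "Feature=Value" -> embedding, grown on demand

def morph_feat_vector(feats):
    # Split "Case=Nom|Gender=Neut|..." into per-feature embeddings and average.
    pairs = feats.split("|")
    vecs = [feat_emb.setdefault(p, rng.normal(size=DIM)) for p in pairs]
    return np.mean(vecs, axis=0)

v = morph_feat_vector("Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs")
print(v.shape)  # (8,)
```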

Tree-stack LSTM

Model Components:
1. β-LSTM
2. σ-LSTM
3. Action-LSTM
4. Tree-RNN


β-LSTM

Figure: Tree-stack LSTM overview, β-LSTM highlighted.

β-LSTM

Figure: Buffer's β-LSTM (an LSTM over the buffer word vectors w_i, w_{i+1}, w_{i+2}).

σ-LSTM

Figure: Tree-stack LSTM overview, σ-LSTM highlighted.

σ-LSTM

Figure: Stack's σ-LSTM (an LSTM over the stack word vectors s_i, s_{i+1}, s_{i+2}).

Action-LSTM

Figure: Tree-stack LSTM overview, Action-LSTM highlighted.

Action-LSTM

Figure: Action-LSTM (an LSTM over the sequence of past transition actions).

How are the components of the tree-stack LSTM connected?


Tree-RNN


Tree-RNN (t-RNN)

Figure: t-RNN combines the head word, dependency relation and dependent word embeddings into a new head embedding.

w_head_new = tanh(W_rnn * [w_head_old; d_l; w_dep] + b_rnn)    (1)
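Equation (1) can be sketched directly in NumPy; `trnn_update` and the dimensions are illustrative, not the thesis' implementation:

```python
import numpy as np

def trnn_update(w_head, d_l, w_dep, W_rnn, b_rnn):
    # Eq. (1): new head embedding from old head, relation and dependent vectors
    x = np.concatenate([w_head, d_l, w_dep])
    return np.tanh(W_rnn @ x + b_rnn)

dim = 4
W = np.zeros((dim, 3 * dim))  # toy parameters, zero-initialized
b = np.zeros(dim)
new_head = trnn_update(np.ones(dim), np.ones(dim), np.ones(dim), W, b)
print(new_head.shape)  # (4,)
```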

Tree-RNN with

1. Left Transition
2. Right Transition

Left Transition


Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: Each embedding is initialized by concatenating POS, language and morph-feat embeddings.
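A minimal sketch of these transitions on a (stack, buffer, arcs) state, following the formulas on these slides; the function names and the toy word indices/labels are illustrative:

```python
def shift(stack, buffer, arcs):
    # Move the buffer front onto the stack.
    return stack + [buffer[0]], buffer[1:], arcs

def left(stack, buffer, arcs, d):
    # left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)}):
    # buffer front b becomes the head of the stack top s.
    s, b = stack[-1], buffer[0]
    return stack[:-1], buffer, arcs | {(b, d, s)}

def right(stack, buffer, arcs, d):
    # right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)}):
    # s becomes the head of the top item t.
    s, t = stack[-2], stack[-1]
    return stack[:-1], buffer, arcs | {(s, d, t)}

# Toy run on words 1 2 3 ("economic news had"): 2 heads 1, 3 heads 2.
state = shift([], [1, 2, 3], set())
state = left(*state, d="amod")
state = shift(*state)
state = left(*state, d="nsubj")
state = shift(*state)
print(sorted(state[2]))  # [(2, 'amod', 1), (3, 'nsubj', 2)]
```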

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: The stack's top LSTM is reduced.

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: t-RNN calculates the new head embedding.

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: β-LSTM recalculates its hidden state based on the new input.

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: Tree-stack LSTM is ready for the next transition.

Right Transition


Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: Each embedding is initialized by concatenating POS, language and morph-feat embeddings.

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: The stack's top LSTM is reduced.

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: t-RNN calculates the new head embedding.

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: σ-LSTM recalculates its hidden state from the new input.

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: Tree-stack LSTM is ready for the next transition.

Final overview of Tree-stack LSTM

Figure: Final Tree-stack LSTM overview: the σ-LSTM, β-LSTM and Action-LSTM hidden states are concatenated and fed to an MLP, while the t-RNN composes head embeddings from dependents and dependency relations.
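The decision step above can be sketched as follows; all names and dimensions are illustrative assumptions, not the thesis' actual sizes:

```python
import numpy as np

# Concatenate the components' hidden states and score transitions with an MLP.
rng = np.random.default_rng(1)

h_sigma  = rng.normal(size=16)   # σ-LSTM hidden state
h_beta   = rng.normal(size=16)   # β-LSTM hidden state
h_action = rng.normal(size=8)    # Action-LSTM hidden state

W1, b1 = rng.normal(size=(32, 40)), np.zeros(32)  # 16+16+8 = 40 inputs
W2, b2 = rng.normal(size=(75, 32)), np.zeros(75)  # e.g. labeled transitions

h = np.concatenate([h_sigma, h_beta, h_action])
scores = W2 @ np.tanh(W1 @ h + b1) + b2
best = int(np.argmax(scores))    # predicted transition id
print(scores.shape)  # (75,)
```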


4. Results & Comparisons


Results & Comparisons

Dataset

CoNLL17: Dependency parsing of 81 treebanks in 49 languages. All treebanks use standardized annotation (17 universal part-of-speech tags, 37 universal dependency relations). Koc-University ranked 7th out of 33 participants (1st among transition based parsers).

CoNLL18: Dependency parsing of 82 treebanks in 57 languages. All treebanks use standardized annotation (17 universal part-of-speech tags, 37 universal dependency relations). Koc-University ranked 16th out of 30 participants (2nd among transition based parsers).

Changes from CoNLL17 to CoNLL18: 1. train/test split change, 2. annotation.

MLP vs Tree-stack LSTM

The CoNLL 2018 committee released comparison results of CoNLL17 and CoNLL18 systems tested on the same test sets.


MLP vs Tree-stack LSTM

2 possible problems with the official comparison:

1. If the annotation of the treebank has improved, the older parser is handicapped.

2. If the training-test split has changed and the old training data are now in the test data, the old parser is favored undeservedly.


MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code        MLP    Tree-stack
ru taiga (10k)   58.89  60.55
hu szeged (20k)  66.21  68.18
tr imst (50k)    56.78  58.75
ar padt (120k)   67.83  68.14
en ewt (205k)    74.87  75.77
cs cac (473k)    83.39  83.57

Tree-stack LSTM outperforms MLP


Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 93 123

MLP Parser

Figure: Initial model (MLP).


Only Action LSTM

Figure: Only the Action-LSTM.

Only β-LSTM

Figure: Only the β-LSTM.

Only σ-LSTM

Figure: Only the σ-LSTM.

Ablation Analysis Results

Lang Code   MLP    Only Action  Only-β  Only-σ
hu szeged   66.21  66.87        66.94   67.03
sv lines    71.12  72.05        72.17   72.45
tr imst     57.12  56.87        57.02   57.12
ar padt     67.83  66.67        66.89   66.92
cs cac      83.89  82.23        83.13   83.17
en ewt      75.54  75.43        75.56   75.67

Table: Comparison between the MLP and "Only" models.


Ablation of t-RNN

Figure: Tree-stack LSTM overview, t-RNN highlighted.

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang Code            without t-RNN  with t-RNN
no nynorsklia (3k)   51.78          53.33
ru taiga (11k)       59.13          60.55
gl treegal (15k)     69.76          70.45
hu szeged (20k)      66.12          68.18
sv lines (49k)       74.04          75.46
tr imst (50k)        58.12          58.75
ar padt (120k)       68.04          68.14
en ewt (204k)        74.87          75.77
cs cac (473k)        82.89          83.57
cs pdt (1M)          81.17          81.16

t-RNN provides a comparative advantage for low-resource languages.


Ablation Analysis

Overall results of ablation analysis

Lang       MLP    Only A  Only-β  Only-σ  w/o t-RNN  all
hu szeged  66.21  66.87   66.94   67.03   66.12      68.18
sv lines   71.12  72.05   72.17   74.04   72.17      75.46
tr imst    57.12  56.87   57.02   57.12   58.12      58.75
ar padt    67.83  66.67   66.89   66.92   68.04      68.14
cs cac     83.89  82.23   83.13   83.17   82.89      83.57
en ewt     75.54  75.43   75.56   75.67   74.87      75.77

Tree-stack LSTM beats other model variations


Ablation Analysis

Conclusions of Ablation Experiments

t-RNN's performance contribution increases as the training size decreases

σ-LSTM provides more useful information, independent of dataset size

Interconnecting the model's components with the t-RNN makes tree-stack LSTM more powerful for low-resource languages (ranked 10th overall and 2nd among transition based parsers)


What does Morphological Feature Embedding provide?


Contribution of Morph-feat Embeddings

Experimental Settings

We divide the CoNLL18 UD dataset (v2.2) into 4 parts, based on the number of training tokens for each language, to better understand our contributions:

Languages having less than 20k tokens

Languages having more than 20k, less than 50k tokens

Languages having more than 50k, less than 100k tokens

Languages having 100k tokens or more

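The four buckets can be expressed as a small helper (thresholds from the slide; the bucket labels are ours):

```python
def size_bucket(n_tokens):
    # Map a language's training-token count to its experiment bucket.
    if n_tokens < 20_000:
        return "<20k"
    if n_tokens < 50_000:
        return "20k-50k"
    if n_tokens < 100_000:
        return "50k-100k"
    return ">=100k"

print(size_bucket(10479), size_bucket(97531), size_bucket(204585))
# <20k 50k-100k >=100k
```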

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having less than 20k training tokens:

Lang code      Morph-Feats  no Morph-Feats  # of tokens
no nynorsklia  51.13        53.33           3583
ru taiga       58.32        60.55           10479
sme giella     52.78        53.39           16385
la perseus     49.93        51.60           18184
ug udt         52.78        53.39           19262
sl sst         46.72        48.77           19473
hu szeged      66.23        68.18           20166

Not useful for languages having less than 20k training tokens.


Contribution of Morph-feat embeddings

Morph-feat experiments for languages having between 50k and 100k training tokens:

Lang code      Morph-Feats  no Morph-Feats  # of tokens
sv lines       72.18        74.81           48325
fr sequoia     84.36        82.17           50543
en gum         76.44        75.34           53686
ko gsd         73.74        72.54           56687
eu bdt         74.55        73.32           72974
nl lassysmall  76.70        75.80           75134
gl ctg         79.02        79.018          79327
lv lvtb        72.33        72.24           80666
id gsd         75.76        73.97           97531

Beneficial for languages with 50k-100k training tokens.


Contribution of Morph-feat embeddings

Morph-feat experiments for languages having more than 100k training tokens:

Lang code  Morph-Feats  no Morph-Feats  # of tokens
fa seraji  81.18        81.12           121064
bg btb     84.53        84.55           124336
en ewt     75.77        75.682          204585
ar padt    68.02        68.14           223881
de gsd     71.59        71.32           263804
ca ancora  85.89        85.874          417587
es ancora  84.99        84.78           444617
cs cac     83.57        83.63           472608
cs pdt     81.43        82.12           1173282

Neutral for languages having more than 100k training tokens.


Static vs Dynamic Oracle Training

Static oracle: transitions follow gold moves. Dynamic oracle: transitions follow predicted moves.

In both cases, log p of the gold moves is maximized.

Figure: Tree-stack LSTM overview.

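A toy contrast of the two regimes, showing only which move advances the parser state; the random "model" and "oracle" here are stand-ins for the real components, and in both regimes the training loss is -log p(gold move):

```python
import random

def rollout(n_steps, dynamic, seed=0):
    rnd = random.Random(seed)
    on_gold_path = 0
    for _ in range(n_steps):
        gold = rnd.randrange(3)          # oracle's correct move (stand-in)
        pred = rnd.randrange(3)          # model's argmax move (stand-in)
        move = pred if dynamic else gold  # which move updates the state
        on_gold_path += (move == gold)
    return on_gold_path

print(rollout(10, dynamic=False))  # 10: static training always follows gold
```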

Static vs Dynamic Oracle Training

Figure: Results are very close for training sets with fewer than 20k tokens.


Static vs Dynamic Oracle Training

Figure: Results are very close for training sets with between 20k and 50k tokens.


Static vs Dynamic Oracle Training

Figure: Results are very close for training sets with more than 50k tokens.


How about languages with less than 20k training tokens?


Transfer Learning

There are 4 possible types of transfer learning:

1. Using very limited data to train an LM for word and context vectors, and using them to train a parser from scratch
2. Using Facebook's word vectors to train a parser [Bojanowski et al., 2017]
3. Using my own word and context vectors trained with a different language, but from the same language family
4. Applying transfer learning with a pre-trained parser

Language      (1)           (2)    (3)    (4)
af afribooms  not provided  75.46  77.43  78.12
kk ktb        20.19         22.31  21.96  23.86
bxr bdt       7.64          9.76   9.93   8.98
kmr mg        20.12         22.57  22.78  23.39

Table: LAS values for strategies (1), (2), (3) and (4).

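Strategy (4) can be sketched as a warm start; the parameter names and values below are purely illustrative stand-ins:

```python
# Initialize a new parser from one pre-trained on a related language,
# then continue training on the low-resource treebank.
pretrained = {"word_emb": [0.1, 0.2], "mlp_w": [0.3, 0.4]}  # stand-in weights
parser     = {"word_emb": [0.0, 0.0], "mlp_w": [0.0, 0.0]}

parser.update(pretrained)   # copy the pre-trained weights (warm start)
# ...fine-tuning on the target language would follow here
print(parser["word_emb"])  # [0.1, 0.2]
```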

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the most beneficial

From-scratch LM training does not yield useful word and context vectors

Our word and context vectors are still more useful than Facebook's [Bojanowski et al., 2017]


Projectivity

Transition-based parsers can only build projective trees.

(Figure from https://stp.lingfil.uu.se/~sara/kurser/5LN455-2014/lectures/5LN455-F8.pdf)

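Projectivity can be checked by looking for crossing arcs; this small helper assumes 1-indexed words with 0 denoting the root:

```python
def is_projective(heads):
    # heads[i-1] = head of word i (0 denotes the root)
    arcs = [(min(h, d), max(h, d)) for d, h in enumerate(heads, start=1)]
    for a, b in arcs:
        for c, e in arcs:
            if a < c < b < e:   # arcs (a,b) and (c,e) cross
                return False
    return True

print(is_projective([2, 0, 2]))     # True
print(is_projective([3, 4, 0, 3]))  # False: arcs (1,3) and (2,4) cross
```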

Projective vs Non-projective

We compared our model with the best model for different projectivityratios

Language Projectiviy Best (LAS) Our (LAS)

grc perseus 907 7939 5503 (20)

eu bdt 9513 8422 7413 (17)

hu szeged 978 8266 6818 (14)

da ddt 9826 8628 7640 (17)

en gum 996 8505 7644 (15)

gl treegal 100 7425 7045 (10)

gl ctg 100 8212 7945 (14)

Table Our models performance gap decreases as the projectivity ratio increases

7

7From official results page and our projectivity tableOmer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusionWe introduced ldquoContext Word and Morph-featrdquo embeddings and showedtheir contribution in transition based dependency parsing

Our Tree-stack LSTM outperformed MLP by removing hand-craftedfeature engineering

Tree-stack LSTM performed better with low resource languages

When the training dataset size increases tree-stack LSTM losses itsadvantage

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization morphological taggingand dependency parsing performed better Some are also jointly trained alanguage model together with pre-trained embeddings

Attention Mechanism

Applying attention in between σ-LSTM states or β-LSTM orAction-LSTM may bring performance improvement

Morphological Features

Finding different way to represent morphological features

Dynamic Oracle vs Beam Training

Although I tried both of them I could not obtain performanceimprovement There may be convergence problems with our loss functionand another losses (CRF) may solve this problem

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap Erenay Dayanık and Deniz Yuret 2018 Tree-stackLSTM in Transition Based Dependency Parsing In Proceedings ofthe CoNLL 2018 Shared Task Multilingual Parsing from Raw Text toUniversal Dependencies

Omer Kırnap Berkay Furkan Onder and Deniz Yuret 2017 Parsingwith Context Embeddings In Proceedings of the CoNLL 2017 SharedTask Multilingual Parsing from Raw Text to Universal Dependencies

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann Carlos Gomez-Rodriguez and Giorgio Satta 2011Dynamic programming algorithms for transition-based dependencyparsers In Proceedings of the 49th Annual Meeting of theAssociation for Computational Linguistics Human LanguageTechnologies-Volume 1 Association for Computational Linguisticspages 673682

S Kbler R McDonald and J Nivre 2009 Dependency parsingMorgan amp Claypool US

Chris Dyer Miguel Ballesteros Wang Ling Austin Matthews andNoah A Smith 2015 Transition based dependency parsing withstack long-short term memory CoRR abs150508075

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123

  • Introduction
    • Overview of Dependency Parsing
    • Transition Based Dependency Parsing
      • Related Work
        • Linear Models and their Drawbacks
        • Neural Network Models
          • Model
            • Language Model
            • MLP Parser
            • Tree-stack LSTM Parser
              • Results
                • MLP vs Tree-stack LSTM
                • Morphological Feature Embeddings
                • Static vs Dynamic Oracle Training
                • Transfer Learning
                  • Conclusion
                  • Future Work amp Discussions
Page 56: Transition Based Dependency Parsing with Deep LearningTransition Based Dependency Parsing with Deep Learning Omer K rnap Ko˘c University okirnap@ku.edu.tr September 27, 2018 Omer

Problems with Stack LSTM

They only modify stackrsquos word embeddings

Hidden states of LSTMS are not updated unless reduce

Actions are not explicitly represented

They only used word2vec embeddings [Mikolov et al 2013]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 56 123

Our solution

We propose

Context embeddings should improve parsing accuracy

Dependency relations should be explicitly represented

Morphological Features of a word may enhance parsing accuracy

Omer Kırnap (Koc University) MSc Thesis September 27 2018 57 123

Tree-stack LSTM Overview

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

We propose Tree-stack LSTM model with 4 components

β-LSTMσ-LSTMAction-LSTMTree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 58 123

Tree-stack LSTM

Input Representation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 59 123

Input Representation

Action and Dependency Relation Embeddings

Every action is represented with continuous vector

Every dependency relation is represented with continuous vector

Omer Kırnap (Koc University) MSc Thesis September 27 2018 60 123

Input Representation

We do not include explicit feature extractor We initiated wordrepresentation by concatenating

Character Based LSTMrsquos word vectors

Word Based BiLSTMrsquos context vectors

Part-of-speech (POS) vectors

Morph-feat vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 61 123

Input Representation

Morp-feat Vectors

Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs IT It

Figure Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 62 123

Tree-stack LSTM

Model Components1 β-LSTM2 σ-LSTM3 Action-LSTM4 Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 63 123

β-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 64 123

β-LSTM

LSTM LSTM LSTM

wi+2wi+1wi

Figure Bufferrsquos β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 65 123

σ-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 66 123

σ-LSTM

LSTM LSTM LSTM

si si+1 si+2

Figure Stackrsquos σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 67 123

Action-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 68 123

Action-LSTM

LSTM LSTM LSTM

Figure Action-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 69 123

How do components of tree-stack LSTM are connected

Omer Kırnap (Koc University) MSc Thesis September 27 2018 70 123

Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 71 123

Tree-RNN (t-RNN)

t-RNN

Dependent word

Dependency Relation

Head word

Figure t-RNN

whead new = tanh(Wrnn lowast [whead old dl wdep] + brnn) (1)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 72 123

Tree-RNN with

1 Left Transition2 Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 73 123

Left Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 74 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM LSTM

Left transition

t-RNN

Dependency Relation

HeadDependent

Figure Each embedding initiated by concatenating POS language andmorph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 75 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

Figure Stackrsquos top LSTM is reducedOmer Kırnap (Koc University) MSc Thesis September 27 2018 76 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 77 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure β-LSTM recalculates its hidden based on new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 78 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure Tree-stack LSTM is ready to give new transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 79 123

Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 80 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure Each embedding initiated by concatenating POS language andmorph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 81 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure Stackrsquos top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 82 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 83 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure σ-LSTM recalculates its hidden from new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 84 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure Tree-stack LSTM is ready to give new transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 85 123

Final overview of Tree-stack LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 86 123

Overview

1 IntroductionOverview of Dependency ParsingTransition Based Dependency Parsing

2 Related WorkLinear Models and their DrawbacksNeural Network Models

3 ModelLanguage ModelMLP ParserTree-stack LSTM Parser

4 ResultsMLP vs Tree-stack LSTMMorphological Feature EmbeddingsStatic vs Dynamic Oracle TrainingTransfer Learning

5 Conclusion6 Future Work amp Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 87 123

4 Results amp Comparisons

Omer Kırnap (Koc University) MSc Thesis September 27 2018 88 123

Results amp Comparisons

Dataset

Dependency parsing of 81

treebanks in 49 languages

All treebanks use standardized

annotation

17 universal

part-of-speech tags

37 universal dependency

relations

Koc-University ranked 7th out

of 33 participants (1st among

transition based parsers)

Dependency parsing of 82

treebanks in 57 languages

All treebanks use

standardized annotation

17 universal

part-of-speech tags

37 universal dependency

relations

Koc-University ranked 16th

out of 30 participants (2nd

among transition based

parsers)

CoNLL17 CoNLL181 Traintest split change 2 Annotation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 89 123

MLP vs Tree-stack LSTM

CoNLL 2018 committee released comparison results of CoNLL17 andCoNLL18 systems tested under the same test sets

Omer Kırnap (Koc University) MSc Thesis September 27 2018 90 123

MLP vs Tree-stack LSTM

2 possible problems of official comparison

1 If the annotation of the tree bank is improved the older parser ishandicapped

2 If the training-test split has changed and the old training data arenow in test data the old parser is favored undeservedly

Omer Kırnap (Koc University) MSc Thesis September 27 2018 91 123

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code MLP Tree-stackru taiga (10k) 5889 6055hu szeged (20k) 6621 6818tr imst (50k) 5678 5875ar padt (120k) 6783 6814en ewt (205k) 7487 7577cs cac (473k) 8339 8357

Tree-stack LSTM outperforms MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 92 123

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 93 123

MLP Parser

MLP

Figure Initial model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 94 123

Only Action LSTM

LSTM LSTM

Figure Only action LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 95 123

Only β-LSTM

LSTM LSTM LSTM

LSTM LSTM MLP

Figure Only β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 96 123

Only σ-LSTM

LSTM LSTM

LSTM LSTM MLP

Figure Only σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang Code MLP Only Action Only-β Only-σhu szeged 6621 6687 6694 6703sv lines 7112 7205 7217 7245tr imst 5712 5687 5702 5712ar padt 6783 6667 6689 6692

cs cac 8389 8223 8313 8317

en ewt 7554 7543 7556 7567

Table Comparison between MLP and rdquoOnlyrdquo models

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 99 123

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang Code without t-RNN with t-RNNno nynorsklia (3k) 5178 5333ru taiga (11k) 5913 6055gl treegal (15k) 6976 7045hu szeged (20k) 6612 6818sv lines (49k) 7404 7546tr imst (50k) 5812 5875

ar padt (120k) 6804 6814

en ewt (204k) 7487 7577

cs cac (473k) 8289 8357

cs pdt (1M) 8117 81164

t-RNN provides comparative advantage for low-resourcelanguages

Omer Kırnap (Koc University) MSc Thesis September 27 2018 100 123

Ablation Analysis

Overall results of ablation analysis

Lang MLP Only A Only-β Only-σ wot-RNN allhu szeged 6621 6687 6694 6703 6612 6818sv lines 7112 7205 7217 7404 7217 7546tr imst 5712 5687 5702 5712 5812 5875ar padt 6783 6667 6689 6692 6804 6814cs cac 8389 8223 8313 8317 8289 8357en ewt 7554 7543 7556 7567 7487 7577

Tree-stack LSTM beats other model variations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of Ablation Experiments

t-RNNrsquos performance contribution increases when the training sizedecreases

σ-LSTM provides more useful information independent from datasetsize

Interconnecting modelrsquos component with t-RNN makes tree-stackLSTM more powerful for low-resource languages (ranked 10th of alland 2nd among transition based parsers)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What do Morphological Feature Embeddings provide?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental Settings
We divide the CoNLL18 UD (v2.2) dataset into 4 parts based on the number of training tokens for each language, to better understand our contributions:

Languages having less than 20k tokens

Languages having more than 20k and less than 50k tokens

Languages having more than 50k and less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123
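As a sketch, the bucketing above can be written as a simple threshold function (the bucket labels are ours, for illustration):

```python
def size_bucket(n_tokens):
    """Assign a treebank to one of the four training-size buckets
    used in the experiments."""
    if n_tokens < 20_000:
        return "<20k"
    if n_tokens < 50_000:
        return "20k-50k"
    if n_tokens < 100_000:
        return "50k-100k"
    return ">=100k"

print(size_bucket(3_583))    # <20k     (e.g. no nynorsklia)
print(size_bucket(48_325))   # 20k-50k  (e.g. sv lines)
print(size_bucket(204_585))  # >=100k   (e.g. en ewt)
```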

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having less than 20k training tokens

Lang code      Morph-Feats  no Morph-Feats  # of tokens
no nynorsklia  51.13        53.33           3,583
ru taiga       58.32        60.55           10,479
sme giella     52.78        53.39           16,385
la perseus     49.93        51.60           18,184
ug udt         52.78        53.39           19,262
sl sst         46.72        48.77           19,473
hu szeged      66.23        68.18           20,166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having between 50k and 100k training tokens

Lang code      Morph-Feats  no Morph-Feats  # of tokens
sv lines       72.18        74.81           48,325
fr sequoia     84.36        82.17           50,543
en gum         76.44        75.34           53,686
ko gsd         73.74        72.54           56,687
eu bdt         74.55        73.32           72,974
nl lassysmall  76.70        75.80           75,134
gl ctg         79.02        79.018          79,327
lv lvtb        72.33        72.24           80,666
id gsd         75.76        73.97           97,531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having more than 100k training tokens

Lang code  Morph-Feats  no Morph-Feats  # of tokens
fa seraji  81.18        81.12           121,064
bg btb     84.53        84.55           124,336
en ewt     75.77        75.68           204,585
ar padt    68.02        68.14           223,881
de gsd     71.59        71.32           263,804
ca ancora  85.89        85.87           417,587
es ancora  84.99        84.78           444,617
cs cac     83.57        83.63           472,608
cs pdt     81.43        82.12           1,173,282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle: transitions follow gold moves.
Dynamic oracle: transitions follow predicted moves.

In both cases, the log-probability of gold moves is maximized.

Figure: Tree-stack LSTM architecture (σ-LSTM, β-LSTM, Action-LSTM, and t-RNN feeding a concatenation layer and MLP)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123
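A minimal sketch of the difference, assuming a scored set of candidate moves (function and variable names here are illustrative, not the thesis's implementation):

```python
def train_step(gold_move, model_scores, dynamic):
    """One training decision, sketched. In both regimes the loss term is
    -log p(gold_move); they differ only in which move is *executed* to
    produce the next parser state."""
    loss = -model_scores[gold_move]  # stand-in for -log p(gold)
    if dynamic:
        # dynamic oracle: follow the model's own prediction
        executed = max(model_scores, key=model_scores.get)
    else:
        # static oracle: follow the gold transition
        executed = gold_move
    return loss, executed

scores = {"shift": -0.2, "left": -1.5, "right": -2.0}  # toy log-probabilities
print(train_step("left", scores, dynamic=False)[1])  # left
print(train_step("left", scores, dynamic=True)[1])   # shift
```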

Static vs Dynamic Oracle Training

Figure: Results are very close for training sets with fewer than 20k tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure: Results are very close for training sets between 20k and 50k tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure: Results are very close for training sets with more than 50k tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning:
1. Using very limited data to train an LM for word and context vectors, and using them to train a parser from scratch
2. Using Facebook's word vectors to train a parser [Bojanowski et al. 2017]
3. Using my own word and context vectors, trained on a different language from the same language family
4. Applying transfer learning with a pre-trained parser

Language      (1)           (2)    (3)    (4)
af afribooms  not provided  75.46  77.43  78.12
kk ktb        20.19         22.31  21.96  23.86
bxr bdt       7.64          9.76   9.93   8.98
kmr mg        20.12         22.57  22.78  23.39

Table: LAS values for strategies (1), (2), (3), and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the most beneficial.

Training an LM from scratch does not yield useful word and context vectors.

Our word and context vectors are still more useful than Facebook's [Bojanowski et al. 2017].

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition-based parsers can only build projective trees. [6]

[6] Figure from https://stp.lingfil.uu.se/~sara/kurser/5LN455-2014/lectures/5LN455-F8.pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123
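Projectivity can be checked by testing whether any two dependency arcs cross; a small sketch (CoNLL-style head indices, 0 for the root):

```python
def is_projective(heads):
    """heads[i] is the head of token i+1 (0 denotes the root), CoNLL-style.
    A dependency tree is projective iff no two arcs cross."""
    arcs = [(min(h, i + 1), max(h, i + 1)) for i, h in enumerate(heads)]
    for a, b in arcs:
        for c, d in arcs:
            if a < c < b < d:  # arcs (a, b) and (c, d) cross
                return False
    return True

print(is_projective([2, 0, 2]))     # True  (root 2 with children 1 and 3)
print(is_projective([3, 4, 0, 3]))  # False (arc 2<-4 crosses arc 1<-3)
```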

Projective vs Non-projective

We compared our model with the best model for different projectivity ratios

Language     Projectivity  Best (LAS)  Our (LAS)
grc perseus  90.7%         79.39       55.03 (rank 20)
eu bdt       95.13%        84.22       74.13 (rank 17)
hu szeged    97.8%         82.66       68.18 (rank 14)
da ddt       98.26%        86.28       76.40 (rank 17)
en gum       99.6%         85.05       76.44 (rank 15)
gl treegal   100%          74.25       70.45 (rank 10)
gl ctg       100%          82.12       79.45 (rank 14)

Table: Our model's performance gap decreases as the projectivity ratio increases. [7]

[7] From the official results page and our projectivity table.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusion:
We introduced "Context, Word, and Morph-feat" embeddings and showed their contribution to transition-based dependency parsing.

Our Tree-stack LSTM outperformed the MLP while removing hand-crafted feature engineering.

Tree-stack LSTM performed better with low resource languages

When the training dataset size increases, tree-stack LSTM loses its advantage.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization, morphological tagging, and dependency parsing performed better. Some also jointly train a language model together with pre-trained embeddings.

Attention Mechanism

Applying attention between σ-LSTM, β-LSTM, or Action-LSTM states may bring performance improvements.

Morphological Features

Finding different ways to represent morphological features.

Dynamic Oracle vs Beam Training

Although I tried both of them, I could not obtain a performance improvement. There may be convergence problems with our loss function, and other losses (e.g., CRF) may solve this problem.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap, Erenay Dayanık, and Deniz Yuret. 2018. Tree-stack LSTM in Transition Based Dependency Parsing. In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap, Berkay Furkan Önder, and Deniz Yuret. 2017. Parsing with Context Embeddings. In Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann, Carlos Gómez-Rodríguez, and Giorgio Satta. 2011. Dynamic programming algorithms for transition-based dependency parsers. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Volume 1. Association for Computational Linguistics, pages 673-682.

S. Kübler, R. McDonald, and J. Nivre. 2009. Dependency Parsing. Morgan & Claypool, US.

Chris Dyer, Miguel Ballesteros, Wang Ling, Austin Matthews, and Noah A. Smith. 2015. Transition-based dependency parsing with stack long short-term memory. CoRR abs/1505.08075.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123


Our solution

We propose

Context embeddings should improve parsing accuracy

Dependency relations should be explicitly represented

Morphological Features of a word may enhance parsing accuracy

Omer Kırnap (Koc University) MSc Thesis September 27 2018 57 123

Tree-stack LSTM Overview

Figure: Tree-stack LSTM architecture: σ-LSTM (stack), β-LSTM (buffer), Action-LSTM, and t-RNN (head word, dependent word, dependency relation), concatenated and fed to an MLP

We propose the Tree-stack LSTM model with 4 components:
1. β-LSTM
2. σ-LSTM
3. Action-LSTM
4. Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 58 123

Tree-stack LSTM

Input Representation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 59 123

Input Representation

Action and Dependency Relation Embeddings

Every action is represented with a continuous vector

Every dependency relation is represented with a continuous vector

Omer Kırnap (Koc University) MSc Thesis September 27 2018 60 123

Input Representation

We do not include an explicit feature extractor. We initialize word representations by concatenating:

Character-based LSTM's word vectors

Word-based BiLSTM's context vectors

Part-of-speech (POS) vectors

Morph-feat vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 61 123
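As a sketch, the concatenation above can be written as follows (all vector sizes are hypothetical, not the thesis's actual hyperparameters):

```python
import numpy as np

rng = np.random.default_rng(0)

def word_representation(char_vec, context_vec, pos_vec, feat_vec):
    """Concatenate the four component vectors into one word representation."""
    return np.concatenate([char_vec, context_vec, pos_vec, feat_vec])

w = word_representation(rng.normal(size=350),   # char-LSTM word vector
                        rng.normal(size=300),   # BiLSTM context vector
                        rng.normal(size=128),   # POS embedding
                        rng.normal(size=128))   # morph-feat embedding
print(w.shape)  # (906,)
```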

Input Representation

Morph-feat Vectors

Figure: Morph-feat embedding for the word "It", with features Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs

Omer Kırnap (Koc University) MSc Thesis September 27 2018 62 123
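A minimal sketch of turning a UD FEATS string into a morph-feat vector; summing per-feature embeddings is an illustrative composition choice, not necessarily the thesis's exact scheme:

```python
import numpy as np

rng = np.random.default_rng(0)
DIM = 32            # hypothetical embedding size
feat_table = {}     # one embedding per feature=value pair, created on demand

def feat_vector(feats):
    """Combine embeddings of the feature=value pairs in a FEATS string
    such as 'Case=Nom|Number=Sing' into one vector (by summation here)."""
    vec = np.zeros(DIM)
    if feats == "_":  # token without morphological features
        return vec
    for pair in feats.split("|"):
        if pair not in feat_table:
            feat_table[pair] = rng.normal(size=DIM)
        vec += feat_table[pair]
    return vec

v = feat_vector("Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs")
print(v.shape, len(feat_table))  # (32,) 5
```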

Tree-stack LSTM

Model Components:
1. β-LSTM
2. σ-LSTM
3. Action-LSTM
4. Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 63 123

β-LSTM

Figure: Tree-stack LSTM architecture (locating the β-LSTM)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 64 123

β-LSTM

Figure: Buffer's β-LSTM, running over the buffer words w_i, w_i+1, w_i+2

Omer Kırnap (Koc University) MSc Thesis September 27 2018 65 123

σ-LSTM

Figure: Tree-stack LSTM architecture (locating the σ-LSTM)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 66 123

σ-LSTM

Figure: Stack's σ-LSTM, running over the stack items s_i, s_i+1, s_i+2

Omer Kırnap (Koc University) MSc Thesis September 27 2018 67 123

Action-LSTM

Figure: Tree-stack LSTM architecture (locating the Action-LSTM)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 68 123

Action-LSTM

Figure: Action-LSTM, running over the sequence of past actions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 69 123

How are the components of tree-stack LSTM connected?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 70 123

Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 71 123

Tree-RNN (t-RNN)

Figure: t-RNN composing the head word, dependency relation, and dependent word

w_head_new = tanh(W_rnn * [w_head_old; d_l; w_dep] + b_rnn)    (1)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 72 123
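Equation (1) can be sketched directly (dimensions and initialization are assumptions for illustration):

```python
import numpy as np

rng = np.random.default_rng(0)
d = 64  # hypothetical embedding size

# Parameters of the composition in Eq. (1); shapes are assumptions.
W_rnn = rng.normal(scale=0.1, size=(d, 3 * d))
b_rnn = np.zeros(d)

def t_rnn(w_head_old, d_l, w_dep):
    """Eq. (1): w_head_new = tanh(W_rnn [w_head_old; d_l; w_dep] + b_rnn)."""
    x = np.concatenate([w_head_old, d_l, w_dep])
    return np.tanh(W_rnn @ x + b_rnn)

new_head = t_rnn(rng.normal(size=d), rng.normal(size=d), rng.normal(size=d))
print(new_head.shape)  # (64,)
```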

Tree-RNN with

1. Left Transition
2. Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 73 123

Left Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 74 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: Each embedding is initialized by concatenating POS, language, and morph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 75 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: Stack's top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 76 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: t-RNN calculates the new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 77 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: β-LSTM recalculates its hidden state based on the new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 78 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: Tree-stack LSTM is ready for the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 79 123

Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 80 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: Each embedding is initialized by concatenating POS, language, and morph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 81 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: Stack's top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 82 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: t-RNN calculates the new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 83 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: σ-LSTM recalculates its hidden state from the new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 84 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: Tree-stack LSTM is ready for the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 85 123
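The left and right transitions above can be sketched as operations on a (stack, buffer, arc-set) state; the labels and word indices in the toy run below are invented for illustration:

```python
def left_arc(stack, buffer, arcs, d):
    """left_d(σ|s, b|β, A) -> (σ, b|β, A ∪ {(b, d, s)}):
    the buffer front b becomes head of the stack top s."""
    s = stack.pop()
    arcs.add((buffer[0], d, s))

def right_arc(stack, buffer, arcs, d):
    """right_d(σ|s|t, β, A) -> (σ|s, β, A ∪ {(s, d, t)}):
    the second stack item s becomes head of the stack top t."""
    t = stack.pop()
    arcs.add((stack[-1], d, t))

def shift(stack, buffer):
    stack.append(buffer.pop(0))

# Tiny worked example on word indices 1..3 (toy sentence, toy labels)
stack, buffer, arcs = [], [1, 2, 3], set()
shift(stack, buffer)                        # σ=[1], β=[2,3]
left_arc(stack, buffer, arcs, "det")        # adds arc (2, det, 1)
shift(stack, buffer); shift(stack, buffer)  # σ=[2,3], β=[]
right_arc(stack, buffer, arcs, "obj")       # adds arc (2, obj, 3)
print(sorted(arcs))  # [(2, 'det', 1), (2, 'obj', 3)]
```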

Final overview of Tree-stack LSTM

Figure: Tree-stack LSTM architecture: σ-LSTM (stack), β-LSTM (buffer), Action-LSTM, and t-RNN (head word, dependent word, dependency relation), concatenated and fed to an MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 86 123


4. Results & Comparisons

Omer Kırnap (Koc University) MSc Thesis September 27 2018 88 123

Results & Comparisons

Dataset

CoNLL17: Dependency parsing of 81 treebanks in 49 languages. All treebanks use standardized annotation: 17 universal part-of-speech tags and 37 universal dependency relations. Koc University ranked 7th out of 33 participants (1st among transition-based parsers).

CoNLL18: Dependency parsing of 82 treebanks in 57 languages. All treebanks use standardized annotation: 17 universal part-of-speech tags and 37 universal dependency relations. Koc University ranked 16th out of 30 participants (2nd among transition-based parsers).

Changes from CoNLL17 to CoNLL18: 1. Train/test split 2. Annotation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 89 123

MLP vs Tree-stack LSTM

The CoNLL 2018 committee released comparison results of CoNLL17 and CoNLL18 systems tested on the same test sets.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 90 123

MLP vs Tree-stack LSTM

Two possible problems with the official comparison:

1. If the annotation of the treebank is improved, the older parser is handicapped.

2. If the training-test split has changed and old training data are now in the test data, the old parser is favored undeservedly.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 91 123

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code        MLP    Tree-stack
ru taiga (10k)   58.89  60.55
hu szeged (20k)  66.21  68.18
tr imst (50k)    56.78  58.75
ar padt (120k)   67.83  68.14
en ewt (205k)    74.87  75.77
cs cac (473k)    83.39  83.57

Tree-stack LSTM outperforms MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 92 123

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 93 123

MLP Parser

Figure: Initial MLP-only model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 94 123

Only Action LSTM

Figure: Only Action-LSTM model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 95 123

Only β-LSTM

Figure: Only β-LSTM model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 96 123

Only σ-LSTM

Figure: Only σ-LSTM model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang Code MLP Only Action Only-β Only-σhu szeged 6621 6687 6694 6703sv lines 7112 7205 7217 7245tr imst 5712 5687 5702 5712ar padt 6783 6667 6689 6692

cs cac 8389 8223 8313 8317

en ewt 7554 7543 7556 7567

Table Comparison between MLP and rdquoOnlyrdquo models

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 99 123

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang Code without t-RNN with t-RNNno nynorsklia (3k) 5178 5333ru taiga (11k) 5913 6055gl treegal (15k) 6976 7045hu szeged (20k) 6612 6818sv lines (49k) 7404 7546tr imst (50k) 5812 5875

ar padt (120k) 6804 6814

en ewt (204k) 7487 7577

cs cac (473k) 8289 8357

cs pdt (1M) 8117 81164

t-RNN provides comparative advantage for low-resourcelanguages

Omer Kırnap (Koc University) MSc Thesis September 27 2018 100 123

Ablation Analysis

Overall results of ablation analysis

Lang MLP Only A Only-β Only-σ wot-RNN allhu szeged 6621 6687 6694 6703 6612 6818sv lines 7112 7205 7217 7404 7217 7546tr imst 5712 5687 5702 5712 5812 5875ar padt 6783 6667 6689 6692 6804 6814cs cac 8389 8223 8313 8317 8289 8357en ewt 7554 7543 7556 7567 7487 7577

Tree-stack LSTM beats other model variations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of Ablation Experiments

t-RNNrsquos performance contribution increases when the training sizedecreases

σ-LSTM provides more useful information independent from datasetsize

Interconnecting modelrsquos component with t-RNN makes tree-stackLSTM more powerful for low-resource languages (ranked 10th of alland 2nd among transition based parsers)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What does Morphological Feature Embedding provide

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental SettingsWe divide Conll18 UD dataset 22 into 4 parts based on number oftraining tokens for each language to better understand our contributions

Languages having less than 20k tokens

Languages having more than 20k less than 50k tokens

Languages having more than 50k less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having less than 20k training tokens

Lang code Morph-Feats no Morph-Feats of tokensno nynorsklia 5113 5333 3583

ru taiga 5832 6055 10479

sme giella 5278 5339 16385

la perseus 4993 516 18184

ug udt 5278 5339 19262

sl sst 4672 4877 19473

hu szeged 6623 6818 20166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having tokens in between 50k and100k

Lang code Morph-Feats no Morph-Feats of tokenssv lines 7218 7481 48325

fr sequoia 8436 8217 50543

en gum 7644 7534 53686

ko gsd 7374 7254 56687

eu bdt 7455 7332 72974

nl lassymal 767 758 75134

gl ctg 7902 79018 79327

lv lvtb 7233 7224 80666

id gsd 7576 7397 97531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having more than 100k trainingtokens

Lang code Morph-Feats no Morph-Feats of tokensfa seraji 8118 8112 121064

bg btb 8453 8455 124336

en ewt 7577 75682 204585

ar padt 6802 6814 223881

de gsd 7159 7132 263804

ca ancora 8589 85874 417587

es ancora 8499 8478 444617

cs cac 8357 8363 472608

cs pdt 8143 8212 1173282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle transitions using gold movesDynamic oracle transitions using predicted moves

In both cases logp of gold moves maximized

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning1 Using very limited data to train LM for word and context vectors and

use them to train a parser from scratch2 Using Facebookrsquos word vectors to train a parser [Bojanowski et al

2017]3 Using my own word and context vectors trained with different

language but from the same language family4 Applying transfer learning with a pre-trained parser

Language (1) (2) (3) (4)af afribooms not provided 7546 7743 7812kk ktb 2019 2231 2196 2386bxr bdt 764 976 993 898

kmr mg 2012 2257 2278 2339

Table LAS values for strategies (1) (2) (3) and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the mostbeneficial

From scratch LM training does not bring useful word and contextvectors

Our word and context vectors are still more useful than Facebookrsquos[Bojanowski et al 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition Based Parser can only build projective trees 6

6Figure fromhttpstplingfiluuse sarakurser5LN455-2014lectures5LN455-F8pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123

Projective vs Non-projective

We compared our model with the best model for different projectivityratios

Language Projectiviy Best (LAS) Our (LAS)

grc perseus 907 7939 5503 (20)

eu bdt 9513 8422 7413 (17)

hu szeged 978 8266 6818 (14)

da ddt 9826 8628 7640 (17)

en gum 996 8505 7644 (15)

gl treegal 100 7425 7045 (10)

gl ctg 100 8212 7945 (14)

Table Our models performance gap decreases as the projectivity ratio increases

7

7From official results page and our projectivity tableOmer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusionWe introduced ldquoContext Word and Morph-featrdquo embeddings and showedtheir contribution in transition based dependency parsing

Our Tree-stack LSTM outperformed MLP by removing hand-craftedfeature engineering

Tree-stack LSTM performed better with low resource languages

When the training dataset size increases tree-stack LSTM losses itsadvantage

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization morphological taggingand dependency parsing performed better Some are also jointly trained alanguage model together with pre-trained embeddings

Attention Mechanism

Applying attention in between σ-LSTM states or β-LSTM orAction-LSTM may bring performance improvement

Morphological Features

Finding different way to represent morphological features

Dynamic Oracle vs Beam Training

Although I tried both of them I could not obtain performanceimprovement There may be convergence problems with our loss functionand another losses (CRF) may solve this problem

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap Erenay Dayanık and Deniz Yuret 2018 Tree-stackLSTM in Transition Based Dependency Parsing In Proceedings ofthe CoNLL 2018 Shared Task Multilingual Parsing from Raw Text toUniversal Dependencies

Omer Kırnap Berkay Furkan Onder and Deniz Yuret 2017 Parsingwith Context Embeddings In Proceedings of the CoNLL 2017 SharedTask Multilingual Parsing from Raw Text to Universal Dependencies

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann Carlos Gomez-Rodriguez and Giorgio Satta 2011Dynamic programming algorithms for transition-based dependencyparsers In Proceedings of the 49th Annual Meeting of theAssociation for Computational Linguistics Human LanguageTechnologies-Volume 1 Association for Computational Linguisticspages 673682

S Kbler R McDonald and J Nivre 2009 Dependency parsingMorgan amp Claypool US

Chris Dyer Miguel Ballesteros Wang Ling Austin Matthews andNoah A Smith 2015 Transition based dependency parsing withstack long-short term memory CoRR abs150508075

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123

  • Introduction
    • Overview of Dependency Parsing
    • Transition Based Dependency Parsing
      • Related Work
        • Linear Models and their Drawbacks
        • Neural Network Models
          • Model
            • Language Model
            • MLP Parser
            • Tree-stack LSTM Parser
              • Results
                • MLP vs Tree-stack LSTM
                • Morphological Feature Embeddings
                • Static vs Dynamic Oracle Training
                • Transfer Learning
                  • Conclusion
                  • Future Work amp Discussions
Page 58: Transition Based Dependency Parsing with Deep LearningTransition Based Dependency Parsing with Deep Learning Omer K rnap Ko˘c University okirnap@ku.edu.tr September 27, 2018 Omer

Tree-stack LSTM Overview

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

We propose Tree-stack LSTM model with 4 components

β-LSTMσ-LSTMAction-LSTMTree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 58 123

Tree-stack LSTM

Input Representation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 59 123

Input Representation

Action and Dependency Relation Embeddings

Every action is represented with continuous vector

Every dependency relation is represented with continuous vector

Omer Kırnap (Koc University) MSc Thesis September 27 2018 60 123

Input Representation

We do not include explicit feature extractor We initiated wordrepresentation by concatenating

Character Based LSTMrsquos word vectors

Word Based BiLSTMrsquos context vectors

Part-of-speech (POS) vectors

Morph-feat vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 61 123

Input Representation

Morp-feat Vectors

Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs IT It

Figure Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 62 123

Tree-stack LSTM

Model Components1 β-LSTM2 σ-LSTM3 Action-LSTM4 Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 63 123

β-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 64 123

β-LSTM

LSTM LSTM LSTM

wi+2wi+1wi

Figure Bufferrsquos β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 65 123

σ-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 66 123

σ-LSTM

LSTM LSTM LSTM

si si+1 si+2

Figure Stackrsquos σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 67 123

Action-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 68 123

Action-LSTM

LSTM LSTM LSTM

Figure Action-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 69 123

How do components of tree-stack LSTM are connected

Omer Kırnap (Koc University) MSc Thesis September 27 2018 70 123

Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 71 123

Tree-RNN (t-RNN)

t-RNN

Dependent word

Dependency Relation

Head word

Figure t-RNN

whead new = tanh(Wrnn lowast [whead old dl wdep] + brnn) (1)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 72 123

Tree-RNN with

1 Left Transition2 Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 73 123

Left Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 74 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM LSTM

Left transition

t-RNN

Dependency Relation

HeadDependent

Figure Each embedding initiated by concatenating POS language andmorph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 75 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

Figure Stackrsquos top LSTM is reducedOmer Kırnap (Koc University) MSc Thesis September 27 2018 76 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 77 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure β-LSTM recalculates its hidden based on new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 78 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure Tree-stack LSTM is ready to give new transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 79 123

Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 80 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure Each embedding initiated by concatenating POS language andmorph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 81 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure Stackrsquos top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 82 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 83 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))


Figure σ-LSTM recalculates its hidden state from the new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 84 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})


Figure Tree-stack LSTM is ready to predict the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 85 123
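The left and right transitions on the preceding slides act on a configuration (stack σ, buffer β, arc set A). A minimal list-based sketch of the two rules; the function names and relation labels are illustrative, not the thesis implementation:

```python
def left_arc(stack, buffer, arcs, d):
    """left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)}):
    the buffer front b becomes the head of the stack top s."""
    s = stack.pop()      # dependent: popped from the stack
    b = buffer[0]        # head: front of the buffer (stays in place)
    arcs.add((b, d, s))

def right_arc(stack, arcs, d):
    """right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)}):
    the second stack item s becomes the head of the stack top t."""
    t = stack.pop()      # dependent: popped stack top
    s = stack[-1]        # head: the new stack top
    arcs.add((s, d, t))

# Toy configuration over word indices 0..3 (labels are made up)
stack, buffer, arcs = [0, 1, 2], [3], set()
left_arc(stack, buffer, arcs, "obj")    # adds (3, "obj", 2)
right_arc(stack, arcs, "flat")          # adds (0, "flat", 1)
```

In both rules the dependent is removed from the stack, which is why the diagrams show the stack's top LSTM being reduced after each transition.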

Final overview of Tree-stack LSTM

(Architecture diagram: the σ-LSTM, β-LSTM, and Action-LSTM hidden states are concatenated and fed to an MLP; the t-RNN composes the head word, dependent word, and dependency relation embeddings)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 86 123
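The overview diagram reduces to a concatenate-then-score step. A numeric sketch with made-up dimensions (the real hidden sizes and layer widths are model hyperparameters from the thesis, not shown here):

```python
import numpy as np

H, N_ACTIONS = 4, 3          # hypothetical hidden size and transition count
rng = np.random.default_rng(0)

# Stand-ins for the final hidden states of the three component LSTMs
h_sigma = rng.standard_normal(H)     # σ-LSTM (stack)
h_beta = rng.standard_normal(H)      # β-LSTM (buffer)
h_action = rng.standard_normal(H)    # Action-LSTM (transition history)

x = np.concatenate([h_sigma, h_beta, h_action])                     # "Concat" box
W1, b1 = rng.standard_normal((8, 3 * H)), np.zeros(8)               # MLP hidden layer
W2, b2 = rng.standard_normal((N_ACTIONS, 8)), np.zeros(N_ACTIONS)   # output layer

scores = W2 @ np.tanh(W1 @ x + b1) + b2   # one score per transition
best_transition = int(np.argmax(scores))  # greedy choice of the next move
```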

Overview

1 Introduction
    Overview of Dependency Parsing
    Transition Based Dependency Parsing

2 Related Work
    Linear Models and their Drawbacks
    Neural Network Models

3 Model
    Language Model
    MLP Parser
    Tree-stack LSTM Parser

4 Results
    MLP vs Tree-stack LSTM
    Morphological Feature Embeddings
    Static vs Dynamic Oracle Training
    Transfer Learning

5 Conclusion

6 Future Work & Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 87 123

4 Results & Comparisons

Omer Kırnap (Koc University) MSc Thesis September 27 2018 88 123

Results & Comparisons

Dataset

CoNLL17
    Dependency parsing of 81 treebanks in 49 languages
    All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations
    Koc-University ranked 7th out of 33 participants (1st among transition based parsers)

CoNLL18
    Dependency parsing of 82 treebanks in 57 languages
    All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations
    Koc-University ranked 16th out of 30 participants (2nd among transition based parsers)

Differences: 1. Train/test split change  2. Annotation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 89 123

MLP vs Tree-stack LSTM

The CoNLL 2018 committee released comparison results of CoNLL17 and CoNLL18 systems tested on the same test sets

Omer Kırnap (Koc University) MSc Thesis September 27 2018 90 123

MLP vs Tree-stack LSTM

Two possible problems with the official comparison:

1 If the annotation of the treebank is improved, the older parser is handicapped

2 If the training-test split has changed and the old training data are now in the test data, the old parser is favored undeservedly

Omer Kırnap (Koc University) MSc Thesis September 27 2018 91 123

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code         MLP    Tree-stack
ru taiga (10k)    58.89  60.55
hu szeged (20k)   66.21  68.18
tr imst (50k)     56.78  58.75
ar padt (120k)    67.83  68.14
en ewt (205k)     74.87  75.77
cs cac (473k)     83.39  83.57

Tree-stack LSTM outperforms MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 92 123

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 93 123

MLP Parser

MLP

Figure Initial model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 94 123

Only Action LSTM


Figure Only action LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 95 123

Only β-LSTM


Figure Only β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 96 123

Only σ-LSTM


Figure Only σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang Code   MLP    Only Action  Only-β  Only-σ
hu szeged   66.21  66.87        66.94   67.03
sv lines    71.12  72.05        72.17   72.45
tr imst     57.12  56.87        57.02   57.12
ar padt     67.83  66.67        66.89   66.92
cs cac      83.89  82.23        83.13   83.17
en ewt      75.54  75.43        75.56   75.67

Table Comparison between MLP and "Only" models

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN


Omer Kırnap (Koc University) MSc Thesis September 27 2018 99 123

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang Code            without t-RNN  with t-RNN
no nynorsklia (3k)   51.78          53.33
ru taiga (11k)       59.13          60.55
gl treegal (15k)     69.76          70.45
hu szeged (20k)      66.12          68.18
sv lines (49k)       74.04          75.46
tr imst (50k)        58.12          58.75
ar padt (120k)       68.04          68.14
en ewt (204k)        74.87          75.77
cs cac (473k)        82.89          83.57
cs pdt (1M)          81.17          81.164

t-RNN provides a comparative advantage for low-resource languages

Omer Kırnap (Koc University) MSc Thesis September 27 2018 100 123

Ablation Analysis

Overall results of ablation analysis

Lang       MLP    Only A  Only-β  Only-σ  w/o t-RNN  all
hu szeged  66.21  66.87   66.94   67.03   66.12      68.18
sv lines   71.12  72.05   72.17   74.04   72.17      75.46
tr imst    57.12  56.87   57.02   57.12   58.12      58.75
ar padt    67.83  66.67   66.89   66.92   68.04      68.14
cs cac     83.89  82.23   83.13   83.17   82.89      83.57
en ewt     75.54  75.43   75.56   75.67   74.87      75.77

Tree-stack LSTM beats other model variations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of Ablation Experiments

t-RNN's performance contribution increases when the training size decreases

σ-LSTM provides more useful information independent of dataset size

Interconnecting the model's components with t-RNN makes tree-stack LSTM more powerful for low-resource languages (ranked 10th overall and 2nd among transition based parsers)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What does Morphological Feature Embedding provide

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental Settings: We divide the CoNLL18 UD 2.2 dataset into 4 parts based on the number of training tokens for each language, to better understand our contributions

Languages having less than 20k tokens

Languages having more than 20k and less than 50k tokens

Languages having more than 50k and less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123
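The four groups above amount to a simple bucketing of treebanks by training-token count. A sketch (boundaries as stated on the slide; the actual group assignments in the tables that follow may differ slightly at the edges):

```python
def size_bucket(n_tokens):
    """Assign a treebank to one of the four groups above by its
    number of training tokens."""
    if n_tokens < 20_000:
        return "<20k"
    if n_tokens < 50_000:
        return "20k-50k"
    if n_tokens < 100_000:
        return "50k-100k"
    return ">=100k"

size_bucket(3_583)     # "<20k"  (e.g. no_nynorsklia)
size_bucket(204_585)   # ">=100k" (e.g. en_ewt)
```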

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having less than 20k training tokens

Lang code      Morph-Feats  no Morph-Feats  # of tokens
no nynorsklia  51.13        53.33           3,583
ru taiga       58.32        60.55           10,479
sme giella     52.78        53.39           16,385
la perseus     49.93        51.6            18,184
ug udt         52.78        53.39           19,262
sl sst         46.72        48.77           19,473
hu szeged      66.23        68.18           20,166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having between 50k and 100k training tokens

Lang code      Morph-Feats  no Morph-Feats  # of tokens
sv lines       72.18        74.81           48,325
fr sequoia     84.36        82.17           50,543
en gum         76.44        75.34           53,686
ko gsd         73.74        72.54           56,687
eu bdt         74.55        73.32           72,974
nl lassysmall  76.7         75.8            75,134
gl ctg         79.02        79.018          79,327
lv lvtb        72.33        72.24           80,666
id gsd         75.76        73.97           97,531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having more than 100k training tokens

Lang code   Morph-Feats  no Morph-Feats  # of tokens
fa seraji   81.18        81.12           121,064
bg btb      84.53        84.55           124,336
en ewt      75.77        75.682          204,585
ar padt     68.02        68.14           223,881
de gsd      71.59        71.32           263,804
ca ancora   85.89        85.874          417,587
es ancora   84.99        84.78           444,617
cs cac      83.57        83.63           472,608
cs pdt      81.43        82.12           1,173,282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle: transitions follow gold moves
Dynamic oracle: transitions follow predicted moves

In both cases, the log-probability of gold moves is maximized


Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123
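The contrast between the two regimes can be sketched as a single decision function; `oracle` (returns the gold move for a configuration) and `model_scores` (the parser's scores per move) are illustrative placeholders, not the thesis API:

```python
def next_transition(config, model_scores, oracle, dynamic):
    gold = oracle(config)  # the loss maximizes log p(gold) in both regimes
    if dynamic:
        # Dynamic oracle: the parser follows its own (possibly wrong)
        # prediction, so training also visits configurations off the gold path
        return max(model_scores, key=model_scores.get)
    # Static oracle: the parser always follows the gold move
    return gold

scores = {"shift": 0.2, "left-arc": 0.7, "right-arc": 0.1}
gold_oracle = lambda config: "shift"
static_move = next_transition(None, scores, gold_oracle, dynamic=False)   # "shift"
dynamic_move = next_transition(None, scores, gold_oracle, dynamic=True)   # "left-arc"
```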

Static vs Dynamic Oracle Training

Figure Results are very close for fewer than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for between 20k and 50k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for more than 50k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning:

1 Using very limited data to train an LM for word and context vectors, and using them to train a parser from scratch
2 Using Facebook's word vectors to train a parser [Bojanowski et al. 2017]
3 Using my own word and context vectors trained on a different language from the same language family
4 Applying transfer learning with a pre-trained parser

Language      (1)           (2)    (3)    (4)
af afribooms  not provided  75.46  77.43  78.12
kk ktb        20.19         22.31  21.96  23.86
bxr bdt       7.64          9.76   9.93   8.98
kmr mg        20.12         22.57  22.78  23.39

Table LAS values for strategies (1), (2), (3) and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the most beneficial

Training an LM from scratch does not yield useful word and context vectors

Our word and context vectors are still more useful than Facebook's [Bojanowski et al. 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition Based Parser can only build projective trees 6

6 Figure from https://stp.lingfil.uu.se/~sara/kurser/5LN455-2014/lectures/5LN455-F8.pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123
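Projectivity can be checked directly: a dependency tree is projective iff no two arcs cross when drawn above the sentence. A small sketch (quadratic in the number of arcs, fine for single sentences):

```python
def is_projective(heads):
    """heads[i] is the head index of word i (index 0 is the artificial
    root, whose own entry is ignored). Returns True iff no two
    dependency arcs cross."""
    arcs = [(min(d, h), max(d, h)) for d, h in enumerate(heads) if d > 0]
    for i, j in arcs:
        for k, l in arcs:
            if i < k < j < l:   # arcs (i, j) and (k, l) cross
                return False
    return True

proj = is_projective([0, 2, 0, 2])      # arcs (1,2), (0,2), (2,3) nest: True
nonproj = is_projective([0, 2, 0, 1])   # arcs (0,2) and (1,3) cross: False
```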

Projective vs Non-projective

We compared our model with the best model for different projectivity ratios

Language     Projectivity  Best (LAS)  Our (LAS)
grc perseus  90.7          79.39       55.03 (20)
eu bdt       95.13         84.22       74.13 (17)
hu szeged    97.8          82.66       68.18 (14)
da ddt       98.26         86.28       76.40 (17)
en gum       99.6          85.05       76.44 (15)
gl treegal   100           74.25       70.45 (10)
gl ctg       100           82.12       79.45 (14)

Table Our model's performance gap decreases as the projectivity ratio increases

7 From the official results page and our projectivity table

Omer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusion: We introduced "Context, Word, and Morph-feat" embeddings and showed their contribution in transition based dependency parsing

Our Tree-stack LSTM outperformed the MLP while removing hand-crafted feature engineering

Tree-stack LSTM performed better with low-resource languages

When the training dataset size increases, tree-stack LSTM loses its advantage

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization, morphological tagging, and dependency parsing performed better. Some also jointly train a language model together with pre-trained embeddings

Attention Mechanism

Applying attention between σ-LSTM, β-LSTM, or Action-LSTM states may bring a performance improvement

Morphological Features

Finding a different way to represent morphological features

Dynamic Oracle vs Beam Training

Although I tried both of them, I could not obtain a performance improvement. There may be convergence problems with our loss function, and other losses (e.g., CRF) may solve this problem

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap, Erenay Dayanık, and Deniz Yuret. 2018. Tree-stack LSTM in Transition Based Dependency Parsing. In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap, Berkay Furkan Onder, and Deniz Yuret. 2017. Parsing with Context Embeddings. In Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann, Carlos Gómez-Rodríguez, and Giorgio Satta. 2011. Dynamic programming algorithms for transition-based dependency parsers. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies - Volume 1. Association for Computational Linguistics, pages 673-682.

S. Kübler, R. McDonald, and J. Nivre. 2009. Dependency Parsing. Morgan & Claypool, US.

Chris Dyer, Miguel Ballesteros, Wang Ling, Austin Matthews, and Noah A. Smith. 2015. Transition-based dependency parsing with stack long short-term memory. CoRR abs/1505.08075.

Piotr Bojanowski, Edouard Grave, Armand Joulin, and Tomas Mikolov. 2017. Enriching word vectors with subword information. Transactions of the Association for Computational Linguistics 5:135-146.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123


Tree-stack LSTM

Input Representation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 59 123

Input Representation

Action and Dependency Relation Embeddings

Every action is represented with a continuous vector

Every dependency relation is represented with a continuous vector

Omer Kırnap (Koc University) MSc Thesis September 27 2018 60 123

Input Representation

We do not include an explicit feature extractor. We initialize the word representation by concatenating:

Character-based LSTM's word vectors

Word-based BiLSTM's context vectors

Part-of-speech (POS) vectors

Morph-feat vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 61 123
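The concatenation above is a one-liner; the dimensions below are illustrative only (the real sizes are hyperparameters of the thesis models):

```python
import numpy as np

char_lstm_vec = np.zeros(350)    # character-based LSTM word vector
context_vec = np.zeros(300)      # word-based BiLSTM context vector
pos_vec = np.zeros(128)          # part-of-speech embedding
morph_feat_vec = np.zeros(128)   # morph-feat embedding

# The word representation is the concatenation of the four pieces
word_repr = np.concatenate([char_lstm_vec, context_vec, pos_vec, morph_feat_vec])
```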

Input Representation

Morph-feat Vectors

Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs (for the word "It")

Figure Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 62 123
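One way to embed such a UD feature string is to look up a vector per Feature=Value pair and combine them. This is an illustrative scheme, not necessarily the exact one used in the thesis:

```python
import numpy as np

DIM = 8                         # hypothetical embedding size
rng = np.random.default_rng(1)
feat_table = {}                 # Feature=Value -> vector, grown on demand

def morph_feat_vector(feat_string):
    """Sum the embeddings of the individual Feature=Value pairs."""
    pairs = feat_string.split("|")
    for p in pairs:
        feat_table.setdefault(p, rng.standard_normal(DIM))
    return np.sum([feat_table[p] for p in pairs], axis=0)

v = morph_feat_vector("Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs")
```

Summing (rather than concatenating) keeps the vector size fixed regardless of how many features a word carries.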

Tree-stack LSTM

Model Components:
1 β-LSTM
2 σ-LSTM
3 Action-LSTM
4 Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 63 123

β-LSTM


Omer Kırnap (Koc University) MSc Thesis September 27 2018 64 123

β-LSTM

Figure Buffer's β-LSTM: an LSTM over the buffer words w_i, w_i+1, w_i+2

Omer Kırnap (Koc University) MSc Thesis September 27 2018 65 123

σ-LSTM


Omer Kırnap (Koc University) MSc Thesis September 27 2018 66 123

σ-LSTM

Figure Stack's σ-LSTM: an LSTM over the stack words s_i, s_i+1, s_i+2

Omer Kırnap (Koc University) MSc Thesis September 27 2018 67 123

Action-LSTM


Omer Kırnap (Koc University) MSc Thesis September 27 2018 68 123

Action-LSTM

Figure Action-LSTM: an LSTM over the history of parser actions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 69 123

How are the components of tree-stack LSTM connected?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 70 123

Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 71 123

Tree-RNN (t-RNN)

Figure t-RNN: composes the head word, dependent word, and dependency relation embeddings

w_head_new = tanh(W_rnn · [w_head_old; d_l; w_dep] + b_rnn)   (1)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 72 123
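Equation (1) written as code, with illustrative dimensions (the real embedding size is a model hyperparameter):

```python
import numpy as np

D = 4                            # hypothetical embedding size
rng = np.random.default_rng(2)
W_rnn = rng.standard_normal((D, 3 * D))
b_rnn = np.zeros(D)

def t_rnn(w_head_old, d_l, w_dep):
    """w_head_new = tanh(W_rnn · [w_head_old; d_l; w_dep] + b_rnn)"""
    x = np.concatenate([w_head_old, d_l, w_dep])
    return np.tanh(W_rnn @ x + b_rnn)

w_new = t_rnn(rng.standard_normal(D),   # old head embedding
              rng.standard_normal(D),   # dependency relation embedding
              rng.standard_normal(D))   # dependent embedding
```

After each left/right transition, the head's embedding is replaced by this composed vector, so the head accumulates information from its attached dependents.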

Tree-RNN with

1 Left Transition
2 Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 73 123

Left Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 74 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM LSTM

Left transition

t-RNN

Dependency Relation

HeadDependent

Figure Each embedding initiated by concatenating POS language andmorph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 75 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

Figure Stackrsquos top LSTM is reducedOmer Kırnap (Koc University) MSc Thesis September 27 2018 76 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 77 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure β-LSTM recalculates its hidden based on new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 78 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure Tree-stack LSTM is ready to give new transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 79 123

Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 80 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure Each embedding initiated by concatenating POS language andmorph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 81 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure Stackrsquos top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 82 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 83 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure σ-LSTM recalculates its hidden from new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 84 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure Tree-stack LSTM is ready to give new transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 85 123

Final overview of Tree-stack LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 86 123

Overview

1 IntroductionOverview of Dependency ParsingTransition Based Dependency Parsing

2 Related WorkLinear Models and their DrawbacksNeural Network Models

3 ModelLanguage ModelMLP ParserTree-stack LSTM Parser

4 ResultsMLP vs Tree-stack LSTMMorphological Feature EmbeddingsStatic vs Dynamic Oracle TrainingTransfer Learning

5 Conclusion6 Future Work amp Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 87 123

4 Results amp Comparisons

Omer Kırnap (Koc University) MSc Thesis September 27 2018 88 123

Results amp Comparisons

Dataset

Dependency parsing of 81

treebanks in 49 languages

All treebanks use standardized

annotation

17 universal

part-of-speech tags

37 universal dependency

relations

Koc-University ranked 7th out

of 33 participants (1st among

transition based parsers)

Dependency parsing of 82

treebanks in 57 languages

All treebanks use

standardized annotation

17 universal

part-of-speech tags

37 universal dependency

relations

Koc-University ranked 16th

out of 30 participants (2nd

among transition based

parsers)

CoNLL17 CoNLL181 Traintest split change 2 Annotation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 89 123

MLP vs Tree-stack LSTM

CoNLL 2018 committee released comparison results of CoNLL17 andCoNLL18 systems tested under the same test sets

Omer Kırnap (Koc University) MSc Thesis September 27 2018 90 123

MLP vs Tree-stack LSTM

2 possible problems of official comparison

1 If the annotation of the tree bank is improved the older parser ishandicapped

2 If the training-test split has changed and the old training data arenow in test data the old parser is favored undeservedly

Omer Kırnap (Koc University) MSc Thesis September 27 2018 91 123

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code MLP Tree-stackru taiga (10k) 5889 6055hu szeged (20k) 6621 6818tr imst (50k) 5678 5875ar padt (120k) 6783 6814en ewt (205k) 7487 7577cs cac (473k) 8339 8357

Tree-stack LSTM outperforms MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 92 123

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 93 123

MLP Parser

MLP

Figure Initial model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 94 123

Only Action LSTM

LSTM LSTM

Figure Only action LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 95 123

Only β-LSTM

LSTM LSTM LSTM

LSTM LSTM MLP

Figure Only β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 96 123

Only σ-LSTM

LSTM LSTM

LSTM LSTM MLP

Figure Only σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang Code MLP Only Action Only-β Only-σhu szeged 6621 6687 6694 6703sv lines 7112 7205 7217 7245tr imst 5712 5687 5702 5712ar padt 6783 6667 6689 6692

cs cac 8389 8223 8313 8317

en ewt 7554 7543 7556 7567

Table Comparison between MLP and rdquoOnlyrdquo models

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 99 123

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang Code without t-RNN with t-RNNno nynorsklia (3k) 5178 5333ru taiga (11k) 5913 6055gl treegal (15k) 6976 7045hu szeged (20k) 6612 6818sv lines (49k) 7404 7546tr imst (50k) 5812 5875

ar padt (120k) 6804 6814

en ewt (204k) 7487 7577

cs cac (473k) 8289 8357

cs pdt (1M) 8117 81164

t-RNN provides comparative advantage for low-resourcelanguages

Omer Kırnap (Koc University) MSc Thesis September 27 2018 100 123

Ablation Analysis

Overall results of ablation analysis

Lang MLP Only A Only-β Only-σ wot-RNN allhu szeged 6621 6687 6694 6703 6612 6818sv lines 7112 7205 7217 7404 7217 7546tr imst 5712 5687 5702 5712 5812 5875ar padt 6783 6667 6689 6692 6804 6814cs cac 8389 8223 8313 8317 8289 8357en ewt 7554 7543 7556 7567 7487 7577

Tree-stack LSTM beats other model variations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of Ablation Experiments

t-RNNrsquos performance contribution increases when the training sizedecreases

σ-LSTM provides more useful information independent from datasetsize

Interconnecting modelrsquos component with t-RNN makes tree-stackLSTM more powerful for low-resource languages (ranked 10th of alland 2nd among transition based parsers)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What does Morphological Feature Embedding provide

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental SettingsWe divide Conll18 UD dataset 22 into 4 parts based on number oftraining tokens for each language to better understand our contributions

Languages having less than 20k tokens

Languages having more than 20k less than 50k tokens

Languages having more than 50k less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having less than 20k training tokens

Lang code Morph-Feats no Morph-Feats of tokensno nynorsklia 5113 5333 3583

ru taiga 5832 6055 10479

sme giella 5278 5339 16385

la perseus 4993 516 18184

ug udt 5278 5339 19262

sl sst 4672 4877 19473

hu szeged 6623 6818 20166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having tokens in between 50k and100k

Lang code Morph-Feats no Morph-Feats of tokenssv lines 7218 7481 48325

fr sequoia 8436 8217 50543

en gum 7644 7534 53686

ko gsd 7374 7254 56687

eu bdt 7455 7332 72974

nl lassymal 767 758 75134

gl ctg 7902 79018 79327

lv lvtb 7233 7224 80666

id gsd 7576 7397 97531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having more than 100k trainingtokens

Lang code Morph-Feats no Morph-Feats of tokensfa seraji 8118 8112 121064

bg btb 8453 8455 124336

en ewt 7577 75682 204585

ar padt 6802 6814 223881

de gsd 7159 7132 263804

ca ancora 8589 85874 417587

es ancora 8499 8478 444617

cs cac 8357 8363 472608

cs pdt 8143 8212 1173282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle transitions using gold movesDynamic oracle transitions using predicted moves

In both cases logp of gold moves maximized

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning1 Using very limited data to train LM for word and context vectors and

use them to train a parser from scratch2 Using Facebookrsquos word vectors to train a parser [Bojanowski et al

2017]3 Using my own word and context vectors trained with different

language but from the same language family4 Applying transfer learning with a pre-trained parser

Language (1) (2) (3) (4)af afribooms not provided 7546 7743 7812kk ktb 2019 2231 2196 2386bxr bdt 764 976 993 898

kmr mg 2012 2257 2278 2339

Table LAS values for strategies (1) (2) (3) and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the mostbeneficial

From scratch LM training does not bring useful word and contextvectors

Our word and context vectors are still more useful than Facebookrsquos[Bojanowski et al 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition Based Parser can only build projective trees 6

6Figure fromhttpstplingfiluuse sarakurser5LN455-2014lectures5LN455-F8pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123

Projective vs Non-projective

We compared our model with the best model for different projectivityratios

Language Projectiviy Best (LAS) Our (LAS)

grc perseus 907 7939 5503 (20)

eu bdt 9513 8422 7413 (17)

hu szeged 978 8266 6818 (14)

da ddt 9826 8628 7640 (17)

en gum 996 8505 7644 (15)

gl treegal 100 7425 7045 (10)

gl ctg 100 8212 7945 (14)

Table Our models performance gap decreases as the projectivity ratio increases

7

7From official results page and our projectivity tableOmer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusionWe introduced ldquoContext Word and Morph-featrdquo embeddings and showedtheir contribution in transition based dependency parsing

Our Tree-stack LSTM outperformed MLP by removing hand-craftedfeature engineering

Tree-stack LSTM performed better with low resource languages

When the training dataset size increases tree-stack LSTM losses itsadvantage

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization morphological taggingand dependency parsing performed better Some are also jointly trained alanguage model together with pre-trained embeddings

Attention Mechanism

Applying attention in between σ-LSTM states or β-LSTM orAction-LSTM may bring performance improvement

Morphological Features

Finding different way to represent morphological features

Dynamic Oracle vs Beam Training

Although I tried both of them I could not obtain performanceimprovement There may be convergence problems with our loss functionand another losses (CRF) may solve this problem

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap Erenay Dayanık and Deniz Yuret 2018 Tree-stackLSTM in Transition Based Dependency Parsing In Proceedings ofthe CoNLL 2018 Shared Task Multilingual Parsing from Raw Text toUniversal Dependencies

Omer Kırnap Berkay Furkan Onder and Deniz Yuret 2017 Parsingwith Context Embeddings In Proceedings of the CoNLL 2017 SharedTask Multilingual Parsing from Raw Text to Universal Dependencies

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann Carlos Gomez-Rodriguez and Giorgio Satta 2011Dynamic programming algorithms for transition-based dependencyparsers In Proceedings of the 49th Annual Meeting of theAssociation for Computational Linguistics Human LanguageTechnologies-Volume 1 Association for Computational Linguisticspages 673682

S Kbler R McDonald and J Nivre 2009 Dependency parsingMorgan amp Claypool US

Chris Dyer Miguel Ballesteros Wang Ling Austin Matthews andNoah A Smith 2015 Transition based dependency parsing withstack long-short term memory CoRR abs150508075

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123

  • Introduction
    • Overview of Dependency Parsing
    • Transition Based Dependency Parsing
      • Related Work
        • Linear Models and their Drawbacks
        • Neural Network Models
          • Model
            • Language Model
            • MLP Parser
            • Tree-stack LSTM Parser
              • Results
                • MLP vs Tree-stack LSTM
                • Morphological Feature Embeddings
                • Static vs Dynamic Oracle Training
                • Transfer Learning
                  • Conclusion
                  • Future Work amp Discussions
Page 60: Transition Based Dependency Parsing with Deep LearningTransition Based Dependency Parsing with Deep Learning Omer K rnap Ko˘c University okirnap@ku.edu.tr September 27, 2018 Omer

Input Representation

Action and Dependency Relation Embeddings

Every action is represented with continuous vector

Every dependency relation is represented with continuous vector

Omer Kırnap (Koc University) MSc Thesis September 27 2018 60 123

Input Representation

We do not include explicit feature extractor We initiated wordrepresentation by concatenating

Character-based LSTM's word vectors

Word-based BiLSTM's context vectors

Part-of-speech (POS) vectors

Morph-feat vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 61 123
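The concatenation above can be sketched as follows; the dimensions and function name are hypothetical placeholders, not the configuration used in the thesis:

```python
import numpy as np

# Hypothetical vector sizes for the four input components.
D_CHAR, D_CTX, D_POS, D_FEAT = 100, 200, 32, 32

def word_representation(char_lstm_vec, context_vec, pos_vec, feat_vec):
    """Concatenate the four component vectors into one word representation."""
    return np.concatenate([char_lstm_vec, context_vec, pos_vec, feat_vec])

x = word_representation(np.zeros(D_CHAR), np.zeros(D_CTX),
                        np.zeros(D_POS), np.zeros(D_FEAT))
assert x.shape == (D_CHAR + D_CTX + D_POS + D_FEAT,)
```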

Input Representation

Morph-feat Vectors

Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs IT It

Figure Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 62 123
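One plausible way to realize such an embedding is to look up a vector per key=value feature of the FEATS string and pool them. Averaging, the lookup table, and the dimension here are illustrative assumptions, not necessarily the thesis's scheme:

```python
import numpy as np

rng = np.random.default_rng(0)
D = 32                       # hypothetical embedding size
feat_table = {}              # one vector per key=value feature, grown on demand

def morph_feat_embedding(feat_string):
    """Embed 'Case=Nom|Gender=Neut|...' by averaging per-feature vectors."""
    vecs = []
    for feat in feat_string.split("|"):
        if feat not in feat_table:
            feat_table[feat] = rng.normal(size=D)
        vecs.append(feat_table[feat])
    return np.mean(vecs, axis=0)

v = morph_feat_embedding("Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs")
assert v.shape == (D,)
```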

Tree-stack LSTM

Model Components:
1. β-LSTM
2. σ-LSTM
3. Action-LSTM
4. Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 63 123

β-LSTM

Figure Overall architecture: the β-, σ-, and Action-LSTM outputs and the t-RNN (head word, dependent word, dependency relation) are concatenated and fed to an MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 64 123

β-LSTM

Figure Buffer's β-LSTM over w_i, w_i+1, w_i+2

Omer Kırnap (Koc University) MSc Thesis September 27 2018 65 123

σ-LSTM

Figure Overall architecture: the β-, σ-, and Action-LSTM outputs and the t-RNN (head word, dependent word, dependency relation) are concatenated and fed to an MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 66 123

σ-LSTM

Figure Stack's σ-LSTM over s_i, s_i+1, s_i+2

Omer Kırnap (Koc University) MSc Thesis September 27 2018 67 123

Action-LSTM

Figure Overall architecture: the β-, σ-, and Action-LSTM outputs and the t-RNN (head word, dependent word, dependency relation) are concatenated and fed to an MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 68 123

Action-LSTM

Figure Action-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 69 123

How are the components of tree-stack LSTM connected?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 70 123

Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 71 123

Tree-RNN (t-RNN)

Figure t-RNN combining the head word, dependency relation, and dependent word

w_head_new = tanh(W_rnn · [w_head_old; d_l; w_dep] + b_rnn)    (1)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 72 123
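Eq. (1) can be sketched directly. W_rnn and b_rnn would be learned parameters in the real model; the dimension and random initialization here are hypothetical:

```python
import numpy as np

D = 64  # hypothetical embedding size
rng = np.random.default_rng(0)
W_rnn = rng.normal(scale=0.1, size=(D, 3 * D))  # learned in practice
b_rnn = np.zeros(D)

def t_rnn(w_head_old, d_l, w_dep):
    """Eq. (1): compose head, relation, and dependent into a new head embedding."""
    x = np.concatenate([w_head_old, d_l, w_dep])
    return np.tanh(W_rnn @ x + b_rnn)

h = t_rnn(np.ones(D), np.ones(D), np.ones(D))
assert h.shape == (D,) and np.all(np.abs(h) <= 1.0)
```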

Tree-RNN with

1. Left Transition
2. Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 73 123

Left Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 74 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure Each embedding is initialized by concatenating POS, language, and morph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 75 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure The stack's top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 76 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure t-RNN calculates the new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 77 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure β-LSTM recalculates its hidden state based on the new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 78 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure Tree-stack LSTM is ready to predict the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 79 123

Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 80 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure Each embedding is initialized by concatenating POS, language, and morph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 81 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure The stack's top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 82 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure t-RNN calculates the new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 83 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure σ-LSTM recalculates its hidden state from the new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 84 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure Tree-stack LSTM is ready to predict the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 85 123
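The left and right transitions above can be sketched on a (stack, buffer, arcs) state; integer word positions stand in for the full embeddings, and the relation labels in the toy run are arbitrary examples:

```python
def left_arc(stack, buffer, arcs, d):
    """left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)}):
    the buffer front b becomes head of the stack top s."""
    s = stack.pop()
    arcs.add((buffer[0], d, s))

def right_arc(stack, buffer, arcs, d):
    """right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)}):
    the second stack item s becomes head of the stack top t."""
    t = stack.pop()
    arcs.add((stack[-1], d, t))

# Toy run: integers stand for word positions.
stack, buffer, arcs = [0, 1], [2, 3], set()
left_arc(stack, buffer, arcs, "nsubj")   # adds arc 2 -nsubj-> 1
stack.append(2)
right_arc(stack, buffer, arcs, "obj")    # adds arc 0 -obj-> 2
```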

Final overview of Tree-stack LSTM

Figure Overall architecture: the β-, σ-, and Action-LSTM outputs and the t-RNN (head word, dependent word, dependency relation) are concatenated and fed to an MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 86 123

Overview

1. Introduction: Overview of Dependency Parsing; Transition Based Dependency Parsing

2. Related Work: Linear Models and their Drawbacks; Neural Network Models

3. Model: Language Model; MLP Parser; Tree-stack LSTM Parser

4. Results: MLP vs Tree-stack LSTM; Morphological Feature Embeddings; Static vs Dynamic Oracle Training; Transfer Learning

5. Conclusion

6. Future Work & Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 87 123

4. Results & Comparisons

Omer Kırnap (Koc University) MSc Thesis September 27 2018 88 123

Results amp Comparisons

Dataset

CoNLL17: dependency parsing of 81 treebanks in 49 languages. All treebanks use standardized annotation: 17 universal part-of-speech tags and 37 universal dependency relations. Koc University ranked 7th out of 33 participants (1st among transition based parsers).

CoNLL18: dependency parsing of 82 treebanks in 57 languages, with the same standardized annotation. Koc University ranked 16th out of 30 participants (2nd among transition based parsers).

Changes from CoNLL17 to CoNLL18: 1. Train/test split 2. Annotation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 89 123

MLP vs Tree-stack LSTM

The CoNLL 2018 committee released comparison results of CoNLL17 and CoNLL18 systems tested on the same test sets.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 90 123

MLP vs Tree-stack LSTM

2 possible problems of official comparison

1. If the annotation of the treebank is improved, the older parser is handicapped.

2. If the training-test split has changed and the old training data are now in the test data, the old parser is favored undeservedly.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 91 123

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code         MLP    Tree-stack
ru taiga (10k)    58.89  60.55
hu szeged (20k)   66.21  68.18
tr imst (50k)     56.78  58.75
ar padt (120k)    67.83  68.14
en ewt (205k)     74.87  75.77
cs cac (473k)     83.39  83.57

Tree-stack LSTM outperforms MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 92 123

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 93 123

MLP Parser

Figure Initial model (MLP)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 94 123

Only Action LSTM

Figure Only Action-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 95 123

Only β-LSTM

Figure Only β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 96 123

Only σ-LSTM

Figure Only σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang Code   MLP    Only-Action  Only-β  Only-σ
hu szeged   66.21  66.87        66.94   67.03
sv lines    71.12  72.05        72.17   72.45
tr imst     57.12  56.87        57.02   57.12
ar padt     67.83  66.67        66.89   66.92
cs cac      83.89  82.23        83.13   83.17
en ewt      75.54  75.43        75.56   75.67

Table Comparison between MLP and "Only" models

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN

Figure Overall architecture: the β-, σ-, and Action-LSTM outputs and the t-RNN (head word, dependent word, dependency relation) are concatenated and fed to an MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 99 123

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang Code           without t-RNN  with t-RNN
no nynorsklia (3k)  51.78          53.33
ru taiga (11k)      59.13          60.55
gl treegal (15k)    69.76          70.45
hu szeged (20k)     66.12          68.18
sv lines (49k)      74.04          75.46
tr imst (50k)       58.12          58.75
ar padt (120k)      68.04          68.14
en ewt (204k)       74.87          75.77
cs cac (473k)       82.89          83.57
cs pdt (1M)         81.17          81.16

t-RNN provides a comparative advantage for low-resource languages

Omer Kırnap (Koc University) MSc Thesis September 27 2018 100 123

Ablation Analysis

Overall results of ablation analysis

Lang        MLP    Only-A  Only-β  Only-σ  w/o t-RNN  all
hu szeged   66.21  66.87   66.94   67.03   66.12      68.18
sv lines    71.12  72.05   72.17   74.04   72.17      75.46
tr imst     57.12  56.87   57.02   57.12   58.12      58.75
ar padt     67.83  66.67   66.89   66.92   68.04      68.14
cs cac      83.89  82.23   83.13   83.17   82.89      83.57
en ewt      75.54  75.43   75.56   75.67   74.87      75.77

Tree-stack LSTM beats other model variations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of Ablation Experiments

t-RNN's performance contribution increases as the training size decreases

σ-LSTM provides more useful information independent of dataset size

Interconnecting the model's components with t-RNN makes tree-stack LSTM more powerful for low-resource languages (ranked 10th overall and 2nd among transition based parsers)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What does Morphological Feature Embedding provide

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental Settings: We divide the CoNLL18 UD dataset into 4 parts based on the number of training tokens for each language, to better understand our contributions:

Languages having less than 20k tokens

Languages having more than 20k, less than 50k tokens

Languages having more than 50k, less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having less than 20k training tokens

Lang code      Morph-Feats  no Morph-Feats  # of tokens
no nynorsklia  51.13        53.33           3,583
ru taiga       58.32        60.55           10,479
sme giella     52.78        53.39           16,385
la perseus     49.93        51.60           18,184
ug udt         52.78        53.39           19,262
sl sst         46.72        48.77           19,473
hu szeged      66.23        68.18           20,166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having between 50k and 100k training tokens

Lang code      Morph-Feats  no Morph-Feats  # of tokens
sv lines       72.18        74.81           48,325
fr sequoia     84.36        82.17           50,543
en gum         76.44        75.34           53,686
ko gsd         73.74        72.54           56,687
eu bdt         74.55        73.32           72,974
nl lassysmall  76.70        75.80           75,134
gl ctg         79.02        79.018          79,327
lv lvtb        72.33        72.24           80,666
id gsd         75.76        73.97           97,531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having more than 100k training tokens

Lang code   Morph-Feats  no Morph-Feats  # of tokens
fa seraji   81.18        81.12           121,064
bg btb      84.53        84.55           124,336
en ewt      75.77        75.682          204,585
ar padt     68.02        68.14           223,881
de gsd      71.59        71.32           263,804
ca ancora   85.89        85.874          417,587
es ancora   84.99        84.78           444,617
cs cac      83.57        83.63           472,608
cs pdt      81.43        82.12           1,173,282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle: transitions follow gold moves.
Dynamic oracle: transitions follow predicted moves.

In both cases, log p of the gold moves is maximized.

Figure Overall architecture: the β-, σ-, and Action-LSTM outputs and the t-RNN (head word, dependent word, dependency relation) are concatenated and fed to an MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123
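A minimal sketch of the difference, assuming hypothetical score_moves and apply_move helpers: the loss term is identical in both regimes, only the move the parser actually follows changes:

```python
import random

def train_step(state, gold_oracle, score_moves, apply_move,
               dynamic=False, p_explore=0.1):
    """One training step. The loss always maximizes log p of the gold move;
    the oracles differ only in which move the parser follows afterwards."""
    gold = gold_oracle(state)
    scores = score_moves(state)            # hypothetical: move -> log-probability
    loss = -scores[gold]                   # maximize log p(gold move)
    if dynamic and random.random() < p_explore:
        move = max(scores, key=scores.get) # dynamic oracle: follow the prediction
    else:
        move = gold                        # static oracle: follow the gold move
    return apply_move(state, move), loss
```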

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning:

1. Using very limited data to train an LM for word and context vectors, and using them to train a parser from scratch
2. Using Facebook's word vectors to train a parser [Bojanowski et al. 2017]
3. Using my own word and context vectors trained on a different language from the same language family
4. Applying transfer learning with a pre-trained parser

Language      (1)           (2)    (3)    (4)
af afribooms  not provided  75.46  77.43  78.12
kk ktb        20.19         22.31  21.96  23.86
bxr bdt       7.64          9.76   9.93   8.98
kmr mg        20.12         22.57  22.78  23.39

Table LAS values for strategies (1), (2), (3), and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the most beneficial.

Training the LM from scratch does not yield useful word and context vectors.

Our word and context vectors are still more useful than Facebook's [Bojanowski et al. 2017].

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition based parsers can only build projective trees. 6

6 Figure from https://stp.lingfil.uu.se/~sara/kurser/5LN455-2014/lectures/5LN455-F8.pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123

Projective vs Non-projective

We compared our model with the best model for different projectivity ratios.

Language     Projectivity (%)  Best (LAS)  Our (LAS)
grc perseus  90.7              79.39       55.03 (20)
eu bdt       95.13             84.22       74.13 (17)
hu szeged    97.8              82.66       68.18 (14)
da ddt       98.26             86.28       76.40 (17)
en gum       99.6              85.05       76.44 (15)
gl treegal   100               74.25       70.45 (10)
gl ctg       100               82.12       79.45 (14)

Table Our model's performance gap decreases as the projectivity ratio increases 7

7 From the official results page and our projectivity table

Omer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusion: we introduced "Context, Word, and Morph-feat" embeddings and showed their contribution to transition based dependency parsing.

Our Tree-stack LSTM outperformed the MLP while removing hand-crafted feature engineering.

Tree-stack LSTM performed better on low-resource languages.

As the training dataset size increases, tree-stack LSTM loses its advantage.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization, morphological tagging, and dependency parsing performed better. Some also jointly train a language model together with pre-trained embeddings.

Attention Mechanism

Applying attention between σ-LSTM, β-LSTM, or Action-LSTM states may bring performance improvements.

Morphological Features

Finding a different way to represent morphological features.

Dynamic Oracle vs Beam Training

Although I tried both of them, I could not obtain a performance improvement. There may be convergence problems with our loss function, and other losses (e.g., CRF) may solve this problem.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap, Erenay Dayanık, and Deniz Yuret. 2018. Tree-stack LSTM in Transition Based Dependency Parsing. In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap, Berkay Furkan Onder, and Deniz Yuret. 2017. Parsing with Context Embeddings. In Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann, Carlos Gómez-Rodríguez, and Giorgio Satta. 2011. Dynamic programming algorithms for transition-based dependency parsers. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Volume 1. Association for Computational Linguistics, pages 673-682.

S. Kübler, R. McDonald, and J. Nivre. 2009. Dependency Parsing. Morgan & Claypool, US.

Chris Dyer, Miguel Ballesteros, Wang Ling, Austin Matthews, and Noah A. Smith. 2015. Transition-based dependency parsing with stack long short-term memory. CoRR abs/1505.08075.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123

Page 61: Transition Based Dependency Parsing with Deep LearningTransition Based Dependency Parsing with Deep Learning Omer K rnap Ko˘c University okirnap@ku.edu.tr September 27, 2018 Omer

Input Representation

We do not include explicit feature extractor We initiated wordrepresentation by concatenating

Character Based LSTMrsquos word vectors

Word Based BiLSTMrsquos context vectors

Part-of-speech (POS) vectors

Morph-feat vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 61 123

Input Representation

Morp-feat Vectors

Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs IT It

Figure Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 62 123

Tree-stack LSTM

Model Components1 β-LSTM2 σ-LSTM3 Action-LSTM4 Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 63 123

β-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 64 123

β-LSTM

LSTM LSTM LSTM

wi+2wi+1wi

Figure Bufferrsquos β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 65 123

σ-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 66 123

σ-LSTM

LSTM LSTM LSTM

si si+1 si+2

Figure Stackrsquos σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 67 123

Action-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 68 123

Action-LSTM

LSTM LSTM LSTM

Figure Action-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 69 123

How do components of tree-stack LSTM are connected

Omer Kırnap (Koc University) MSc Thesis September 27 2018 70 123

Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 71 123

Tree-RNN (t-RNN)

t-RNN

Dependent word

Dependency Relation

Head word

Figure t-RNN

whead new = tanh(Wrnn lowast [whead old dl wdep] + brnn) (1)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 72 123

Tree-RNN with

1 Left Transition2 Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 73 123

Left Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 74 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM LSTM

Left transition

t-RNN

Dependency Relation

HeadDependent

Figure Each embedding initiated by concatenating POS language andmorph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 75 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

Figure Stackrsquos top LSTM is reducedOmer Kırnap (Koc University) MSc Thesis September 27 2018 76 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 77 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure β-LSTM recalculates its hidden based on new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 78 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure Tree-stack LSTM is ready to give new transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 79 123

Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 80 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure Each embedding initiated by concatenating POS language andmorph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 81 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure Stackrsquos top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 82 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 83 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure σ-LSTM recalculates its hidden from new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 84 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure Tree-stack LSTM is ready to give new transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 85 123

Final overview of Tree-stack LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 86 123

Overview

1 IntroductionOverview of Dependency ParsingTransition Based Dependency Parsing

2 Related WorkLinear Models and their DrawbacksNeural Network Models

3 ModelLanguage ModelMLP ParserTree-stack LSTM Parser

4 ResultsMLP vs Tree-stack LSTMMorphological Feature EmbeddingsStatic vs Dynamic Oracle TrainingTransfer Learning

5 Conclusion6 Future Work amp Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 87 123

4 Results amp Comparisons

Omer Kırnap (Koc University) MSc Thesis September 27 2018 88 123

Results amp Comparisons

Dataset

Dependency parsing of 81

treebanks in 49 languages

All treebanks use standardized

annotation

17 universal

part-of-speech tags

37 universal dependency

relations

Koc-University ranked 7th out

of 33 participants (1st among

transition based parsers)

Dependency parsing of 82

treebanks in 57 languages

All treebanks use

standardized annotation

17 universal

part-of-speech tags

37 universal dependency

relations

Koc-University ranked 16th

out of 30 participants (2nd

among transition based

parsers)

CoNLL17 CoNLL181 Traintest split change 2 Annotation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 89 123

MLP vs Tree-stack LSTM

CoNLL 2018 committee released comparison results of CoNLL17 andCoNLL18 systems tested under the same test sets

Omer Kırnap (Koc University) MSc Thesis September 27 2018 90 123

MLP vs Tree-stack LSTM

2 possible problems of official comparison

1 If the annotation of the tree bank is improved the older parser ishandicapped

2 If the training-test split has changed and the old training data arenow in test data the old parser is favored undeservedly

Omer Kırnap (Koc University) MSc Thesis September 27 2018 91 123

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code MLP Tree-stackru taiga (10k) 5889 6055hu szeged (20k) 6621 6818tr imst (50k) 5678 5875ar padt (120k) 6783 6814en ewt (205k) 7487 7577cs cac (473k) 8339 8357

Tree-stack LSTM outperforms MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 92 123

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 93 123

MLP Parser

MLP

Figure Initial model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 94 123

Only Action LSTM

LSTM LSTM

Figure Only action LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 95 123

Only β-LSTM

LSTM LSTM LSTM

LSTM LSTM MLP

Figure Only β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 96 123

Only σ-LSTM

LSTM LSTM

LSTM LSTM MLP

Figure Only σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang Code MLP Only Action Only-β Only-σhu szeged 6621 6687 6694 6703sv lines 7112 7205 7217 7245tr imst 5712 5687 5702 5712ar padt 6783 6667 6689 6692

cs cac 8389 8223 8313 8317

en ewt 7554 7543 7556 7567

Table Comparison between MLP and rdquoOnlyrdquo models

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 99 123

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang Code without t-RNN with t-RNNno nynorsklia (3k) 5178 5333ru taiga (11k) 5913 6055gl treegal (15k) 6976 7045hu szeged (20k) 6612 6818sv lines (49k) 7404 7546tr imst (50k) 5812 5875

ar padt (120k) 6804 6814

en ewt (204k) 7487 7577

cs cac (473k) 8289 8357

cs pdt (1M) 8117 81164

t-RNN provides comparative advantage for low-resourcelanguages

Omer Kırnap (Koc University) MSc Thesis September 27 2018 100 123

Ablation Analysis

Overall results of ablation analysis

Lang MLP Only A Only-β Only-σ wot-RNN allhu szeged 6621 6687 6694 6703 6612 6818sv lines 7112 7205 7217 7404 7217 7546tr imst 5712 5687 5702 5712 5812 5875ar padt 6783 6667 6689 6692 6804 6814cs cac 8389 8223 8313 8317 8289 8357en ewt 7554 7543 7556 7567 7487 7577

Tree-stack LSTM beats other model variations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of Ablation Experiments

t-RNNrsquos performance contribution increases when the training sizedecreases

σ-LSTM provides more useful information independent from datasetsize

Interconnecting modelrsquos component with t-RNN makes tree-stackLSTM more powerful for low-resource languages (ranked 10th of alland 2nd among transition based parsers)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What does Morphological Feature Embedding provide

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental SettingsWe divide Conll18 UD dataset 22 into 4 parts based on number oftraining tokens for each language to better understand our contributions

Languages having less than 20k tokens

Languages having more than 20k less than 50k tokens

Languages having more than 50k less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having less than 20k training tokens

Lang code Morph-Feats no Morph-Feats of tokensno nynorsklia 5113 5333 3583

ru taiga 5832 6055 10479

sme giella 5278 5339 16385

la perseus 4993 516 18184

ug udt 5278 5339 19262

sl sst 4672 4877 19473

hu szeged 6623 6818 20166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having tokens in between 50k and100k

Lang code Morph-Feats no Morph-Feats of tokenssv lines 7218 7481 48325

fr sequoia 8436 8217 50543

en gum 7644 7534 53686

ko gsd 7374 7254 56687

eu bdt 7455 7332 72974

nl lassymal 767 758 75134

gl ctg 7902 79018 79327

lv lvtb 7233 7224 80666

id gsd 7576 7397 97531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having more than 100k trainingtokens

Lang code Morph-Feats no Morph-Feats of tokensfa seraji 8118 8112 121064

bg btb 8453 8455 124336

en ewt 7577 75682 204585

ar padt 6802 6814 223881

de gsd 7159 7132 263804

ca ancora 8589 85874 417587

es ancora 8499 8478 444617

cs cac 8357 8363 472608

cs pdt 8143 8212 1173282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle transitions using gold movesDynamic oracle transitions using predicted moves

In both cases logp of gold moves maximized

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning1 Using very limited data to train LM for word and context vectors and

use them to train a parser from scratch2 Using Facebookrsquos word vectors to train a parser [Bojanowski et al

2017]3 Using my own word and context vectors trained with different

language but from the same language family4 Applying transfer learning with a pre-trained parser

Language (1) (2) (3) (4)af afribooms not provided 7546 7743 7812kk ktb 2019 2231 2196 2386bxr bdt 764 976 993 898

kmr mg 2012 2257 2278 2339

Table LAS values for strategies (1) (2) (3) and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the mostbeneficial

From scratch LM training does not bring useful word and contextvectors

Our word and context vectors are still more useful than Facebookrsquos[Bojanowski et al 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition Based Parser can only build projective trees 6

6Figure fromhttpstplingfiluuse sarakurser5LN455-2014lectures5LN455-F8pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123

Projective vs Non-projective

We compared our model with the best model for different projectivity ratios.

Language     Projectivity  Best (LAS)  Our (LAS)
grc perseus  90.7          79.39       55.03 (20)
eu bdt       95.13         84.22       74.13 (17)
hu szeged    97.8          82.66       68.18 (14)
da ddt       98.26         86.28       76.40 (17)
en gum       99.6          85.05       76.44 (15)
gl treegal   100           74.25       70.45 (10)
gl ctg       100           82.12       79.45 (14)

Table: Our model's performance gap decreases as the projectivity ratio increases. 7

7From the official results page and our projectivity table
Omer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusion:

We introduced "Context, Word and Morph-feat" embeddings and showed their contribution in transition based dependency parsing.

Our Tree-stack LSTM outperformed the MLP by removing hand-crafted feature engineering.

Tree-stack LSTM performed better with low-resource languages.

When the training dataset size increases, tree-stack LSTM loses its advantage.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization, morphological tagging, and dependency parsing performed better. Some also jointly train a language model together with pre-trained embeddings.

Attention Mechanism

Applying attention between the σ-LSTM, β-LSTM, or Action-LSTM states may bring a performance improvement.

Morphological Features

Finding a different way to represent morphological features.

Dynamic Oracle vs Beam Training

Although I tried both of them, I could not obtain a performance improvement. There may be convergence problems with our loss function, and other losses (e.g. CRF) may solve this problem.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap, Erenay Dayanık, and Deniz Yuret. 2018. Tree-stack LSTM in Transition Based Dependency Parsing. In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap, Berkay Furkan Onder, and Deniz Yuret. 2017. Parsing with Context Embeddings. In Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann, Carlos Gomez-Rodriguez, and Giorgio Satta. 2011. Dynamic programming algorithms for transition-based dependency parsers. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies - Volume 1. Association for Computational Linguistics, pages 673-682.

S. Kübler, R. McDonald, and J. Nivre. 2009. Dependency Parsing. Morgan & Claypool, US.

Chris Dyer, Miguel Ballesteros, Wang Ling, Austin Matthews, and Noah A. Smith. 2015. Transition-based dependency parsing with stack long short-term memory. CoRR abs/1505.08075.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123

Input Representation

Morph-feat Vectors

Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs IT It

Figure Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 62 123
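A FEATS string like the one above can be mapped to a single vector by embedding each key=value pair and combining them. A minimal sketch, where the summation, the on-demand table, the dimension, and the names are my illustrative assumptions, not the thesis implementation:

```python
import numpy as np

rng = np.random.default_rng(0)
FEAT_DIM = 8        # illustrative embedding size
feat_table = {}     # hypothetical embedding table: one vector per key=value pair

def morph_feat_vector(feats):
    """Embed a CoNLL-U FEATS string such as
    'Case=Nom|Gender=Neut|Number=Sing' as the sum of the embeddings
    of its key=value pairs ('_' means no features)."""
    vec = np.zeros(FEAT_DIM)
    if feats == "_":
        return vec
    for pair in feats.split("|"):
        if pair not in feat_table:          # grow the table on first sight
            feat_table[pair] = rng.normal(size=FEAT_DIM)
        vec += feat_table[pair]
    return vec
```

Because the pairs are summed, the representation is invariant to the order in which features are listed; whether to sum or concatenate per-feature vectors is a design choice.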

Tree-stack LSTM

Model Components:
1 β-LSTM
2 σ-LSTM
3 Action-LSTM
4 Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 63 123

β-LSTM

[Figure: the β-LSTM highlighted within the full Tree-stack LSTM architecture — the σ-, β-, and Action-LSTM outputs are concatenated and fed to an MLP; the t-RNN composes head word, dependent word, and dependency relation]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 64 123

β-LSTM

[Figure: Buffer's β-LSTM — an LSTM over the upcoming buffer words w_i, w_{i+1}, w_{i+2}]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 65 123

σ-LSTM

[Figure: the σ-LSTM highlighted within the full Tree-stack LSTM architecture]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 66 123

σ-LSTM

[Figure: Stack's σ-LSTM — an LSTM over the stack items s_i, s_{i+1}, s_{i+2}]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 67 123

Action-LSTM

[Figure: the Action-LSTM highlighted within the full Tree-stack LSTM architecture]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 68 123

Action-LSTM

[Figure: Action-LSTM — an LSTM over the sequence of past parser actions]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 69 123

How are the components of tree-stack LSTM connected?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 70 123

Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 71 123

Tree-RNN (t-RNN)

t-RNN

Dependent word

Dependency Relation

Head word

Figure t-RNN

w_head_new = tanh(W_rnn ∗ [w_head_old ; d_l ; w_dep] + b_rnn)    (1)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 72 123
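Equation (1) can be sketched in a few lines; the dimensions, the initialization, and the names W_rnn / b_rnn matching the slide are illustrative assumptions rather than the thesis hyperparameters:

```python
import numpy as np

D, R = 6, 3                       # word / relation embedding sizes (illustrative)
rng = np.random.default_rng(1)
W_rnn = rng.normal(scale=0.1, size=(D, 2 * D + R))  # projects [head; rel; dep]
b_rnn = np.zeros(D)

def t_rnn(w_head_old, d_l, w_dep):
    """Equation (1): fold a new dependent (with its relation embedding d_l)
    into the head embedding, producing w_head_new."""
    x = np.concatenate([w_head_old, d_l, w_dep])
    return np.tanh(W_rnn @ x + b_rnn)
```

The tanh keeps the recomputed head embedding in the same range as the inputs, so the t-RNN can be applied repeatedly as the head collects more dependents.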

Tree-RNN with

1 Left Transition
2 Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 73 123

Left Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 74 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: Each embedding is initialized by concatenating POS, language, and morph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 75 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: Stack's top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 76 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: t-RNN calculates the new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 77 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: β-LSTM recalculates its hidden state based on the new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 78 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: Tree-stack LSTM is ready for the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 79 123

Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 80 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: Each embedding is initialized by concatenating POS, language, and morph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 81 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: Stack's top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 82 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: t-RNN calculates the new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 83 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: σ-LSTM recalculates its hidden state from the new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 84 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: Tree-stack LSTM is ready for the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 85 123
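The two transitions walked through above reduce to simple stack/buffer operations. A minimal sketch that tracks only token indices and the arc set, omitting the t-RNN and LSTM recalculations shown in the figures (the function names are mine, not the thesis's):

```python
def left_arc(stack, buffer, arcs, d):
    """left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)}):
    pop the stack top s and attach it as a d-dependent of the buffer front b."""
    s = stack.pop()
    arcs.append((buffer[0], d, s))

def right_arc(stack, buffer, arcs, d):
    """right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)}):
    pop the stack top t and attach it as a d-dependent of the new top s."""
    t = stack.pop()
    arcs.append((stack[-1], d, t))

def shift(stack, buffer):
    """Move the buffer front onto the stack."""
    stack.append(buffer.pop(0))
```

In the full model each arc addition would also trigger the t-RNN head update of Equation (1) and a recalculation of the affected σ- or β-LSTM hidden state.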

Final overview of Tree-stack LSTM

[Figure: final overview of the Tree-stack LSTM — the σ-, β-, and Action-LSTM outputs are concatenated and fed to an MLP; the t-RNN composes head word, dependent word, and dependency relation]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 86 123

Overview

1 Introduction
Overview of Dependency Parsing
Transition Based Dependency Parsing

2 Related Work
Linear Models and their Drawbacks
Neural Network Models

3 Model
Language Model
MLP Parser
Tree-stack LSTM Parser

4 Results
MLP vs Tree-stack LSTM
Morphological Feature Embeddings
Static vs Dynamic Oracle Training
Transfer Learning

5 Conclusion

6 Future Work & Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 87 123

4 Results amp Comparisons

Omer Kırnap (Koc University) MSc Thesis September 27 2018 88 123

Results amp Comparisons

Dataset

CoNLL17:

Dependency parsing of 81 treebanks in 49 languages

All treebanks use standardized annotation

17 universal part-of-speech tags

37 universal dependency relations

Koc-University ranked 7th out of 33 participants (1st among transition based parsers)

CoNLL18:

Dependency parsing of 82 treebanks in 57 languages

All treebanks use standardized annotation

17 universal part-of-speech tags

37 universal dependency relations

Koc-University ranked 16th out of 30 participants (2nd among transition based parsers)

Differences between CoNLL17 and CoNLL18: 1. Train/test split change 2. Annotation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 89 123

MLP vs Tree-stack LSTM

The CoNLL 2018 committee released comparison results of CoNLL17 and CoNLL18 systems tested on the same test sets.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 90 123

MLP vs Tree-stack LSTM

Two possible problems with the official comparison:

1 If the annotation of the treebank is improved, the older parser is handicapped.

2 If the training-test split has changed and the old training data are now in the test data, the old parser is favored undeservedly.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 91 123

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code        MLP    Tree-stack
ru taiga (10k)   58.89  60.55
hu szeged (20k)  66.21  68.18
tr imst (50k)    56.78  58.75
ar padt (120k)   67.83  68.14
en ewt (205k)    74.87  75.77
cs cac (473k)    83.39  83.57

Tree-stack LSTM outperforms MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 92 123

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 93 123

MLP Parser


Figure Initial model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 94 123

Only Action LSTM


Figure Only action LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 95 123

Only β-LSTM


Figure Only β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 96 123

Only σ-LSTM


Figure Only σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang Code  MLP    Only Action  Only-β  Only-σ
hu szeged  66.21  66.87        66.94   67.03
sv lines   71.12  72.05        72.17   72.45
tr imst    57.12  56.87        57.02   57.12
ar padt    67.83  66.67        66.89   66.92
cs cac     83.89  82.23        83.13   83.17
en ewt     75.54  75.43        75.56   75.67

Table: Comparison between MLP and "Only" models

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN

[Figure: the t-RNN highlighted within the full Tree-stack LSTM architecture]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 99 123

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang Code           without t-RNN  with t-RNN
no nynorsklia (3k)  51.78          53.33
ru taiga (11k)      59.13          60.55
gl treegal (15k)    69.76          70.45
hu szeged (20k)     66.12          68.18
sv lines (49k)      74.04          75.46
tr imst (50k)       58.12          58.75
ar padt (120k)      68.04          68.14
en ewt (204k)       74.87          75.77
cs cac (473k)       82.89          83.57
cs pdt (1M)         81.17          81.164

t-RNN provides a comparative advantage for low-resource languages

Omer Kırnap (Koc University) MSc Thesis September 27 2018 100 123

Ablation Analysis

Overall results of ablation analysis

Lang       MLP    Only A  Only-β  Only-σ  w/o t-RNN  all
hu szeged  66.21  66.87   66.94   67.03   66.12      68.18
sv lines   71.12  72.05   72.17   74.04   72.17      75.46
tr imst    57.12  56.87   57.02   57.12   58.12      58.75
ar padt    67.83  66.67   66.89   66.92   68.04      68.14
cs cac     83.89  82.23   83.13   83.17   82.89      83.57
en ewt     75.54  75.43   75.56   75.67   74.87      75.77

Tree-stack LSTM beats other model variations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of Ablation Experiments

t-RNN's performance contribution increases as the training size decreases.

σ-LSTM provides more useful information independent of dataset size.

Interconnecting the model's components with the t-RNN makes tree-stack LSTM more powerful for low-resource languages (ranked 10th overall and 2nd among transition based parsers).

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What does the Morphological Feature Embedding provide?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental Settings

We divide the CoNLL18 UD dataset into 4 parts based on the number of training tokens for each language, to better understand our contributions:

Languages having less than 20k tokens

Languages having more than 20k, less than 50k tokens

Languages having more than 50k, less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123
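The four-way split above can be written down directly (the function name is mine, and the bucket labels are just shorthand for the categories listed on the slide):

```python
def size_bucket(n_train_tokens):
    """Assign a treebank to one of the four training-size buckets."""
    if n_train_tokens < 20_000:
        return "<20k"
    if n_train_tokens < 50_000:
        return "20k-50k"
    if n_train_tokens < 100_000:
        return "50k-100k"
    return ">=100k"
```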

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having less than 20k training tokens:

Lang code      Morph-Feats  no Morph-Feats  # of tokens
no nynorsklia  51.13        53.33           3583
ru taiga       58.32        60.55           10479
sme giella     52.78        53.39           16385
la perseus     49.93        51.6            18184
ug udt         52.78        53.39           19262
sl sst         46.72        48.77           19473
hu szeged      66.23        68.18           20166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having tokens between 50k and 100k:

Lang code      Morph-Feats  no Morph-Feats  # of tokens
sv lines       72.18        74.81           48325
fr sequoia     84.36        82.17           50543
en gum         76.44        75.34           53686
ko gsd         73.74        72.54           56687
eu bdt         74.55        73.32           72974
nl lassysmall  76.7         75.8            75134
gl ctg         79.02        79.018          79327
lv lvtb        72.33        72.24           80666
id gsd         75.76        73.97           97531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having more than 100k training tokens:

Lang code  Morph-Feats  no Morph-Feats  # of tokens
fa seraji  81.18        81.12           121064
bg btb     84.53        84.55           124336
en ewt     75.77        75.682          204585
ar padt    68.02        68.14           223881
de gsd     71.59        71.32           263804
ca ancora  85.89        85.874          417587
es ancora  84.99        84.78           444617
cs cac     83.57        83.63           472608
cs pdt     81.43        82.12           1173282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle: transitions follow the gold moves
Dynamic oracle: transitions follow the predicted moves

In both cases, log p of the gold moves is maximized.

[Figure: the full Tree-stack LSTM architecture]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123
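The difference between the two regimes can be sketched with a toy state and scorer (all class and function names here are illustrative toys, not the thesis code): the loss term is identical in both, and only the move the parser follows changes.

```python
class ToyState:
    """Stand-in parser state: just counts applied transitions."""
    def __init__(self, n_steps):
        self.n_steps, self.done = n_steps, 0
    def is_final(self):
        return self.done >= self.n_steps
    def apply(self, move):
        self.done += 1

def train_sentence(state, model_scores, gold_moves, dynamic=False):
    """Accumulate -log p(gold move); follow the gold move (static oracle)
    or the model's own prediction (dynamic oracle)."""
    loss, followed = 0.0, []
    while not state.is_final():
        scores = model_scores(state)          # log-probabilities over moves
        gold = gold_moves(state)              # zero-cost moves in this state
        loss -= max(scores[m] for m in gold)  # maximize log p of gold moves
        if dynamic:
            move = max(scores, key=scores.get)          # parser's own choice
        else:
            move = max(gold, key=lambda m: scores[m])   # best gold move
        followed.append(move)
        state.apply(move)
    return loss, followed
```

With a real dynamic oracle, gold_moves would recompute the zero-cost moves from the (possibly erroneous) current state, which is what lets the parser learn to recover from its own mistakes.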

Static vs Dynamic Oracle Training

Figure: Results are very close for languages with less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning1 Using very limited data to train LM for word and context vectors and

use them to train a parser from scratch2 Using Facebookrsquos word vectors to train a parser [Bojanowski et al

2017]3 Using my own word and context vectors trained with different

language but from the same language family4 Applying transfer learning with a pre-trained parser

Language (1) (2) (3) (4)af afribooms not provided 7546 7743 7812kk ktb 2019 2231 2196 2386bxr bdt 764 976 993 898

kmr mg 2012 2257 2278 2339

Table LAS values for strategies (1) (2) (3) and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the mostbeneficial

From scratch LM training does not bring useful word and contextvectors

Our word and context vectors are still more useful than Facebookrsquos[Bojanowski et al 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition Based Parser can only build projective trees 6

6Figure fromhttpstplingfiluuse sarakurser5LN455-2014lectures5LN455-F8pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123

Projective vs Non-projective

We compared our model with the best model for different projectivityratios

Language Projectiviy Best (LAS) Our (LAS)

grc perseus 907 7939 5503 (20)

eu bdt 9513 8422 7413 (17)

hu szeged 978 8266 6818 (14)

da ddt 9826 8628 7640 (17)

en gum 996 8505 7644 (15)

gl treegal 100 7425 7045 (10)

gl ctg 100 8212 7945 (14)

Table Our models performance gap decreases as the projectivity ratio increases

7

7From official results page and our projectivity tableOmer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusionWe introduced ldquoContext Word and Morph-featrdquo embeddings and showedtheir contribution in transition based dependency parsing

Our Tree-stack LSTM outperformed MLP by removing hand-craftedfeature engineering

Tree-stack LSTM performed better with low resource languages

When the training dataset size increases tree-stack LSTM losses itsadvantage

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization morphological taggingand dependency parsing performed better Some are also jointly trained alanguage model together with pre-trained embeddings

Attention Mechanism

Applying attention in between σ-LSTM states or β-LSTM orAction-LSTM may bring performance improvement

Morphological Features

Finding different way to represent morphological features

Dynamic Oracle vs Beam Training

Although I tried both of them I could not obtain performanceimprovement There may be convergence problems with our loss functionand another losses (CRF) may solve this problem

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap Erenay Dayanık and Deniz Yuret 2018 Tree-stackLSTM in Transition Based Dependency Parsing In Proceedings ofthe CoNLL 2018 Shared Task Multilingual Parsing from Raw Text toUniversal Dependencies

Omer Kırnap Berkay Furkan Onder and Deniz Yuret 2017 Parsingwith Context Embeddings In Proceedings of the CoNLL 2017 SharedTask Multilingual Parsing from Raw Text to Universal Dependencies

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann Carlos Gomez-Rodriguez and Giorgio Satta 2011Dynamic programming algorithms for transition-based dependencyparsers In Proceedings of the 49th Annual Meeting of theAssociation for Computational Linguistics Human LanguageTechnologies-Volume 1 Association for Computational Linguisticspages 673682

S Kbler R McDonald and J Nivre 2009 Dependency parsingMorgan amp Claypool US

Chris Dyer Miguel Ballesteros Wang Ling Austin Matthews andNoah A Smith 2015 Transition based dependency parsing withstack long-short term memory CoRR abs150508075

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123

  • Introduction
    • Overview of Dependency Parsing
    • Transition Based Dependency Parsing
      • Related Work
        • Linear Models and their Drawbacks
        • Neural Network Models
          • Model
            • Language Model
            • MLP Parser
            • Tree-stack LSTM Parser
              • Results
                • MLP vs Tree-stack LSTM
                • Morphological Feature Embeddings
                • Static vs Dynamic Oracle Training
                • Transfer Learning
                  • Conclusion
                  • Future Work amp Discussions
Page 63: Transition Based Dependency Parsing with Deep LearningTransition Based Dependency Parsing with Deep Learning Omer K rnap Ko˘c University okirnap@ku.edu.tr September 27, 2018 Omer

Tree-stack LSTM

Model Components1 β-LSTM2 σ-LSTM3 Action-LSTM4 Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 63 123

β-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 64 123

β-LSTM

LSTM LSTM LSTM

wi+2wi+1wi

Figure Bufferrsquos β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 65 123

σ-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 66 123

σ-LSTM

LSTM LSTM LSTM

si si+1 si+2

Figure Stackrsquos σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 67 123

Action-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 68 123

Action-LSTM

LSTM LSTM LSTM

Figure Action-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 69 123

How do components of tree-stack LSTM are connected

Omer Kırnap (Koc University) MSc Thesis September 27 2018 70 123

Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 71 123

Tree-RNN (t-RNN)

t-RNN

Dependent word

Dependency Relation

Head word

Figure t-RNN

whead new = tanh(Wrnn lowast [whead old dl wdep] + brnn) (1)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 72 123

Tree-RNN with

1 Left Transition2 Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 73 123

Left Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 74 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM LSTM

Left transition

t-RNN

Dependency Relation

HeadDependent

Figure Each embedding initiated by concatenating POS language andmorph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 75 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

Figure Stackrsquos top LSTM is reducedOmer Kırnap (Koc University) MSc Thesis September 27 2018 76 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 77 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure β-LSTM recalculates its hidden based on new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 78 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure Tree-stack LSTM is ready to give new transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 79 123

Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 80 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure Each embedding initiated by concatenating POS language andmorph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 81 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure Stackrsquos top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 82 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 83 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure σ-LSTM recalculates its hidden from new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 84 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure Tree-stack LSTM is ready to give new transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 85 123

Final overview of Tree-stack LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 86 123

Overview

1 IntroductionOverview of Dependency ParsingTransition Based Dependency Parsing

2 Related WorkLinear Models and their DrawbacksNeural Network Models

3 ModelLanguage ModelMLP ParserTree-stack LSTM Parser

4 ResultsMLP vs Tree-stack LSTMMorphological Feature EmbeddingsStatic vs Dynamic Oracle TrainingTransfer Learning

5 Conclusion6 Future Work amp Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 87 123

4 Results amp Comparisons

Omer Kırnap (Koc University) MSc Thesis September 27 2018 88 123

Results amp Comparisons

Dataset

Dependency parsing of 81

treebanks in 49 languages

All treebanks use standardized

annotation

17 universal

part-of-speech tags

37 universal dependency

relations

Koc-University ranked 7th out

of 33 participants (1st among

transition based parsers)

Dependency parsing of 82

treebanks in 57 languages

All treebanks use

standardized annotation

17 universal

part-of-speech tags

37 universal dependency

relations

Koc-University ranked 16th

out of 30 participants (2nd

among transition based

parsers)

CoNLL17 CoNLL181 Traintest split change 2 Annotation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 89 123

MLP vs Tree-stack LSTM

CoNLL 2018 committee released comparison results of CoNLL17 andCoNLL18 systems tested under the same test sets

Omer Kırnap (Koc University) MSc Thesis September 27 2018 90 123

MLP vs Tree-stack LSTM

2 possible problems of official comparison

1 If the annotation of the tree bank is improved the older parser ishandicapped

2 If the training-test split has changed and the old training data arenow in test data the old parser is favored undeservedly

Omer Kırnap (Koc University) MSc Thesis September 27 2018 91 123

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code MLP Tree-stackru taiga (10k) 5889 6055hu szeged (20k) 6621 6818tr imst (50k) 5678 5875ar padt (120k) 6783 6814en ewt (205k) 7487 7577cs cac (473k) 8339 8357

Tree-stack LSTM outperforms MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 92 123

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 93 123

MLP Parser

MLP

Figure Initial model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 94 123

Only Action LSTM

LSTM LSTM

Figure Only action LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 95 123

Only β-LSTM

LSTM LSTM LSTM

LSTM LSTM MLP

Figure Only β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 96 123

Only σ-LSTM

LSTM LSTM

LSTM LSTM MLP

Figure Only σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang Code MLP Only Action Only-β Only-σhu szeged 6621 6687 6694 6703sv lines 7112 7205 7217 7245tr imst 5712 5687 5702 5712ar padt 6783 6667 6689 6692

cs cac 8389 8223 8313 8317

en ewt 7554 7543 7556 7567

Table Comparison between MLP and rdquoOnlyrdquo models

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 99 123

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang Code without t-RNN with t-RNNno nynorsklia (3k) 5178 5333ru taiga (11k) 5913 6055gl treegal (15k) 6976 7045hu szeged (20k) 6612 6818sv lines (49k) 7404 7546tr imst (50k) 5812 5875

ar padt (120k) 6804 6814

en ewt (204k) 7487 7577

cs cac (473k) 8289 8357

cs pdt (1M) 8117 81164

t-RNN provides comparative advantage for low-resourcelanguages

Omer Kırnap (Koc University) MSc Thesis September 27 2018 100 123

Ablation Analysis

Overall results of ablation analysis

Lang MLP Only A Only-β Only-σ wot-RNN allhu szeged 6621 6687 6694 6703 6612 6818sv lines 7112 7205 7217 7404 7217 7546tr imst 5712 5687 5702 5712 5812 5875ar padt 6783 6667 6689 6692 6804 6814cs cac 8389 8223 8313 8317 8289 8357en ewt 7554 7543 7556 7567 7487 7577

Tree-stack LSTM beats other model variations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of Ablation Experiments

t-RNNrsquos performance contribution increases when the training sizedecreases

σ-LSTM provides more useful information independent from datasetsize

Interconnecting modelrsquos component with t-RNN makes tree-stackLSTM more powerful for low-resource languages (ranked 10th of alland 2nd among transition based parsers)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What does Morphological Feature Embedding provide

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental SettingsWe divide Conll18 UD dataset 22 into 4 parts based on number oftraining tokens for each language to better understand our contributions

Languages having less than 20k tokens

Languages having more than 20k less than 50k tokens

Languages having more than 50k less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having less than 20k training tokens

Lang code Morph-Feats no Morph-Feats of tokensno nynorsklia 5113 5333 3583

ru taiga 5832 6055 10479

sme giella 5278 5339 16385

la perseus 4993 516 18184

ug udt 5278 5339 19262

sl sst 4672 4877 19473

hu szeged 6623 6818 20166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having tokens in between 50k and100k

Lang code Morph-Feats no Morph-Feats of tokenssv lines 7218 7481 48325

fr sequoia 8436 8217 50543

en gum 7644 7534 53686

ko gsd 7374 7254 56687

eu bdt 7455 7332 72974

nl lassymal 767 758 75134

gl ctg 7902 79018 79327

lv lvtb 7233 7224 80666

id gsd 7576 7397 97531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having more than 100k training tokens

Lang code | Morph-Feats | no Morph-Feats | # of tokens
fa seraji | 81.18 | 81.12 | 121064
bg btb | 84.53 | 84.55 | 124336
en ewt | 75.77 | 75.682 | 204585
ar padt | 68.02 | 68.14 | 223881
de gsd | 71.59 | 71.32 | 263804
ca ancora | 85.89 | 85.874 | 417587
es ancora | 84.99 | 84.78 | 444617
cs cac | 83.57 | 83.63 | 472608
cs pdt | 81.43 | 82.12 | 1173282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle: transitions follow the gold moves. Dynamic oracle: transitions follow the predicted moves.

In both cases, the log-probability of the gold moves is maximized.

[Figure: Tree-stack LSTM overview: σ-LSTM, β-LSTM and Action-LSTM outputs are concatenated with the t-RNN embeddings (head word, dependent word, dependency relation) and fed to an MLP]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123
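The difference between the two regimes can be sketched in a few lines. This is an illustrative sketch, not the thesis code: the `parser`, `state` and `gold_oracle` interfaces and the `explore` probability are assumed names for illustration.

```python
import random

def train_sentence(parser, gold_oracle, dynamic=False, explore=0.1):
    """One training pass over a sentence. Both regimes maximize the
    log-probability of the gold move; they differ only in which move
    is *executed*: the gold move (static oracle) or, with some
    probability, the model's own best-scoring move (dynamic oracle)."""
    loss = 0.0
    state = parser.initial_state()
    while not state.is_final():
        gold = gold_oracle(state)               # correct move from this state
        scores = parser.score_moves(state)      # log-probabilities per move
        loss -= scores[gold]                    # accumulate -log p(gold)
        if dynamic and random.random() < explore:
            move = max(scores, key=scores.get)  # follow the model's prediction
        else:
            move = gold                         # follow the gold transition
        state = state.apply(move)
    return loss
```

Under a static oracle the parser only ever sees gold configurations at training time; the dynamic oracle also exposes it to the configurations its own mistakes produce.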

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning:
1. Using very limited data to train an LM for word and context vectors, and using them to train a parser from scratch
2. Using Facebook's word vectors to train a parser [Bojanowski et al. 2017]
3. Using my own word and context vectors trained on a different language, but from the same language family
4. Applying transfer learning with a pre-trained parser

Language | (1) | (2) | (3) | (4)
af afribooms | not provided | 75.46 | 77.43 | 78.12
kk ktb | 20.19 | 22.31 | 21.96 | 23.86
bxr bdt | 7.64 | 9.76 | 9.93 | 8.98
kmr mg | 20.12 | 22.57 | 22.78 | 23.39

Table: LAS values for strategies (1), (2), (3) and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the most beneficial

From-scratch LM training does not bring useful word and context vectors

Our word and context vectors are still more useful than Facebook's [Bojanowski et al. 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

A transition based parser can only build projective trees 6

6 Figure from https://stp.lingfil.uu.se/~sara/kurser/5LN455-2014/lectures/5LN455-F8.pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123
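The projectivity constraint can be checked directly: a tree is projective iff no two dependency arcs cross when drawn above the sentence. A minimal sketch (the `heads` encoding, with index 0 reserved for the artificial root, is an assumption for illustration):

```python
def is_projective(heads):
    """heads[i] is the head of token i (tokens are 1..n, heads[0] unused,
    head 0 means the artificial root). Projective iff no two arcs cross."""
    arcs = [(min(h, d), max(h, d)) for d, h in enumerate(heads[1:], start=1)]
    for l1, r1 in arcs:
        for l2, r2 in arcs:
            if l1 < l2 < r1 < r2:  # one arc starts inside the other and ends outside
                return False
    return True

print(is_projective([0, 2, 0, 2]))     # tokens 1 and 3 attach to token 2: True
print(is_projective([0, 3, 4, 0, 3]))  # arcs (1,3) and (2,4) cross: False
```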

Projective vs Non-projective

We compared our model with the best model for different projectivity ratios

Language | Projectivity | Best (LAS) | Our (LAS)
grc perseus | 90.7 | 79.39 | 55.03 (20)
eu bdt | 95.13 | 84.22 | 74.13 (17)
hu szeged | 97.8 | 82.66 | 68.18 (14)
da ddt | 98.26 | 86.28 | 76.40 (17)
en gum | 99.6 | 85.05 | 76.44 (15)
gl treegal | 100 | 74.25 | 70.45 (10)
gl ctg | 100 | 82.12 | 79.45 (14)

Table: Our model's performance gap decreases as the projectivity ratio increases 7

7 From the official results page and our projectivity table
Omer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusion: We introduced "Context, Word and Morph-feat" embeddings and showed their contribution in transition based dependency parsing

Our Tree-stack LSTM outperformed the MLP while removing hand-crafted feature engineering

Tree-stack LSTM performed better on low-resource languages

When the training dataset size increases, tree-stack LSTM loses its advantage

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization, morphological tagging and dependency parsing performed better. Some also jointly train a language model together with pre-trained embeddings.

Attention Mechanism

Applying attention between σ-LSTM, β-LSTM or Action-LSTM states may bring a performance improvement.

Morphological Features

Finding a different way to represent morphological features.

Dynamic Oracle vs Beam Training

Although I tried both of them, I could not obtain a performance improvement. There may be convergence problems with our loss function, and other losses (e.g. CRF) may solve this problem.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap Erenay Dayanık and Deniz Yuret 2018 Tree-stackLSTM in Transition Based Dependency Parsing In Proceedings ofthe CoNLL 2018 Shared Task Multilingual Parsing from Raw Text toUniversal Dependencies

Omer Kırnap Berkay Furkan Onder and Deniz Yuret 2017 Parsingwith Context Embeddings In Proceedings of the CoNLL 2017 SharedTask Multilingual Parsing from Raw Text to Universal Dependencies

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann, Carlos Gómez-Rodríguez and Giorgio Satta. 2011. Dynamic programming algorithms for transition-based dependency parsers. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies - Volume 1. Association for Computational Linguistics, pages 673-682.

S. Kübler, R. McDonald and J. Nivre. 2009. Dependency Parsing. Morgan & Claypool, US.

Chris Dyer, Miguel Ballesteros, Wang Ling, Austin Matthews and Noah A. Smith. 2015. Transition-based dependency parsing with stack long short-term memory. CoRR abs/1505.08075.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123


β-LSTM

[Figure: Tree-stack LSTM overview with the β-LSTM highlighted: σ-LSTM, β-LSTM and Action-LSTM outputs are concatenated with the t-RNN embeddings (head word, dependent word, dependency relation) and fed to an MLP]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 64 123

β-LSTM

Figure: Buffer's β-LSTM, an LSTM running over the buffer words w_i, w_{i+1}, w_{i+2}, ...

Omer Kırnap (Koc University) MSc Thesis September 27 2018 65 123

σ-LSTM

[Figure: Tree-stack LSTM overview with the σ-LSTM highlighted: σ-LSTM, β-LSTM and Action-LSTM outputs are concatenated with the t-RNN embeddings (head word, dependent word, dependency relation) and fed to an MLP]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 66 123

σ-LSTM

Figure: Stack's σ-LSTM, an LSTM running over the stack items s_i, s_{i+1}, s_{i+2}, ...

Omer Kırnap (Koc University) MSc Thesis September 27 2018 67 123

Action-LSTM

[Figure: Tree-stack LSTM overview with the Action-LSTM highlighted: σ-LSTM, β-LSTM and Action-LSTM outputs are concatenated with the t-RNN embeddings (head word, dependent word, dependency relation) and fed to an MLP]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 68 123

Action-LSTM

Figure: Action-LSTM, an LSTM running over the sequence of past parser actions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 69 123

How are the components of tree-stack LSTM connected?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 70 123

Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 71 123

Tree-RNN (t-RNN)

[Figure: t-RNN composing the head word, dependent word and dependency relation embeddings]

w_head,new = tanh(W_rnn * [w_head,old; d_l; w_dep] + b_rnn)   (1)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 72 123
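Equation (1) can be sketched in plain Python. This is a pedagogical sketch: the real model uses learned matrices and library tensor operations, and the dimensions below are illustrative.

```python
import math

def trnn_compose(w_head_old, d_label, w_dep, W_rnn, b_rnn):
    """w_head_new = tanh(W_rnn * [w_head_old; d_l; w_dep] + b_rnn):
    fold a dependent and its label into the head word's embedding."""
    x = w_head_old + d_label + w_dep                 # list concatenation = [;]
    return [math.tanh(sum(w * xi for w, xi in zip(row, x)) + b)
            for row, b in zip(W_rnn, b_rnn)]

# Toy sizes: 2-dim word embeddings, 1-dim label embedding, so W_rnn is 2x5.
W = [[0.1, 0.1, 0.1, 0.1, 0.1],
     [0.0, 0.0, 0.0, 0.0, 0.0]]
b = [0.0, 0.0]
new_head = trnn_compose([1.0, 0.0], [0.5], [0.0, 1.0], W, b)
```

Because the output replaces the head's embedding, repeated reductions accumulate subtree information into a single vector, which is what lets the parser see partially built trees.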

Tree-RNN with

1. Left Transition
2. Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 73 123

Left Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 74 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: Each embedding is initialized by concatenating POS, language and morph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 75 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: Stack's top LSTM is reduced
Omer Kırnap (Koc University) MSc Thesis September 27 2018 76 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: t-RNN calculates the new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 77 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: β-LSTM recalculates its hidden state based on the new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 78 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: Tree-stack LSTM is ready for the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 79 123

Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 80 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: Each embedding is initialized by concatenating POS, language and morph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 81 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: Stack's top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 82 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: t-RNN calculates the new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 83 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: σ-LSTM recalculates its hidden state from the new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 84 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: Tree-stack LSTM is ready for the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 85 123
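The two transitions above can be sketched as operations on a (stack σ, buffer β, arc set A) configuration. This is a minimal sketch under stated assumptions: token indices stand in for the LSTM-embedded items of the actual model, and `shift` is included only to complete the toy parser.

```python
def left_arc(stack, buffer, arcs, d):
    """left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)}):
    the buffer front b becomes the head of the popped stack top s."""
    s = stack.pop()
    arcs.add((buffer[0], d, s))

def right_arc(stack, buffer, arcs, d):
    """right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)}):
    the second stack item s becomes the head of the popped top t."""
    t = stack.pop()
    arcs.add((stack[-1], d, t))

def shift(stack, buffer, arcs):
    stack.append(buffer.pop(0))

# Toy run on token indices 1..3 (0 = artificial root):
stack, buffer, arcs = [0], [1, 2, 3], set()
shift(stack, buffer, arcs)              # σ = [0, 1], β = [2, 3]
left_arc(stack, buffer, arcs, "amod")   # token 2 heads token 1
shift(stack, buffer, arcs)              # σ = [0, 2], β = [3]
shift(stack, buffer, arcs)              # σ = [0, 2, 3], β = []
right_arc(stack, buffer, arcs, "obj")   # token 2 heads token 3
right_arc(stack, buffer, arcs, "root")  # root heads token 2
```

Note that both arc transitions remove the dependent from the configuration, so each token receives exactly one head, as a dependency tree requires.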

Final overview of Tree-stack LSTM

[Figure: Tree-stack LSTM overview: σ-LSTM, β-LSTM and Action-LSTM outputs are concatenated with the t-RNN embeddings (head word, dependent word, dependency relation) and fed to an MLP]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 86 123
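The Concat and MLP steps in the overview can be sketched as follows. This is a toy sketch with made-up dimensions: the thesis model concatenates the σ-, β- and Action-LSTM summaries (together with the t-RNN-composed embeddings) before the MLP, and the function and parameter names here are assumptions.

```python
import math

def score_actions(sigma_h, beta_h, action_h, W_h, b_h, W_o, b_o):
    """Concatenate the component summaries and score parser actions
    with a one-hidden-layer MLP (tanh hidden activation)."""
    x = sigma_h + beta_h + action_h                  # Concat
    hidden = [math.tanh(sum(w * xi for w, xi in zip(row, x)) + b)
              for row, b in zip(W_h, b_h)]
    return [sum(w * hi for w, hi in zip(row, hidden)) + b
            for row, b in zip(W_o, b_o)]

# Toy sizes: 1-dim summaries, 1 hidden unit, 2 candidate actions.
scores = score_actions([1.0], [0.0], [0.0],
                       W_h=[[1.0, 0.0, 0.0]], b_h=[0.0],
                       W_o=[[1.0], [0.0]], b_o=[0.0, 0.0])
```

The highest-scoring action would then be executed, updating the stack, buffer and arc set before the next scoring step.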

Overview

1. Introduction: Overview of Dependency Parsing; Transition Based Dependency Parsing

2. Related Work: Linear Models and their Drawbacks; Neural Network Models

3. Model: Language Model; MLP Parser; Tree-stack LSTM Parser

4. Results: MLP vs Tree-stack LSTM; Morphological Feature Embeddings; Static vs Dynamic Oracle Training; Transfer Learning

5. Conclusion
6. Future Work & Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 87 123

4. Results & Comparisons

Omer Kırnap (Koc University) MSc Thesis September 27 2018 88 123

Results amp Comparisons

Dataset

CoNLL17:
Dependency parsing of 81 treebanks in 49 languages
All treebanks use standardized annotation
17 universal part-of-speech tags
37 universal dependency relations
Koc University ranked 7th out of 33 participants (1st among transition based parsers)

CoNLL18:
Dependency parsing of 82 treebanks in 57 languages
All treebanks use standardized annotation
17 universal part-of-speech tags
37 universal dependency relations
Koc University ranked 16th out of 30 participants (2nd among transition based parsers)

Differences between CoNLL17 and CoNLL18: 1. Train/test split change 2. Annotation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 89 123

MLP vs Tree-stack LSTM

The CoNLL 2018 committee released comparison results of CoNLL17 and CoNLL18 systems tested on the same test sets

Omer Kırnap (Koc University) MSc Thesis September 27 2018 90 123

MLP vs Tree-stack LSTM

2 possible problems of the official comparison:

1. If the annotation of the treebank is improved, the older parser is handicapped

2. If the training-test split has changed and the old training data are now in the test data, the old parser is favored undeservedly

Omer Kırnap (Koc University) MSc Thesis September 27 2018 91 123

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models:

Lang Code | MLP | Tree-stack
ru taiga (10k) | 58.89 | 60.55
hu szeged (20k) | 66.21 | 68.18
tr imst (50k) | 56.78 | 58.75
ar padt (120k) | 67.83 | 68.14
en ewt (205k) | 74.87 | 75.77
cs cac (473k) | 83.39 | 83.57

Tree-stack LSTM outperforms MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 92 123

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 93 123

MLP Parser

Figure: Initial model (MLP only)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 94 123

Only Action LSTM

Figure: Only Action-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 95 123

Only β-LSTM

Figure: Only β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 96 123

Only σ-LSTM

Figure: Only σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang Code | MLP | Only Action | Only-β | Only-σ
hu szeged | 66.21 | 66.87 | 66.94 | 67.03
sv lines | 71.12 | 72.05 | 72.17 | 72.45
tr imst | 57.12 | 56.87 | 57.02 | 57.12
ar padt | 67.83 | 66.67 | 66.89 | 66.92
cs cac | 83.89 | 82.23 | 83.13 | 83.17
en ewt | 75.54 | 75.43 | 75.56 | 75.67

Table: Comparison between the MLP and "Only" models

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN

[Figure: Tree-stack LSTM overview: σ-LSTM, β-LSTM and Action-LSTM outputs are concatenated with the t-RNN embeddings (head word, dependent word, dependency relation) and fed to an MLP]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 99 123

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN:

Lang Code | without t-RNN | with t-RNN
no nynorsklia (3k) | 51.78 | 53.33
ru taiga (11k) | 59.13 | 60.55
gl treegal (15k) | 69.76 | 70.45
hu szeged (20k) | 66.12 | 68.18
sv lines (49k) | 74.04 | 75.46
tr imst (50k) | 58.12 | 58.75
ar padt (120k) | 68.04 | 68.14
en ewt (204k) | 74.87 | 75.77
cs cac (473k) | 82.89 | 83.57
cs pdt (1M) | 81.17 | 81.164

t-RNN provides a comparative advantage for low-resource languages

Omer Kırnap (Koc University) MSc Thesis September 27 2018 100 123

Ablation Analysis

Overall results of the ablation analysis:

Lang | MLP | Only A | Only-β | Only-σ | w/o t-RNN | all
hu szeged | 66.21 | 66.87 | 66.94 | 67.03 | 66.12 | 68.18
sv lines | 71.12 | 72.05 | 72.17 | 74.04 | 72.17 | 75.46
tr imst | 57.12 | 56.87 | 57.02 | 57.12 | 58.12 | 58.75
ar padt | 67.83 | 66.67 | 66.89 | 66.92 | 68.04 | 68.14
cs cac | 83.89 | 82.23 | 83.13 | 83.17 | 82.89 | 83.57
en ewt | 75.54 | 75.43 | 75.56 | 75.67 | 74.87 | 75.77

Tree-stack LSTM beats the other model variations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123


Contribution of Morph-feat embeddings

Morp-feat experiments for languages having more than 100k trainingtokens

Lang code Morph-Feats no Morph-Feats of tokensfa seraji 8118 8112 121064

bg btb 8453 8455 124336

en ewt 7577 75682 204585

ar padt 6802 6814 223881

de gsd 7159 7132 263804

ca ancora 8589 85874 417587

es ancora 8499 8478 444617

cs cac 8357 8363 472608

cs pdt 8143 8212 1173282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle transitions using gold movesDynamic oracle transitions using predicted moves

In both cases logp of gold moves maximized

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning1 Using very limited data to train LM for word and context vectors and

use them to train a parser from scratch2 Using Facebookrsquos word vectors to train a parser [Bojanowski et al

2017]3 Using my own word and context vectors trained with different

language but from the same language family4 Applying transfer learning with a pre-trained parser

Language (1) (2) (3) (4)af afribooms not provided 7546 7743 7812kk ktb 2019 2231 2196 2386bxr bdt 764 976 993 898

kmr mg 2012 2257 2278 2339

Table LAS values for strategies (1) (2) (3) and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the mostbeneficial

From scratch LM training does not bring useful word and contextvectors

Our word and context vectors are still more useful than Facebookrsquos[Bojanowski et al 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition Based Parser can only build projective trees 6

6Figure fromhttpstplingfiluuse sarakurser5LN455-2014lectures5LN455-F8pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123

Projective vs Non-projective

We compared our model with the best model for different projectivityratios

Language Projectiviy Best (LAS) Our (LAS)

grc perseus 907 7939 5503 (20)

eu bdt 9513 8422 7413 (17)

hu szeged 978 8266 6818 (14)

da ddt 9826 8628 7640 (17)

en gum 996 8505 7644 (15)

gl treegal 100 7425 7045 (10)

gl ctg 100 8212 7945 (14)

Table Our models performance gap decreases as the projectivity ratio increases

7

7From official results page and our projectivity tableOmer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusionWe introduced ldquoContext Word and Morph-featrdquo embeddings and showedtheir contribution in transition based dependency parsing

Our Tree-stack LSTM outperformed MLP by removing hand-craftedfeature engineering

Tree-stack LSTM performed better with low resource languages

When the training dataset size increases tree-stack LSTM losses itsadvantage

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization morphological taggingand dependency parsing performed better Some are also jointly trained alanguage model together with pre-trained embeddings

Attention Mechanism

Applying attention in between σ-LSTM states or β-LSTM orAction-LSTM may bring performance improvement

Morphological Features

Finding different way to represent morphological features

Dynamic Oracle vs Beam Training

Although I tried both of them I could not obtain performanceimprovement There may be convergence problems with our loss functionand another losses (CRF) may solve this problem

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap Erenay Dayanık and Deniz Yuret 2018 Tree-stackLSTM in Transition Based Dependency Parsing In Proceedings ofthe CoNLL 2018 Shared Task Multilingual Parsing from Raw Text toUniversal Dependencies

Omer Kırnap Berkay Furkan Onder and Deniz Yuret 2017 Parsingwith Context Embeddings In Proceedings of the CoNLL 2017 SharedTask Multilingual Parsing from Raw Text to Universal Dependencies

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann Carlos Gomez-Rodriguez and Giorgio Satta 2011Dynamic programming algorithms for transition-based dependencyparsers In Proceedings of the 49th Annual Meeting of theAssociation for Computational Linguistics Human LanguageTechnologies-Volume 1 Association for Computational Linguisticspages 673682

S Kbler R McDonald and J Nivre 2009 Dependency parsingMorgan amp Claypool US

Chris Dyer Miguel Ballesteros Wang Ling Austin Matthews andNoah A Smith 2015 Transition based dependency parsing withstack long-short term memory CoRR abs150508075

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123

  • Introduction
    • Overview of Dependency Parsing
    • Transition Based Dependency Parsing
      • Related Work
        • Linear Models and their Drawbacks
        • Neural Network Models
          • Model
            • Language Model
            • MLP Parser
            • Tree-stack LSTM Parser
              • Results
                • MLP vs Tree-stack LSTM
                • Morphological Feature Embeddings
                • Static vs Dynamic Oracle Training
                • Transfer Learning
                  • Conclusion
                  • Future Work amp Discussions
Page 66: Transition Based Dependency Parsing with Deep LearningTransition Based Dependency Parsing with Deep Learning Omer K rnap Ko˘c University okirnap@ku.edu.tr September 27, 2018 Omer

σ-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 66 123

σ-LSTM

LSTM LSTM LSTM

s_i  s_{i+1}  s_{i+2}

Figure Stack's σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 67 123

Action-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 68 123

Action-LSTM

LSTM LSTM LSTM

Figure Action-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 69 123

How are the components of tree-stack LSTM connected?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 70 123

Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 71 123

Tree-RNN (t-RNN)

t-RNN

Dependent word

Dependency Relation

Head word

Figure t-RNN

$w_{head}^{new} = \tanh(W_{rnn} \cdot [w_{head}^{old}; d_l; w_{dep}] + b_{rnn})$   (1)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 72 123
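Equation (1) is a single tanh layer over the concatenation of the old head, relation, and dependent vectors. A minimal numpy sketch (the dimensions here are illustrative assumptions, not the thesis settings):

```python
import numpy as np

def trnn_update(w_head, d_rel, w_dep, W_rnn, b_rnn):
    """Equation (1): compose a new head embedding from the old head
    embedding, the dependency-relation embedding, and the dependent
    embedding."""
    x = np.concatenate([w_head, d_rel, w_dep])  # [w_head_old ; d_l ; w_dep]
    return np.tanh(W_rnn @ x + b_rnn)

# Illustrative sizes (assumptions): 4-dim word vectors, 2-dim relation vectors.
rng = np.random.default_rng(0)
W_rnn = rng.normal(size=(4, 4 + 2 + 4))
b_rnn = np.zeros(4)
new_head = trnn_update(rng.normal(size=4), rng.normal(size=2),
                       rng.normal(size=4), W_rnn, b_rnn)
```

Successive left and right transitions reapply this update, so a head's embedding gradually accumulates information from its subtree.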

Tree-RNN with

1 Left Transition2 Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 73 123

Left Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 74 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

LSTM LSTM LSTM LSTM

Left transition

t-RNN

Dependency Relation

HeadDependent

Figure Each embedding is initialized by concatenating POS, language, and morph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 75 123
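The caption's initialization step, concatenating POS, language, and morph-feat embeddings, is plain vector concatenation. A toy sketch with made-up lookup tables (the real tables and dimensions in the thesis differ):

```python
import numpy as np

# Hypothetical toy embedding tables; real vocabularies and sizes differ.
pos_emb  = {"NOUN": np.array([1.0, 0.0]), "VERB": np.array([0.0, 1.0])}
lang_emb = {"en": np.array([0.5]), "hu": np.array([-0.5])}
feat_emb = {"Number=Sing": np.array([0.2, 0.3]),
            "Number=Plur": np.array([0.4, 0.1])}

def init_embedding(pos, lang, feats):
    """Initial token vector = [POS ; language ; morph-feat] concatenation."""
    return np.concatenate([pos_emb[pos], lang_emb[lang], feat_emb[feats]])

vec = init_embedding("NOUN", "en", "Number=Sing")
```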

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

Figure Stack's top LSTM is reduced
Omer Kırnap (Koc University) MSc Thesis September 27 2018 76 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 77 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure β-LSTM recalculates its hidden state based on the new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 78 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure Tree-stack LSTM is ready to predict the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 79 123
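The left-arc rule, left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)}), can be sketched directly on Python lists (the word ids and the `left_arc` helper are illustrative, not the thesis implementation):

```python
def left_arc(stack, buffer, arcs, d):
    """left_d(sigma|s, b|beta, A) = (sigma, b|beta, A + {(b, d, s)}):
    pop the stack top s; the buffer front b becomes its head."""
    s = stack.pop()           # reduce the stack top
    b = buffer[0]             # head stays at the buffer front
    arcs.append((b, d, s))    # arc stored as (head, relation, dependent)
    return stack, buffer, arcs

# Word ids: stack [ROOT=0, 1], buffer [2, 3]; attach 1 as nsubj of 2.
stack, buffer, arcs = left_arc([0, 1], [2, 3], [], "nsubj")
```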

Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 80 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

LSTM LSTM LSTM LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure Each embedding is initialized by concatenating POS, language, and morph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 81 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure Stack's top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 82 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 83 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure σ-LSTM recalculates its hidden state from the new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 84 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure Tree-stack LSTM is ready to predict the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 85 123
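The right-arc rule, right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)}), has the same shape: the stack top is reduced, but its head is the word below it on the stack (again an illustrative sketch, not the thesis code):

```python
def right_arc(stack, buffer, arcs, d):
    """right_d(sigma|s|t, beta, A) = (sigma|s, beta, A + {(s, d, t)}):
    pop the stack top t; the word s below it becomes the head."""
    t = stack.pop()           # reduce the stack top
    s = stack[-1]             # head is the new stack top
    arcs.append((s, d, t))    # arc stored as (head, relation, dependent)
    return stack, buffer, arcs

# Word ids: stack [ROOT=0, 1, 2]; attach 2 as obj of 1.
stack, buffer, arcs = right_arc([0, 1, 2], [3], [], "obj")
```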

Final overview of Tree-stack LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 86 123

Overview

1 Introduction
    Overview of Dependency Parsing
    Transition Based Dependency Parsing

2 Related Work
    Linear Models and their Drawbacks
    Neural Network Models

3 Model
    Language Model
    MLP Parser
    Tree-stack LSTM Parser

4 Results
    MLP vs Tree-stack LSTM
    Morphological Feature Embeddings
    Static vs Dynamic Oracle Training
    Transfer Learning

5 Conclusion

6 Future Work & Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 87 123

4 Results & Comparisons

Omer Kırnap (Koc University) MSc Thesis September 27 2018 88 123

Results amp Comparisons

Dataset

CoNLL17: dependency parsing of 81 treebanks in 49 languages. All treebanks use standardized annotation: 17 universal part-of-speech tags and 37 universal dependency relations. Koc-University ranked 7th out of 33 participants (1st among transition based parsers).

CoNLL18: dependency parsing of 82 treebanks in 57 languages, with the same standardized annotation. Koc-University ranked 16th out of 30 participants (2nd among transition based parsers).

Changes from CoNLL17 to CoNLL18: 1 train/test split change, 2 annotation.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 89 123

MLP vs Tree-stack LSTM

The CoNLL 2018 committee released comparison results of CoNLL17 and CoNLL18 systems tested under the same test sets

Omer Kırnap (Koc University) MSc Thesis September 27 2018 90 123

MLP vs Tree-stack LSTM

Two possible problems of the official comparison:

1 If the annotation of the treebank is improved, the older parser is handicapped

2 If the training-test split has changed and the old training data are now in test data, the old parser is favored undeservedly

Omer Kırnap (Koc University) MSc Thesis September 27 2018 91 123

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code        MLP    Tree-stack
ru taiga (10k)   58.89  60.55
hu szeged (20k)  66.21  68.18
tr imst (50k)    56.78  58.75
ar padt (120k)   67.83  68.14
en ewt (205k)    74.87  75.77
cs cac (473k)    83.39  83.57

Tree-stack LSTM outperforms MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 92 123

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 93 123

MLP Parser

MLP

Figure Initial model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 94 123

Only Action LSTM

LSTM LSTM

Figure Only action LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 95 123

Only β-LSTM

LSTM LSTM LSTM

LSTM LSTM MLP

Figure Only β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 96 123

Only σ-LSTM

LSTM LSTM

LSTM LSTM MLP

Figure Only σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang Code   MLP    Only Action  Only-β  Only-σ
hu szeged   66.21  66.87        66.94   67.03
sv lines    71.12  72.05        72.17   72.45
tr imst     57.12  56.87        57.02   57.12
ar padt     67.83  66.67        66.89   66.92
cs cac      83.89  82.23        83.13   83.17
en ewt      75.54  75.43        75.56   75.67

Table Comparison between MLP and "Only" models

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 99 123

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang Code            without t-RNN  with t-RNN
no nynorsklia (3k)   51.78          53.33
ru taiga (11k)       59.13          60.55
gl treegal (15k)     69.76          70.45
hu szeged (20k)      66.12          68.18
sv lines (49k)       74.04          75.46
tr imst (50k)        58.12          58.75
ar padt (120k)       68.04          68.14
en ewt (204k)        74.87          75.77
cs cac (473k)        82.89          83.57
cs pdt (1M)          81.17          81.164

t-RNN provides a comparative advantage for low-resource languages

Omer Kırnap (Koc University) MSc Thesis September 27 2018 100 123

Ablation Analysis

Overall results of ablation analysis

Lang        MLP    Only A  Only-β  Only-σ  w/o t-RNN  all
hu szeged   66.21  66.87   66.94   67.03   66.12      68.18
sv lines    71.12  72.05   72.17   74.04   72.17      75.46
tr imst     57.12  56.87   57.02   57.12   58.12      58.75
ar padt     67.83  66.67   66.89   66.92   68.04      68.14
cs cac      83.89  82.23   83.13   83.17   82.89      83.57
en ewt      75.54  75.43   75.56   75.67   74.87      75.77

Tree-stack LSTM beats other model variations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of Ablation Experiments

t-RNN's performance contribution increases when the training size decreases

σ-LSTM provides more useful information independent of dataset size

Interconnecting the model's components with t-RNN makes tree-stack LSTM more powerful for low-resource languages (ranked 10th overall and 2nd among transition based parsers)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What does Morphological Feature Embedding provide?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental Settings
We divide the CoNLL18 UD dataset into 4 parts based on the number of training tokens for each language, to better understand our contributions

Languages having less than 20k tokens

Languages having more than 20k and less than 50k tokens

Languages having more than 50k and less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123
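The grouping above can be expressed as a small helper. The thresholds come from the slide; treating each boundary as exclusive is my assumption (the tables themselves are fuzzy at the edges, e.g. hu szeged with 20,166 tokens appears in the under-20k table):

```python
def size_group(n_train_tokens):
    """Assign a treebank to one of the four CoNLL18 training-size groups."""
    if n_train_tokens < 20_000:
        return "<20k"
    if n_train_tokens < 50_000:
        return "20k-50k"
    if n_train_tokens < 100_000:
        return "50k-100k"
    return ">=100k"

# Token counts taken from the tables in this section.
groups = [size_group(n) for n in (3583, 48325, 97531, 204585)]
```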

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having less than 20k training tokens

Lang code      Morph-Feats  no Morph-Feats  # of tokens
no nynorsklia  51.13        53.33           3,583
ru taiga       58.32        60.55           10,479
sme giella     52.78        53.39           16,385
la perseus     49.93        51.60           18,184
ug udt         52.78        53.39           19,262
sl sst         46.72        48.77           19,473
hu szeged      66.23        68.18           20,166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having between 50k and 100k training tokens

Lang code      Morph-Feats  no Morph-Feats  # of tokens
sv lines       72.18        74.81           48,325
fr sequoia     84.36        82.17           50,543
en gum         76.44        75.34           53,686
ko gsd         73.74        72.54           56,687
eu bdt         74.55        73.32           72,974
nl lassysmall  76.70        75.80           75,134
gl ctg         79.02        79.018          79,327
lv lvtb        72.33        72.24           80,666
id gsd         75.76        73.97           97,531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having more than 100k training tokens

Lang code   Morph-Feats  no Morph-Feats  # of tokens
fa seraji   81.18        81.12           121,064
bg btb      84.53        84.55           124,336
en ewt      75.77        75.682          204,585
ar padt     68.02        68.14           223,881
de gsd      71.59        71.32           263,804
ca ancora   85.89        85.874          417,587
es ancora   84.99        84.78           444,617
cs cac      83.57        83.63           472,608
cs pdt      81.43        82.12           1,173,282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle: transitions using gold moves
Dynamic oracle: transitions using predicted moves

In both cases, the log-probability of gold moves is maximized

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123
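Schematically, the two regimes differ only in which move is executed next; the loss term is the same in both. A stand-in training step (the model and oracle below are placeholders, not the thesis code):

```python
import math

def train_step(state, move_probs, gold_oracle, dynamic, losses):
    """One parsing step: always accumulate -log p(gold move), but execute
    the gold move (static oracle) or the model's argmax move (dynamic)."""
    probs = move_probs(state)                 # move -> model probability
    gold = gold_oracle(state)
    losses.append(-math.log(probs[gold]))     # -log p(gold), to be minimized
    if dynamic:
        return max(probs, key=probs.get)      # follow the predicted move
    return gold                               # follow the gold move

# Stand-in model and oracle: the model prefers SHIFT, gold is LEFT.
probs = lambda s: {"SHIFT": 0.7, "LEFT": 0.3}
oracle = lambda s: "LEFT"
losses = []
static_move = train_step(None, probs, oracle, dynamic=False, losses=losses)
dynamic_move = train_step(None, probs, oracle, dynamic=True, losses=losses)
```

Under the dynamic oracle the parser visits states its own mistakes produce, which is what makes training on predicted moves different from simply replaying gold transitions.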

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning:
1 Using very limited data to train an LM for word and context vectors, and using them to train a parser from scratch
2 Using Facebook's word vectors to train a parser [Bojanowski et al. 2017]
3 Using my own word and context vectors trained on a different language from the same language family
4 Applying transfer learning with a pre-trained parser

Language      (1)           (2)    (3)    (4)
af afribooms  not provided  75.46  77.43  78.12
kk ktb        20.19         22.31  21.96  23.86
bxr bdt       7.64          9.76   9.93   8.98
kmr mg        20.12         22.57  22.78  23.39

Table LAS values for strategies (1) (2) (3) and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the most beneficial

From-scratch LM training does not produce useful word and context vectors

Our word and context vectors are still more useful than Facebook's [Bojanowski et al. 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

A transition based parser can only build projective trees 6

6Figure fromhttpstplingfiluuse sarakurser5LN455-2014lectures5LN455-F8pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123
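Projectivity can be checked by testing whether any two dependency arcs cross. A minimal sketch over head-index arrays (a hypothetical helper, not the thesis code):

```python
def is_projective(heads):
    """heads[i] = head index of word i (-1 for the root). A tree is
    projective iff no two dependency arcs cross."""
    spans = [(min(i, h), max(i, h)) for i, h in enumerate(heads) if h >= 0]
    for a, b in spans:
        for c, d in spans:
            if a < c < b < d:   # arcs (a,b) and (c,d) cross
                return False
    return True

chain_ok = is_projective([-1, 0, 1])     # 0 <- 1 <- 2: projective chain
crossing = is_projective([-1, 3, 0, 0])  # arcs (1,3) and (0,2) cross
```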

Projective vs Non-projective

We compared our model with the best model for different projectivity ratios

Language     Projectivity  Best (LAS)  Our (LAS)
grc perseus  90.70         79.39       55.03 (20)
eu bdt       95.13         84.22       74.13 (17)
hu szeged    97.80         82.66       68.18 (14)
da ddt       98.26         86.28       76.40 (17)
en gum       99.60         85.05       76.44 (15)
gl treegal   100           74.25       70.45 (10)
gl ctg       100           82.12       79.45 (14)

Table Our model's performance gap decreases as the projectivity ratio increases

7 From official results page and our projectivity table
Omer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusion:
We introduced "Context, Word, and Morph-feat" embeddings and showed their contribution in transition based dependency parsing

Our Tree-stack LSTM outperformed the MLP while removing hand-crafted feature engineering

Tree-stack LSTM performed better with low resource languages

When the training dataset size increases, tree-stack LSTM loses its advantage

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization, morphological tagging, and dependency parsing performed better. Some also jointly train a language model together with pre-trained embeddings.

Attention Mechanism

Applying attention between σ-LSTM states, β-LSTM states, or Action-LSTM states may bring a performance improvement

Morphological Features

Finding different ways to represent morphological features

Dynamic Oracle vs Beam Training

Although I tried both of them, I could not obtain a performance improvement. There may be convergence problems with our loss function, and other losses (e.g., CRF) may solve this problem.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Ömer Kırnap, Erenay Dayanık, and Deniz Yuret. 2018. Tree-stack LSTM in Transition Based Dependency Parsing. In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Ömer Kırnap, Berkay Furkan Önder, and Deniz Yuret. 2017. Parsing with Context Embeddings. In Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann, Carlos Gómez-Rodríguez, and Giorgio Satta. 2011. Dynamic programming algorithms for transition-based dependency parsers. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Volume 1. Association for Computational Linguistics, pages 673-682.

S. Kübler, R. McDonald, and J. Nivre. 2009. Dependency Parsing. Morgan & Claypool, US.

Chris Dyer, Miguel Ballesteros, Wang Ling, Austin Matthews, and Noah A. Smith. 2015. Transition-based dependency parsing with stack long short-term memory. CoRR abs/1505.08075.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123

• Introduction
    • Overview of Dependency Parsing
    • Transition Based Dependency Parsing
• Related Work
    • Linear Models and their Drawbacks
    • Neural Network Models
• Model
    • Language Model
    • MLP Parser
    • Tree-stack LSTM Parser
• Results
    • MLP vs Tree-stack LSTM
    • Morphological Feature Embeddings
    • Static vs Dynamic Oracle Training
    • Transfer Learning
• Conclusion
• Future Work & Discussions
Page 67: Transition Based Dependency Parsing with Deep LearningTransition Based Dependency Parsing with Deep Learning Omer K rnap Ko˘c University okirnap@ku.edu.tr September 27, 2018 Omer

σ-LSTM

LSTM LSTM LSTM

si si+1 si+2

Figure Stackrsquos σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 67 123

Action-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 68 123

Action-LSTM

LSTM LSTM LSTM

Figure Action-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 69 123

How do components of tree-stack LSTM are connected

Omer Kırnap (Koc University) MSc Thesis September 27 2018 70 123

Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 71 123

Tree-RNN (t-RNN)

t-RNN

Dependent word

Dependency Relation

Head word

Figure t-RNN

whead new = tanh(Wrnn lowast [whead old dl wdep] + brnn) (1)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 72 123

Tree-RNN with

1 Left Transition2 Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 73 123

Left Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 74 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM LSTM

Left transition

t-RNN

Dependency Relation

HeadDependent

Figure Each embedding initiated by concatenating POS language andmorph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 75 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

Figure Stackrsquos top LSTM is reducedOmer Kırnap (Koc University) MSc Thesis September 27 2018 76 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 77 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure β-LSTM recalculates its hidden based on new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 78 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure Tree-stack LSTM is ready to give new transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 79 123

Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 80 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure Each embedding initiated by concatenating POS language andmorph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 81 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure Stackrsquos top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 82 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 83 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure σ-LSTM recalculates its hidden from new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 84 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure Tree-stack LSTM is ready to give new transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 85 123

Final overview of Tree-stack LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 86 123

Overview

1 IntroductionOverview of Dependency ParsingTransition Based Dependency Parsing

2 Related WorkLinear Models and their DrawbacksNeural Network Models

3 ModelLanguage ModelMLP ParserTree-stack LSTM Parser

4 ResultsMLP vs Tree-stack LSTMMorphological Feature EmbeddingsStatic vs Dynamic Oracle TrainingTransfer Learning

5 Conclusion6 Future Work amp Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 87 123

4 Results amp Comparisons

Omer Kırnap (Koc University) MSc Thesis September 27 2018 88 123

Results amp Comparisons

Dataset

Dependency parsing of 81

treebanks in 49 languages

All treebanks use standardized

annotation

17 universal

part-of-speech tags

37 universal dependency

relations

Koc-University ranked 7th out

of 33 participants (1st among

transition based parsers)

Dependency parsing of 82

treebanks in 57 languages

All treebanks use

standardized annotation

17 universal

part-of-speech tags

37 universal dependency

relations

Koc-University ranked 16th

out of 30 participants (2nd

among transition based

parsers)

CoNLL17 CoNLL181 Traintest split change 2 Annotation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 89 123

MLP vs Tree-stack LSTM

CoNLL 2018 committee released comparison results of CoNLL17 andCoNLL18 systems tested under the same test sets

Omer Kırnap (Koc University) MSc Thesis September 27 2018 90 123

MLP vs Tree-stack LSTM

2 possible problems of official comparison

1 If the annotation of the tree bank is improved the older parser ishandicapped

2 If the training-test split has changed and the old training data arenow in test data the old parser is favored undeservedly

Omer Kırnap (Koc University) MSc Thesis September 27 2018 91 123

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code MLP Tree-stackru taiga (10k) 5889 6055hu szeged (20k) 6621 6818tr imst (50k) 5678 5875ar padt (120k) 6783 6814en ewt (205k) 7487 7577cs cac (473k) 8339 8357

Tree-stack LSTM outperforms MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 92 123

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 93 123

MLP Parser

MLP

Figure Initial model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 94 123

Only Action LSTM

LSTM LSTM

Figure Only action LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 95 123

Only β-LSTM

[Figure: Only the β-LSTM]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 96 123

Only σ-LSTM

[Figure: Only the σ-LSTM]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang Code  MLP    Only Action  Only-β  Only-σ
hu_szeged  66.21  66.87        66.94   67.03
sv_lines   71.12  72.05        72.17   72.45
tr_imst    57.12  56.87        57.02   57.12
ar_padt    67.83  66.67        66.89   66.92
cs_cac     83.89  82.23        83.13   83.17
en_ewt     75.54  75.43        75.56   75.67

Table: Comparison between MLP and "Only" models

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN

[Figure: Tree-stack LSTM with the t-RNN component highlighted; the t-RNN combines the head word, dependent word, and dependency relation, and the σ-LSTM, β-LSTM, and Action-LSTM outputs are concatenated and fed to an MLP]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 99 123

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN:

Lang Code           without t-RNN  with t-RNN
no_nynorsklia (3k)  51.78          53.33
ru_taiga (11k)      59.13          60.55
gl_treegal (15k)    69.76          70.45
hu_szeged (20k)     66.12          68.18
sv_lines (49k)      74.04          75.46
tr_imst (50k)       58.12          58.75
ar_padt (120k)      68.04          68.14
en_ewt (204k)       74.87          75.77
cs_cac (473k)       82.89          83.57
cs_pdt (1M)         81.17          81.16

t-RNN provides a comparative advantage for low-resource languages.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 100 123

Ablation Analysis

Overall results of the ablation analysis:

Lang       MLP    Only A  Only-β  Only-σ  w/o t-RNN  all
hu_szeged  66.21  66.87   66.94   67.03   66.12      68.18
sv_lines   71.12  72.05   72.17   74.04   72.17      75.46
tr_imst    57.12  56.87   57.02   57.12   58.12      58.75
ar_padt    67.83  66.67   66.89   66.92   68.04      68.14
cs_cac     83.89  82.23   83.13   83.17   82.89      83.57
en_ewt     75.54  75.43   75.56   75.67   74.87      75.77

Tree-stack LSTM beats the other model variations.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of Ablation Experiments

- t-RNN's performance contribution increases as the training size decreases.

- σ-LSTM provides more useful information, independent of dataset size.

- Interconnecting the model's components with the t-RNN makes tree-stack LSTM more powerful for low-resource languages (ranked 10th overall and 2nd among transition based parsers).

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What does the Morphological Feature Embedding provide?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental Settings
We divide the CoNLL18 UD dataset (v2.2) into 4 parts, based on the number of training tokens for each language, to better understand our contributions:

- Languages having less than 20k tokens

- Languages having more than 20k and less than 50k tokens

- Languages having more than 50k and less than 100k tokens

- Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123
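The four-way split above can be written as a simple bucketing function. This is an illustrative sketch (the function name and the example token counts, taken from the tables in this section, are my own):

```python
def bucket(n_tokens):
    """Assign a treebank to one of the four training-size buckets."""
    if n_tokens < 20_000:
        return "<20k"
    elif n_tokens < 50_000:
        return "20k-50k"
    elif n_tokens < 100_000:
        return "50k-100k"
    else:
        return ">=100k"

# Illustrative counts from the tables below
treebanks = {"ru_taiga": 10_479, "id_gsd": 97_531, "cs_pdt": 1_173_282}
buckets = {name: bucket(n) for name, n in treebanks.items()}
```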

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having less than 20k training tokens:

Lang code      Morph-Feats  no Morph-Feats  # of tokens
no_nynorsklia  51.13        53.33           3,583
ru_taiga       58.32        60.55           10,479
sme_giella     52.78        53.39           16,385
la_perseus     49.93        51.6            18,184
ug_udt         52.78        53.39           19,262
sl_sst         46.72        48.77           19,473
hu_szeged      66.23        68.18           20,166

Not useful for languages having less than 20k training tokens.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having between 50k and 100k training tokens:

Lang code      Morph-Feats  no Morph-Feats  # of tokens
sv_lines       72.18        74.81           48,325
fr_sequoia     84.36        82.17           50,543
en_gum         76.44        75.34           53,686
ko_gsd         73.74        72.54           56,687
eu_bdt         74.55        73.32           72,974
nl_lassysmall  76.7         75.8            75,134
gl_ctg         79.02        79.018          79,327
lv_lvtb        72.33        72.24           80,666
id_gsd         75.76        73.97           97,531

Beneficial for languages with 50k-100k training tokens.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having more than 100k training tokens:

Lang code  Morph-Feats  no Morph-Feats  # of tokens
fa_seraji  81.18        81.12           121,064
bg_btb     84.53        84.55           124,336
en_ewt     75.77        75.682          204,585
ar_padt    68.02        68.14           223,881
de_gsd     71.59        71.32           263,804
ca_ancora  85.89        85.874          417,587
es_ancora  84.99        84.78           444,617
cs_cac     83.57        83.63           472,608
cs_pdt     81.43        82.12           1,173,282

Neutral for languages having more than 100k training tokens.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle: transitions are chosen using gold moves.
Dynamic oracle: transitions are chosen using predicted moves.

In both cases, the log-probability of the gold moves is maximized.

[Figure: Tree-stack LSTM architecture, with the t-RNN combining head word, dependent word, and dependency relation]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123
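The difference between the two training regimes can be sketched as a single loop. This is a hedged illustration, not the thesis implementation: the `model`, `oracle`, and `state` interfaces are hypothetical, and the exploration probability `explore_p` is an assumed hyperparameter.

```python
import math
import random

def train_sentence(model, oracle, state, dynamic=False, explore_p=0.1):
    """One pass over a sentence; accumulates -log p(gold move) at each step.

    Static oracle (dynamic=False): the parser always follows the gold move.
    Dynamic oracle (dynamic=True): the parser sometimes follows its own
    prediction, so training also visits states the static oracle never sees.
    """
    loss = 0.0
    while not state.is_final():
        probs = model.predict(state)          # distribution over transitions
        gold = oracle.gold_move(state)        # best move w.r.t. the gold tree
        loss -= math.log(probs[gold])         # log p of gold moves is maximized
        if dynamic and random.random() < explore_p:
            move = max(probs, key=probs.get)  # follow the model's prediction
        else:
            move = gold                       # follow the gold move
        state.apply(move)
    return loss
```

In both branches the same gold-move negative log-likelihood is minimized; only the sequence of visited states differs.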

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning:

1. Using very limited data to train an LM for word and context vectors, and using them to train a parser from scratch
2. Using Facebook's word vectors to train a parser [Bojanowski et al. 2017]
3. Using my own word and context vectors, trained on a different language from the same language family
4. Applying transfer learning with a pre-trained parser

Language      (1)           (2)    (3)    (4)
af_afribooms  not provided  75.46  77.43  78.12
kk_ktb        20.19         22.31  21.96  23.86
bxr_bdt       7.64          9.76   9.93   8.98
kmr_mg        20.12         22.57  22.78  23.39

Table: LAS values for strategies (1), (2), (3), and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123
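Strategy (4) amounts to initializing a low-resource parser from the weights of a parser trained on a related language. A minimal sketch, assuming a hypothetical parameter-dict interface (the parameter names are illustrative, not the thesis's):

```python
import numpy as np

def warm_start(target_params, source_params):
    """Copy every source parameter whose name and shape match the target.

    target_params / source_params: dicts mapping parameter names to arrays.
    Returns the list of parameter names that were transferred.
    """
    copied = []
    for name, weights in source_params.items():
        if name in target_params and target_params[name].shape == weights.shape:
            target_params[name] = weights.copy()
            copied.append(name)
    return copied
```

Parameters with no counterpart (e.g. language-specific embeddings of a different vocabulary size) are simply left at their fresh initialization.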

Transfer Learning

Conclusions of Transfer Learning Experiments

- Applying transfer learning with a pre-trained parser is the most beneficial.

- From-scratch LM training does not yield useful word and context vectors.

- Our word and context vectors are still more useful than Facebook's [Bojanowski et al. 2017].

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition based parsers can only build projective trees.

(Figure from http://stp.lingfil.uu.se/~sara/kurser/5LN455-2014/lectures/5LN455-F8.pdf)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123
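Projectivity can be checked directly: a tree is projective iff no two dependency arcs cross. A minimal sketch (the head-array encoding is my own choice; 0 denotes the root):

```python
def is_projective(heads):
    """Check projectivity of a dependency tree.

    heads[i] is the head of token i+1 (tokens are 1-based); 0 denotes the root.
    Two arcs (l1, r1) and (l2, r2) cross iff one starts strictly inside the
    other and ends strictly outside it.
    """
    arcs = [(min(h, d), max(h, d)) for d, h in enumerate(heads, start=1)]
    for i, (l1, r1) in enumerate(arcs):
        for (l2, r2) in arcs[i + 1:]:
            if l1 < l2 < r1 < r2 or l2 < l1 < r2 < r1:  # crossing arcs
                return False
    return True
```

For example, the introduction's "Economic news had little effect ..." fragment yields a projective tree, while any sentence with a long-distance attachment that jumps over an intervening arc does not.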

Projective vs Non-projective

We compared our model with the best model for different projectivity ratios:

Language     Projectivity  Best (LAS)  Our (LAS)
grc_perseus  90.7          79.39       55.03 (20)
eu_bdt       95.13         84.22       74.13 (17)
hu_szeged    97.8          82.66       68.18 (14)
da_ddt       98.26         86.28       76.40 (17)
en_gum       99.6          85.05       76.44 (15)
gl_treegal   100           74.25       70.45 (10)
gl_ctg       100           82.12       79.45 (14)

Table: Our model's performance gap decreases as the projectivity ratio increases.
(From the official results page and our projectivity table.)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusion:
We introduced "Context, Word, and Morph-feat" embeddings and showed their contribution to transition based dependency parsing.

- Our Tree-stack LSTM outperformed the MLP while removing hand-crafted feature engineering.

- Tree-stack LSTM performed better on low-resource languages.

- As the training dataset size increases, tree-stack LSTM loses its advantage.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization, morphological tagging, and dependency parsing performed better. Some also jointly train a language model together with pre-trained embeddings.

Attention Mechanism

Applying attention between σ-LSTM, β-LSTM, or Action-LSTM states may bring a performance improvement.

Morphological Features

Finding a different way to represent morphological features.

Dynamic Oracle vs Beam Training

Although I tried both of them, I could not obtain a performance improvement. There may be convergence problems with our loss function, and other losses (e.g., CRF) may solve this problem.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap, Erenay Dayanık, and Deniz Yuret. 2018. Tree-stack LSTM in Transition Based Dependency Parsing. In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap, Berkay Furkan Onder, and Deniz Yuret. 2017. Parsing with Context Embeddings. In Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann, Carlos Gómez-Rodríguez, and Giorgio Satta. 2011. Dynamic programming algorithms for transition-based dependency parsers. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies - Volume 1, Association for Computational Linguistics, pages 673-682.

S. Kübler, R. McDonald, and J. Nivre. 2009. Dependency Parsing. Morgan & Claypool, US.

Chris Dyer, Miguel Ballesteros, Wang Ling, Austin Matthews, and Noah A. Smith. 2015. Transition-based dependency parsing with stack long short-term memory. CoRR abs/1505.08075.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123


Action-LSTM

[Figure: Tree-stack LSTM architecture with the Action-LSTM component highlighted; the σ-LSTM, β-LSTM, and Action-LSTM outputs are concatenated and fed to an MLP, and the t-RNN combines head word, dependent word, and dependency relation]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 68 123

Action-LSTM

[Figure: Action-LSTM]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 69 123

How are the components of tree-stack LSTM connected?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 70 123

Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 71 123

Tree-RNN (t-RNN)

[Figure: t-RNN combining the dependent word, dependency relation, and head word]

w_head_new = tanh(W_rnn * [w_head_old; d_l; w_dep] + b_rnn)    (1)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 72 123
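Equation (1) composes a new head representation from the old head vector, the relation embedding, and the dependent vector. A minimal numpy sketch (function name and dimensions are illustrative, not the thesis code):

```python
import numpy as np

def t_rnn(w_head_old, d_label, w_dep, W_rnn, b_rnn):
    """w_head_new = tanh(W_rnn @ [w_head_old; d_label; w_dep] + b_rnn) -- Eq. (1)."""
    x = np.concatenate([w_head_old, d_label, w_dep])
    return np.tanh(W_rnn @ x + b_rnn)

dim, label_dim = 8, 4           # illustrative sizes
rng = np.random.default_rng(0)
W = rng.normal(size=(dim, 2 * dim + label_dim)) * 0.1
b = np.zeros(dim)
new_head = t_rnn(rng.normal(size=dim), rng.normal(size=label_dim),
                 rng.normal(size=dim), W, b)
```

The output has the same dimensionality as a word vector, so the new head can replace the old one on the stack and be composed again by later transitions.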

Tree-RNN with:

1. Left Transition
2. Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 73 123

Left Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 74 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

[Figure: Each embedding is initialized by concatenating POS, language, and morph-feat embeddings]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 75 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

[Figure: The stack's top LSTM is reduced]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 76 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

[Figure: The t-RNN calculates the new head embedding]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 77 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

[Figure: The β-LSTM recalculates its hidden state based on the new input]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 78 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

[Figure: The tree-stack LSTM is ready to predict the next transition]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 79 123

Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 80 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

[Figure: Each embedding is initialized by concatenating POS, language, and morph-feat embeddings]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 81 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

[Figure: The stack's top LSTM is reduced]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 82 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

[Figure: The t-RNN calculates the new head embedding]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 83 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

[Figure: The σ-LSTM recalculates its hidden state from the new input]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 84 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

[Figure: The tree-stack LSTM is ready to predict the next transition]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 85 123
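The left and right transition definitions above, together with shift, can be sketched as plain operations on a (stack, buffer, arcs) state. This is an illustration of the transition system only, not the thesis implementation (which additionally predicts the label d with a classifier); arcs are stored as (head, label, dependent) triples:

```python
def shift(stack, buffer, arcs):
    """Move the buffer front onto the stack."""
    stack.append(buffer.pop(0))

def left_arc(stack, buffer, arcs, label):
    # left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)}):
    # the buffer front b becomes the head of the stack top s, which is popped.
    s = stack.pop()
    arcs.add((buffer[0], label, s))

def right_arc(stack, buffer, arcs, label):
    # right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)}):
    # the second-top s becomes the head of the top t, which is popped.
    t = stack.pop()
    arcs.add((stack[-1], label, t))
```

For the three-token sequence [1, 2, 3] with 2 heading both 1 and 3, a valid derivation is: shift, left_arc, shift, shift, right_arc.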

Final overview of Tree-stack LSTM

[Figure: Full tree-stack LSTM; the σ-LSTM, β-LSTM, and Action-LSTM outputs are concatenated and fed to an MLP, while the t-RNN combines head word, dependent word, and dependency relation]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 86 123

Overview

1. Introduction: Overview of Dependency Parsing; Transition Based Dependency Parsing
2. Related Work: Linear Models and their Drawbacks; Neural Network Models
3. Model: Language Model; MLP Parser; Tree-stack LSTM Parser
4. Results: MLP vs Tree-stack LSTM; Morphological Feature Embeddings; Static vs Dynamic Oracle Training; Transfer Learning
5. Conclusion
6. Future Work & Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 87 123

4 Results amp Comparisons

Omer Kırnap (Koc University) MSc Thesis September 27 2018 88 123

Results amp Comparisons

Dataset

Dependency parsing of 81

treebanks in 49 languages

All treebanks use standardized

annotation

17 universal

part-of-speech tags

37 universal dependency

relations

Koc-University ranked 7th out

of 33 participants (1st among

transition based parsers)

Dependency parsing of 82

treebanks in 57 languages

All treebanks use

standardized annotation

17 universal

part-of-speech tags

37 universal dependency

relations

Koc-University ranked 16th

out of 30 participants (2nd

among transition based

parsers)

CoNLL17 CoNLL181 Traintest split change 2 Annotation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 89 123

MLP vs Tree-stack LSTM

CoNLL 2018 committee released comparison results of CoNLL17 andCoNLL18 systems tested under the same test sets

Omer Kırnap (Koc University) MSc Thesis September 27 2018 90 123

MLP vs Tree-stack LSTM

2 possible problems of official comparison

1 If the annotation of the tree bank is improved the older parser ishandicapped

2 If the training-test split has changed and the old training data arenow in test data the old parser is favored undeservedly

Omer Kırnap (Koc University) MSc Thesis September 27 2018 91 123

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code MLP Tree-stackru taiga (10k) 5889 6055hu szeged (20k) 6621 6818tr imst (50k) 5678 5875ar padt (120k) 6783 6814en ewt (205k) 7487 7577cs cac (473k) 8339 8357

Tree-stack LSTM outperforms MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 92 123

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 93 123

MLP Parser

MLP

Figure Initial model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 94 123

Only Action LSTM

LSTM LSTM

Figure Only action LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 95 123

Only β-LSTM

LSTM LSTM LSTM

LSTM LSTM MLP

Figure Only β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 96 123

Only σ-LSTM

LSTM LSTM

LSTM LSTM MLP

Figure Only σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang Code MLP Only Action Only-β Only-σhu szeged 6621 6687 6694 6703sv lines 7112 7205 7217 7245tr imst 5712 5687 5702 5712ar padt 6783 6667 6689 6692

cs cac 8389 8223 8313 8317

en ewt 7554 7543 7556 7567

Table Comparison between MLP and rdquoOnlyrdquo models

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 99 123

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang Code without t-RNN with t-RNNno nynorsklia (3k) 5178 5333ru taiga (11k) 5913 6055gl treegal (15k) 6976 7045hu szeged (20k) 6612 6818sv lines (49k) 7404 7546tr imst (50k) 5812 5875

ar padt (120k) 6804 6814

en ewt (204k) 7487 7577

cs cac (473k) 8289 8357

cs pdt (1M) 8117 81164

t-RNN provides comparative advantage for low-resourcelanguages

Omer Kırnap (Koc University) MSc Thesis September 27 2018 100 123

Ablation Analysis

Overall results of ablation analysis

Lang MLP Only A Only-β Only-σ wot-RNN allhu szeged 6621 6687 6694 6703 6612 6818sv lines 7112 7205 7217 7404 7217 7546tr imst 5712 5687 5702 5712 5812 5875ar padt 6783 6667 6689 6692 6804 6814cs cac 8389 8223 8313 8317 8289 8357en ewt 7554 7543 7556 7567 7487 7577

Tree-stack LSTM beats other model variations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of Ablation Experiments

t-RNNrsquos performance contribution increases when the training sizedecreases

σ-LSTM provides more useful information independent from datasetsize

Interconnecting modelrsquos component with t-RNN makes tree-stackLSTM more powerful for low-resource languages (ranked 10th of alland 2nd among transition based parsers)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What does Morphological Feature Embedding provide

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental SettingsWe divide Conll18 UD dataset 22 into 4 parts based on number oftraining tokens for each language to better understand our contributions

Languages having less than 20k tokens

Languages having more than 20k less than 50k tokens

Languages having more than 50k less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having less than 20k training tokens

Lang code Morph-Feats no Morph-Feats of tokensno nynorsklia 5113 5333 3583

ru taiga 5832 6055 10479

sme giella 5278 5339 16385

la perseus 4993 516 18184

ug udt 5278 5339 19262

sl sst 4672 4877 19473

hu szeged 6623 6818 20166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having tokens in between 50k and100k

Lang code Morph-Feats no Morph-Feats of tokenssv lines 7218 7481 48325

fr sequoia 8436 8217 50543

en gum 7644 7534 53686

ko gsd 7374 7254 56687

eu bdt 7455 7332 72974

nl lassymal 767 758 75134

gl ctg 7902 79018 79327

lv lvtb 7233 7224 80666

id gsd 7576 7397 97531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having more than 100k trainingtokens

Lang code Morph-Feats no Morph-Feats of tokensfa seraji 8118 8112 121064

bg btb 8453 8455 124336

en ewt 7577 75682 204585

ar padt 6802 6814 223881

de gsd 7159 7132 263804

ca ancora 8589 85874 417587

es ancora 8499 8478 444617

cs cac 8357 8363 472608

cs pdt 8143 8212 1173282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle transitions using gold movesDynamic oracle transitions using predicted moves

In both cases logp of gold moves maximized

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning1 Using very limited data to train LM for word and context vectors and

use them to train a parser from scratch2 Using Facebookrsquos word vectors to train a parser [Bojanowski et al

2017]3 Using my own word and context vectors trained with different

language but from the same language family4 Applying transfer learning with a pre-trained parser

Language (1) (2) (3) (4)af afribooms not provided 7546 7743 7812kk ktb 2019 2231 2196 2386bxr bdt 764 976 993 898

kmr mg 2012 2257 2278 2339

Table LAS values for strategies (1) (2) (3) and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the mostbeneficial

From scratch LM training does not bring useful word and contextvectors

Our word and context vectors are still more useful than Facebookrsquos[Bojanowski et al 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition Based Parser can only build projective trees 6

6Figure fromhttpstplingfiluuse sarakurser5LN455-2014lectures5LN455-F8pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123

Projective vs Non-projective

We compared our model with the best model for different projectivityratios

Language Projectiviy Best (LAS) Our (LAS)

grc perseus 907 7939 5503 (20)

eu bdt 9513 8422 7413 (17)

hu szeged 978 8266 6818 (14)

da ddt 9826 8628 7640 (17)

en gum 996 8505 7644 (15)

gl treegal 100 7425 7045 (10)

gl ctg 100 8212 7945 (14)

Table Our models performance gap decreases as the projectivity ratio increases

7

7From official results page and our projectivity tableOmer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusionWe introduced ldquoContext Word and Morph-featrdquo embeddings and showedtheir contribution in transition based dependency parsing

Our Tree-stack LSTM outperformed MLP by removing hand-craftedfeature engineering

Tree-stack LSTM performed better with low resource languages

When the training dataset size increases tree-stack LSTM losses itsadvantage

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization morphological taggingand dependency parsing performed better Some are also jointly trained alanguage model together with pre-trained embeddings

Attention Mechanism

Applying attention in between σ-LSTM states or β-LSTM orAction-LSTM may bring performance improvement

Morphological Features

Finding different way to represent morphological features

Dynamic Oracle vs Beam Training

Although I tried both of them I could not obtain performanceimprovement There may be convergence problems with our loss functionand another losses (CRF) may solve this problem

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap Erenay Dayanık and Deniz Yuret 2018 Tree-stackLSTM in Transition Based Dependency Parsing In Proceedings ofthe CoNLL 2018 Shared Task Multilingual Parsing from Raw Text toUniversal Dependencies

Omer Kırnap Berkay Furkan Onder and Deniz Yuret 2017 Parsingwith Context Embeddings In Proceedings of the CoNLL 2017 SharedTask Multilingual Parsing from Raw Text to Universal Dependencies

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann Carlos Gomez-Rodriguez and Giorgio Satta 2011Dynamic programming algorithms for transition-based dependencyparsers In Proceedings of the 49th Annual Meeting of theAssociation for Computational Linguistics Human LanguageTechnologies-Volume 1 Association for Computational Linguisticspages 673682

S Kbler R McDonald and J Nivre 2009 Dependency parsingMorgan amp Claypool US

Chris Dyer Miguel Ballesteros Wang Ling Austin Matthews andNoah A Smith 2015 Transition based dependency parsing withstack long-short term memory CoRR abs150508075

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123

  • Introduction
    • Overview of Dependency Parsing
    • Transition Based Dependency Parsing
      • Related Work
        • Linear Models and their Drawbacks
        • Neural Network Models
          • Model
            • Language Model
            • MLP Parser
            • Tree-stack LSTM Parser
              • Results
                • MLP vs Tree-stack LSTM
                • Morphological Feature Embeddings
                • Static vs Dynamic Oracle Training
                • Transfer Learning
                  • Conclusion
                  • Future Work amp Discussions
Page 69: Transition Based Dependency Parsing with Deep LearningTransition Based Dependency Parsing with Deep Learning Omer K rnap Ko˘c University okirnap@ku.edu.tr September 27, 2018 Omer

Action-LSTM

LSTM LSTM LSTM

Figure Action-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 69 123

How do components of tree-stack LSTM are connected

Omer Kırnap (Koc University) MSc Thesis September 27 2018 70 123

Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 71 123

Tree-RNN (t-RNN)

t-RNN

Dependent word

Dependency Relation

Head word

Figure t-RNN

whead new = tanh(Wrnn lowast [whead old dl wdep] + brnn) (1)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 72 123

Tree-RNN with

1 Left Transition2 Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 73 123

Left Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 74 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM LSTM

Left transition

t-RNN

Dependency Relation

HeadDependent

Figure Each embedding initiated by concatenating POS language andmorph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 75 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

Figure Stackrsquos top LSTM is reducedOmer Kırnap (Koc University) MSc Thesis September 27 2018 76 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 77 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure β-LSTM recalculates its hidden based on new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 78 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure Tree-stack LSTM is ready to give new transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 79 123

Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 80 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure Each embedding initiated by concatenating POS language andmorph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 81 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure Stackrsquos top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 82 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 83 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure σ-LSTM recalculates its hidden from new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 84 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure Tree-stack LSTM is ready to give new transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 85 123

Final overview of Tree-stack LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 86 123

Overview

1 IntroductionOverview of Dependency ParsingTransition Based Dependency Parsing

2 Related WorkLinear Models and their DrawbacksNeural Network Models

3 ModelLanguage ModelMLP ParserTree-stack LSTM Parser

4 ResultsMLP vs Tree-stack LSTMMorphological Feature EmbeddingsStatic vs Dynamic Oracle TrainingTransfer Learning

5 Conclusion6 Future Work amp Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 87 123

4 Results amp Comparisons

Omer Kırnap (Koc University) MSc Thesis September 27 2018 88 123

Results amp Comparisons

Dataset

Dependency parsing of 81

treebanks in 49 languages

All treebanks use standardized

annotation

17 universal

part-of-speech tags

37 universal dependency

relations

Koc-University ranked 7th out

of 33 participants (1st among

transition based parsers)

Dependency parsing of 82

treebanks in 57 languages

All treebanks use

standardized annotation

17 universal

part-of-speech tags

37 universal dependency

relations

Koc-University ranked 16th

out of 30 participants (2nd

among transition based

parsers)

CoNLL17 CoNLL181 Traintest split change 2 Annotation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 89 123

MLP vs Tree-stack LSTM

CoNLL 2018 committee released comparison results of CoNLL17 andCoNLL18 systems tested under the same test sets

Omer Kırnap (Koc University) MSc Thesis September 27 2018 90 123

MLP vs Tree-stack LSTM

Two possible problems with the official comparison:

1. If the annotation of the treebank is improved, the older parser is handicapped.

2. If the training-test split has changed and the old training data are now in the test data, the old parser is favored undeservedly.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 91 123

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code        MLP    Tree-stack
ru taiga (10k)   58.89  60.55
hu szeged (20k)  66.21  68.18
tr imst (50k)    56.78  58.75
ar padt (120k)   67.83  68.14
en ewt (205k)    74.87  75.77
cs cac (473k)    83.39  83.57

Tree-stack LSTM outperforms MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 92 123

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 93 123

MLP Parser

Figure: Initial model (MLP)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 94 123

Only Action LSTM

Figure: Only the action LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 95 123

Only β-LSTM

Figure: Only the β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 96 123

Only σ-LSTM

Figure: Only the σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang Code  MLP    Only Action  Only-β  Only-σ
hu szeged  66.21  66.87        66.94   67.03
sv lines   71.12  72.05        72.17   72.45
tr imst    57.12  56.87        57.02   57.12
ar padt    67.83  66.67        66.89   66.92
cs cac     83.89  82.23        83.13   83.17
en ewt     75.54  75.43        75.56   75.67

Table: Comparison between MLP and "Only" models

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN

Figure: Overview of the tree-stack LSTM (σ-LSTM, β-LSTM and action-LSTM states concatenated with the t-RNN representations of head word, dependent word and dependency relation, then passed to an MLP).

Omer Kırnap (Koc University) MSc Thesis September 27 2018 99 123

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang Code           without t-RNN  with t-RNN
no nynorsklia (3k)  51.78          53.33
ru taiga (11k)      59.13          60.55
gl treegal (15k)    69.76          70.45
hu szeged (20k)     66.12          68.18
sv lines (49k)      74.04          75.46
tr imst (50k)       58.12          58.75
ar padt (120k)      68.04          68.14
en ewt (204k)       74.87          75.77
cs cac (473k)       82.89          83.57
cs pdt (1M)         81.17          81.164

t-RNN provides a comparative advantage for low-resource languages

Omer Kırnap (Koc University) MSc Thesis September 27 2018 100 123

Ablation Analysis

Overall results of ablation analysis

Lang       MLP    Only A  Only-β  Only-σ  w/o t-RNN  all
hu szeged  66.21  66.87   66.94   67.03   66.12      68.18
sv lines   71.12  72.05   72.17   74.04   72.17      75.46
tr imst    57.12  56.87   57.02   57.12   58.12      58.75
ar padt    67.83  66.67   66.89   66.92   68.04      68.14
cs cac     83.89  82.23   83.13   83.17   82.89      83.57
en ewt     75.54  75.43   75.56   75.67   74.87      75.77

Tree-stack LSTM beats the other model variations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of Ablation Experiments

t-RNN's performance contribution increases as the training size decreases.

σ-LSTM provides more useful information, independent of dataset size.

Interconnecting the model's components with the t-RNN makes the tree-stack LSTM more powerful for low-resource languages (ranked 10th overall and 2nd among transition-based parsers).

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What does Morphological Feature Embedding provide?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental Settings

We divide the CoNLL18 UD dataset (v2.2) into 4 parts based on the number of training tokens for each language, to better understand our contributions:

Languages having less than 20k tokens

Languages having more than 20k, less than 50k tokens

Languages having more than 50k, less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123
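The four-way split above can be expressed as a small helper; the token counts used below come from the tables in the following slides, not the full CoNLL18 list:

```python
# Bucket treebanks by their number of training tokens, following the
# <20k / 20k-50k / 50k-100k / >=100k split used in the experiments.

def bucket(n_tokens):
    if n_tokens < 20_000:
        return "<20k"
    if n_tokens < 50_000:
        return "20k-50k"
    if n_tokens < 100_000:
        return "50k-100k"
    return ">=100k"

treebanks = {"no_nynorsklia": 3_583, "sv_lines": 48_325,
             "id_gsd": 97_531, "cs_pdt": 1_173_282}
groups = {code: bucket(n) for code, n in treebanks.items()}
# e.g. groups["sv_lines"] == "20k-50k"
```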

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having less than 20k training tokens:

Lang code      Morph-Feats  no Morph-Feats  # of tokens
no nynorsklia  51.13        53.33             3,583
ru taiga       58.32        60.55            10,479
sme giella     52.78        53.39            16,385
la perseus     49.93        51.6             18,184
ug udt         52.78        53.39            19,262
sl sst         46.72        48.77            19,473
hu szeged      66.23        68.18            20,166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having between 50k and 100k training tokens:

Lang code    Morph-Feats  no Morph-Feats  # of tokens
sv lines     72.18        74.81            48,325
fr sequoia   84.36        82.17            50,543
en gum       76.44        75.34            53,686
ko gsd       73.74        72.54            56,687
eu bdt       74.55        73.32            72,974
nl lassymal  76.7         75.8             75,134
gl ctg       79.02        79.018           79,327
lv lvtb      72.33        72.24            80,666
id gsd       75.76        73.97            97,531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having more than 100k training tokens:

Lang code  Morph-Feats  no Morph-Feats  # of tokens
fa seraji  81.18        81.12             121,064
bg btb     84.53        84.55             124,336
en ewt     75.77        75.682            204,585
ar padt    68.02        68.14             223,881
de gsd     71.59        71.32             263,804
ca ancora  85.89        85.874            417,587
es ancora  84.99        84.78             444,617
cs cac     83.57        83.63             472,608
cs pdt     81.43        82.12           1,173,282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle: transitions follow gold moves.
Dynamic oracle: transitions follow predicted moves.

In both cases, the log-probability of the gold moves is maximized.

Figure: Overview of the tree-stack LSTM (σ-LSTM, β-LSTM and action-LSTM states concatenated with the t-RNN representations of head word, dependent word and dependency relation, then passed to an MLP).

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123
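The two regimes can be contrasted in a schematic training step. `predict` and `gold_move` are hypothetical stand-ins for the model and the oracle; only the move the parser *follows* differs, while the loss is always −log p of the gold move:

```python
# Static oracle: follow the gold move. Dynamic oracle: follow the
# model's predicted move. Either way, maximize log p(gold move).
import math
import random

random.seed(1)
MOVES = ["shift", "left", "right"]

def predict(state):                       # stand-in model: random scores
    return {m: random.random() for m in MOVES}

def gold_move(state):                     # stand-in oracle
    return MOVES[state % len(MOVES)]

def training_step(state, dynamic):
    scores = predict(state)
    z = sum(math.exp(s) for s in scores.values())
    loss = -math.log(math.exp(scores[gold_move(state)]) / z)  # -log p(gold)
    follow = (max(scores, key=scores.get) if dynamic          # predicted move
              else gold_move(state))                          # gold move
    return loss, follow

loss_s, move_s = training_step(0, dynamic=False)
loss_d, move_d = training_step(0, dynamic=True)
# move_s is always the gold move; move_d may differ from it
```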

Static vs Dynamic Oracle Training

Figure: Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure: Results are very close for training tokens between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure: Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning:

1. Using very limited data to train an LM for word and context vectors, and using them to train a parser from scratch
2. Using Facebook's word vectors to train a parser [Bojanowski et al., 2017]
3. Using my own word and context vectors, trained on a different language from the same language family
4. Applying transfer learning with a pre-trained parser

Language      (1)           (2)    (3)    (4)
af afribooms  not provided  75.46  77.43  78.12
kk ktb        20.19         22.31  21.96  23.86
bxr bdt        7.64          9.76   9.93   8.98
kmr mg        20.12         22.57  22.78  23.39

Table: LAS values for strategies (1), (2), (3) and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123
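Strategy (4) can be sketched as copying shared parameters from a related language's trained parser before fine-tuning on the low-resource treebank. The parameter names and values below are hypothetical placeholders, not the thesis implementation:

```python
# Initialize a target-language parser from a pre-trained parser:
# copy the shared layers (e.g. the transition MLP), keep the
# target-specific ones (e.g. the word lookup table).

def transfer(pretrained, target_params, shared_keys):
    out = dict(target_params)
    for k in shared_keys:
        out[k] = pretrained[k]          # overwrite with pre-trained weights
    return out

pretrained = {"mlp": "weights_from_related_language", "word_emb": "src_vocab"}
target = {"mlp": "random_init", "word_emb": "kk_ktb_vocab"}
params = transfer(pretrained, target, shared_keys=["mlp"])
# params["mlp"] comes from the pre-trained parser;
# params["word_emb"] stays target-specific
```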

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the most beneficial.

From-scratch LM training does not produce useful word and context vectors.

Our word and context vectors are still more useful than Facebook's [Bojanowski et al., 2017].

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

A transition-based parser can only build projective trees.6

6. Figure from https://stp.lingfil.uu.se/~sara/kurser/5LN455-2014/lectures/5LN455-F8.pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123
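A tree is projective exactly when no two dependency arcs cross. A quick check, assuming a 1-based head list with 0 standing for the root:

```python
# heads[i-1] is the head index of word i (0 = root). Two arcs cross
# when they overlap without one nesting inside the other.

def is_projective(heads):
    arcs = [(min(h, d), max(h, d)) for d, h in enumerate(heads, start=1)]
    for (l1, r1) in arcs:
        for (l2, r2) in arcs:
            if l1 < l2 < r1 < r2:      # overlapping, non-nested arcs
                return False
    return True

print(is_projective([2, 0, 2]))        # projective tree
print(is_projective([3, 4, 0, 3]))     # arcs (1,3) and (2,4) cross
```

This is why the gap to the best (often graph-based) systems widens on treebanks with many non-projective sentences, as the next table shows.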

Projective vs Non-projective

We compared our model with the best model for different projectivity ratios:

Language     Projectivity  Best (LAS)  Our (LAS)
grc perseus   90.7         79.39       55.03 (20)
eu bdt        95.13        84.22       74.13 (17)
hu szeged     97.8         82.66       68.18 (14)
da ddt        98.26        86.28       76.40 (17)
en gum        99.6         85.05       76.44 (15)
gl treegal   100           74.25       70.45 (10)
gl ctg       100           82.12       79.45 (14)

Table: Our model's performance gap decreases as the projectivity ratio increases.7

7. From the official results page and our projectivity table.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusion:

We introduced "Context, Word and Morph-feat" embeddings and showed their contribution in transition-based dependency parsing.

Our tree-stack LSTM outperformed the MLP by removing hand-crafted feature engineering.

The tree-stack LSTM performed better on low-resource languages.

As the training dataset size increases, the tree-stack LSTM loses its advantage.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization, morphological tagging and dependency parsing performed better. Some also jointly train a language model together with pre-trained embeddings.

Attention Mechanism

Applying attention between σ-LSTM, β-LSTM or action-LSTM states may bring a performance improvement.

Morphological Features

Finding a different way to represent morphological features.

Dynamic Oracle vs Beam Training

Although I tried both of them, I could not obtain a performance improvement. There may be convergence problems with our loss function, and other losses (e.g. CRF) may solve this problem.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Ömer Kırnap, Erenay Dayanık and Deniz Yuret. 2018. Tree-stack LSTM in Transition Based Dependency Parsing. In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Ömer Kırnap, Berkay Furkan Önder and Deniz Yuret. 2017. Parsing with Context Embeddings. In Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann, Carlos Gómez-Rodríguez and Giorgio Satta. 2011. Dynamic programming algorithms for transition-based dependency parsers. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Volume 1, pages 673-682. Association for Computational Linguistics.

Sandra Kübler, Ryan McDonald and Joakim Nivre. 2009. Dependency Parsing. Morgan & Claypool, US.

Chris Dyer, Miguel Ballesteros, Wang Ling, Austin Matthews and Noah A. Smith. 2015. Transition-based dependency parsing with stack long short-term memory. CoRR abs/1505.08075.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123

• Introduction
  • Overview of Dependency Parsing
  • Transition Based Dependency Parsing
• Related Work
  • Linear Models and their Drawbacks
  • Neural Network Models
• Model
  • Language Model
  • MLP Parser
  • Tree-stack LSTM Parser
• Results
  • MLP vs Tree-stack LSTM
  • Morphological Feature Embeddings
  • Static vs Dynamic Oracle Training
  • Transfer Learning
• Conclusion
• Future Work & Discussions
Page 70: Transition Based Dependency Parsing with Deep LearningTransition Based Dependency Parsing with Deep Learning Omer K rnap Ko˘c University okirnap@ku.edu.tr September 27, 2018 Omer

How do components of tree-stack LSTM are connected

Omer Kırnap (Koc University) MSc Thesis September 27 2018 70 123

Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 71 123

Tree-RNN (t-RNN)

t-RNN

Dependent word

Dependency Relation

Head word

Figure t-RNN

whead new = tanh(Wrnn lowast [whead old dl wdep] + brnn) (1)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 72 123

Tree-RNN with

1 Left Transition2 Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 73 123

Left Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 74 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM LSTM

Left transition

t-RNN

Dependency Relation

HeadDependent

Figure Each embedding initiated by concatenating POS language andmorph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 75 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

Figure Stackrsquos top LSTM is reducedOmer Kırnap (Koc University) MSc Thesis September 27 2018 76 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 77 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure β-LSTM recalculates its hidden based on new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 78 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure Tree-stack LSTM is ready to give new transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 79 123

Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 80 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure Each embedding initiated by concatenating POS language andmorph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 81 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure Stackrsquos top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 82 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 83 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure σ-LSTM recalculates its hidden from new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 84 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure Tree-stack LSTM is ready to give new transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 85 123

Final overview of Tree-stack LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 86 123

Overview

1 IntroductionOverview of Dependency ParsingTransition Based Dependency Parsing

2 Related WorkLinear Models and their DrawbacksNeural Network Models

3 ModelLanguage ModelMLP ParserTree-stack LSTM Parser

4 ResultsMLP vs Tree-stack LSTMMorphological Feature EmbeddingsStatic vs Dynamic Oracle TrainingTransfer Learning

5 Conclusion6 Future Work amp Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 87 123

4 Results amp Comparisons

Omer Kırnap (Koc University) MSc Thesis September 27 2018 88 123

Results amp Comparisons

Dataset

Dependency parsing of 81

treebanks in 49 languages

All treebanks use standardized

annotation

17 universal

part-of-speech tags

37 universal dependency

relations

Koc-University ranked 7th out

of 33 participants (1st among

transition based parsers)

Dependency parsing of 82

treebanks in 57 languages

All treebanks use

standardized annotation

17 universal

part-of-speech tags

37 universal dependency

relations

Koc-University ranked 16th

out of 30 participants (2nd

among transition based

parsers)

CoNLL17 CoNLL181 Traintest split change 2 Annotation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 89 123

MLP vs Tree-stack LSTM

CoNLL 2018 committee released comparison results of CoNLL17 andCoNLL18 systems tested under the same test sets

Omer Kırnap (Koc University) MSc Thesis September 27 2018 90 123

MLP vs Tree-stack LSTM

2 possible problems of official comparison

1 If the annotation of the tree bank is improved the older parser ishandicapped

2 If the training-test split has changed and the old training data arenow in test data the old parser is favored undeservedly

Omer Kırnap (Koc University) MSc Thesis September 27 2018 91 123

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code MLP Tree-stackru taiga (10k) 5889 6055hu szeged (20k) 6621 6818tr imst (50k) 5678 5875ar padt (120k) 6783 6814en ewt (205k) 7487 7577cs cac (473k) 8339 8357

Tree-stack LSTM outperforms MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 92 123

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 93 123

MLP Parser

MLP

Figure Initial model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 94 123

Only Action LSTM

LSTM LSTM

Figure Only action LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 95 123

Only β-LSTM

LSTM LSTM LSTM

LSTM LSTM MLP

Figure Only β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 96 123

Only σ-LSTM

LSTM LSTM

LSTM LSTM MLP

Figure Only σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang Code MLP Only Action Only-β Only-σhu szeged 6621 6687 6694 6703sv lines 7112 7205 7217 7245tr imst 5712 5687 5702 5712ar padt 6783 6667 6689 6692

cs cac 8389 8223 8313 8317

en ewt 7554 7543 7556 7567

Table Comparison between MLP and rdquoOnlyrdquo models

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 99 123

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang Code without t-RNN with t-RNNno nynorsklia (3k) 5178 5333ru taiga (11k) 5913 6055gl treegal (15k) 6976 7045hu szeged (20k) 6612 6818sv lines (49k) 7404 7546tr imst (50k) 5812 5875

ar padt (120k) 6804 6814

en ewt (204k) 7487 7577

cs cac (473k) 8289 8357

cs pdt (1M) 8117 81164

t-RNN provides comparative advantage for low-resourcelanguages

Omer Kırnap (Koc University) MSc Thesis September 27 2018 100 123

Ablation Analysis

Overall results of ablation analysis

Lang MLP Only A Only-β Only-σ wot-RNN allhu szeged 6621 6687 6694 6703 6612 6818sv lines 7112 7205 7217 7404 7217 7546tr imst 5712 5687 5702 5712 5812 5875ar padt 6783 6667 6689 6692 6804 6814cs cac 8389 8223 8313 8317 8289 8357en ewt 7554 7543 7556 7567 7487 7577

Tree-stack LSTM beats other model variations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of Ablation Experiments

t-RNNrsquos performance contribution increases when the training sizedecreases

σ-LSTM provides more useful information independent from datasetsize

Interconnecting modelrsquos component with t-RNN makes tree-stackLSTM more powerful for low-resource languages (ranked 10th of alland 2nd among transition based parsers)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What does Morphological Feature Embedding provide

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental SettingsWe divide Conll18 UD dataset 22 into 4 parts based on number oftraining tokens for each language to better understand our contributions

Languages having less than 20k tokens

Languages having more than 20k less than 50k tokens

Languages having more than 50k less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having less than 20k training tokens

Lang code Morph-Feats no Morph-Feats of tokensno nynorsklia 5113 5333 3583

ru taiga 5832 6055 10479

sme giella 5278 5339 16385

la perseus 4993 516 18184

ug udt 5278 5339 19262

sl sst 4672 4877 19473

hu szeged 6623 6818 20166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having tokens in between 50k and100k

Lang code Morph-Feats no Morph-Feats of tokenssv lines 7218 7481 48325

fr sequoia 8436 8217 50543

en gum 7644 7534 53686

ko gsd 7374 7254 56687

eu bdt 7455 7332 72974

nl lassymal 767 758 75134

gl ctg 7902 79018 79327

lv lvtb 7233 7224 80666

id gsd 7576 7397 97531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having more than 100k trainingtokens

Lang code Morph-Feats no Morph-Feats of tokensfa seraji 8118 8112 121064

bg btb 8453 8455 124336

en ewt 7577 75682 204585

ar padt 6802 6814 223881

de gsd 7159 7132 263804

ca ancora 8589 85874 417587

es ancora 8499 8478 444617

cs cac 8357 8363 472608

cs pdt 8143 8212 1173282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle transitions using gold movesDynamic oracle transitions using predicted moves

In both cases logp of gold moves maximized

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning1 Using very limited data to train LM for word and context vectors and

use them to train a parser from scratch2 Using Facebookrsquos word vectors to train a parser [Bojanowski et al

2017]3 Using my own word and context vectors trained with different

language but from the same language family4 Applying transfer learning with a pre-trained parser

Language (1) (2) (3) (4)af afribooms not provided 7546 7743 7812kk ktb 2019 2231 2196 2386bxr bdt 764 976 993 898

kmr mg 2012 2257 2278 2339

Table LAS values for strategies (1) (2) (3) and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the mostbeneficial

From scratch LM training does not bring useful word and contextvectors

Our word and context vectors are still more useful than Facebookrsquos[Bojanowski et al 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition Based Parser can only build projective trees 6

6Figure fromhttpstplingfiluuse sarakurser5LN455-2014lectures5LN455-F8pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123

Projective vs Non-projective

We compared our model with the best model for different projectivityratios

Language Projectiviy Best (LAS) Our (LAS)

grc perseus 907 7939 5503 (20)

eu bdt 9513 8422 7413 (17)

hu szeged 978 8266 6818 (14)

da ddt 9826 8628 7640 (17)

en gum 996 8505 7644 (15)

gl treegal 100 7425 7045 (10)

gl ctg 100 8212 7945 (14)

Table Our models performance gap decreases as the projectivity ratio increases

7

7From official results page and our projectivity tableOmer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusionWe introduced ldquoContext Word and Morph-featrdquo embeddings and showedtheir contribution in transition based dependency parsing

Our Tree-stack LSTM outperformed MLP by removing hand-craftedfeature engineering

Tree-stack LSTM performed better with low resource languages

When the training dataset size increases tree-stack LSTM losses itsadvantage

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization morphological taggingand dependency parsing performed better Some are also jointly trained alanguage model together with pre-trained embeddings

Attention Mechanism

Applying attention in between σ-LSTM states or β-LSTM orAction-LSTM may bring performance improvement

Morphological Features

Finding different way to represent morphological features

Dynamic Oracle vs Beam Training

Although I tried both of them I could not obtain performanceimprovement There may be convergence problems with our loss functionand another losses (CRF) may solve this problem

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap Erenay Dayanık and Deniz Yuret 2018 Tree-stackLSTM in Transition Based Dependency Parsing In Proceedings ofthe CoNLL 2018 Shared Task Multilingual Parsing from Raw Text toUniversal Dependencies

Omer Kırnap Berkay Furkan Onder and Deniz Yuret 2017 Parsingwith Context Embeddings In Proceedings of the CoNLL 2017 SharedTask Multilingual Parsing from Raw Text to Universal Dependencies

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann Carlos Gomez-Rodriguez and Giorgio Satta 2011Dynamic programming algorithms for transition-based dependencyparsers In Proceedings of the 49th Annual Meeting of theAssociation for Computational Linguistics Human LanguageTechnologies-Volume 1 Association for Computational Linguisticspages 673682

S Kbler R McDonald and J Nivre 2009 Dependency parsingMorgan amp Claypool US

Chris Dyer Miguel Ballesteros Wang Ling Austin Matthews andNoah A Smith 2015 Transition based dependency parsing withstack long-short term memory CoRR abs150508075

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123

  • Introduction
    • Overview of Dependency Parsing
    • Transition Based Dependency Parsing
      • Related Work
        • Linear Models and their Drawbacks
        • Neural Network Models
          • Model
            • Language Model
            • MLP Parser
            • Tree-stack LSTM Parser
              • Results
                • MLP vs Tree-stack LSTM
                • Morphological Feature Embeddings
                • Static vs Dynamic Oracle Training
                • Transfer Learning
                  • Conclusion
                  • Future Work amp Discussions
Page 71: Transition Based Dependency Parsing with Deep LearningTransition Based Dependency Parsing with Deep Learning Omer K rnap Ko˘c University okirnap@ku.edu.tr September 27, 2018 Omer

Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 71 123

Tree-RNN (t-RNN)

t-RNN

Dependent word

Dependency Relation

Head word

Figure t-RNN

whead new = tanh(Wrnn lowast [whead old dl wdep] + brnn) (1)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 72 123

Tree-RNN with

1 Left Transition2 Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 73 123

Left Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 74 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM LSTM

Left transition

t-RNN

Dependency Relation

HeadDependent

Figure Each embedding initiated by concatenating POS language andmorph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 75 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

Figure Stackrsquos top LSTM is reducedOmer Kırnap (Koc University) MSc Thesis September 27 2018 76 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 77 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure β-LSTM recalculates its hidden based on new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 78 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure Tree-stack LSTM is ready to give new transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 79 123

Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 80 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure Each embedding initiated by concatenating POS language andmorph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 81 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure Stackrsquos top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 82 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 83 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure σ-LSTM recalculates its hidden from new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 84 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure Tree-stack LSTM is ready to give new transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 85 123

Final overview of Tree-stack LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 86 123

Overview

1 IntroductionOverview of Dependency ParsingTransition Based Dependency Parsing

2 Related WorkLinear Models and their DrawbacksNeural Network Models

3 ModelLanguage ModelMLP ParserTree-stack LSTM Parser

4 ResultsMLP vs Tree-stack LSTMMorphological Feature EmbeddingsStatic vs Dynamic Oracle TrainingTransfer Learning

5 Conclusion6 Future Work amp Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 87 123

4 Results amp Comparisons

Omer Kırnap (Koc University) MSc Thesis September 27 2018 88 123

Results & Comparisons

Dataset

CoNLL17:
Dependency parsing of 81 treebanks in 49 languages.
All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations.
Koç University ranked 7th out of 33 participants (1st among transition based parsers).

CoNLL18:
Dependency parsing of 82 treebanks in 57 languages.
All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations.
Koç University ranked 16th out of 30 participants (2nd among transition based parsers).

Changes from CoNLL17 to CoNLL18: 1. Train/test split change; 2. Annotation.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 89 123

MLP vs Tree-stack LSTM

The CoNLL 2018 committee released comparison results of CoNLL17 and CoNLL18 systems evaluated on the same test sets.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 90 123

MLP vs Tree-stack LSTM

2 possible problems with the official comparison:

1. If the annotation of the treebank has been improved, the older parser is handicapped.

2. If the training-test split has changed and the old training data are now in the test data, the old parser is favored undeservedly.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 91 123

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code        MLP    Tree-stack
ru taiga (10k)   58.89  60.55
hu szeged (20k)  66.21  68.18
tr imst (50k)    56.78  58.75
ar padt (120k)   67.83  68.14
en ewt (205k)    74.87  75.77
cs cac (473k)    83.39  83.57

Tree-stack LSTM outperforms MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 92 123

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 93 123

MLP Parser

Figure: Initial model (MLP only).

Omer Kırnap (Koc University) MSc Thesis September 27 2018 94 123

Only Action LSTM

Figure: Only the action LSTM.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 95 123

Only β-LSTM

Figure: Only β-LSTM.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 96 123

Only σ-LSTM

Figure: Only σ-LSTM.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang Code   MLP    Only Action  Only-β  Only-σ
hu szeged   66.21  66.87        66.94   67.03
sv lines    71.12  72.05        72.17   72.45
tr imst     57.12  56.87        57.02   57.12
ar padt     67.83  66.67        66.89   66.92
cs cac      83.89  82.23        83.13   83.17
en ewt      75.54  75.43        75.56   75.67

Table: Comparison between MLP and "Only" models.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN

Figure: Full Tree-stack LSTM architecture, highlighting the t-RNN component.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 99 123

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN:

Lang Code            without t-RNN  with t-RNN
no nynorsklia (3k)   51.78          53.33
ru taiga (11k)       59.13          60.55
gl treegal (15k)     69.76          70.45
hu szeged (20k)      66.12          68.18
sv lines (49k)       74.04          75.46
tr imst (50k)        58.12          58.75
ar padt (120k)       68.04          68.14
en ewt (204k)        74.87          75.77
cs cac (473k)        82.89          83.57
cs pdt (1M)          81.17          81.164

t-RNN provides a comparative advantage for low-resource languages.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 100 123

Ablation Analysis

Overall results of the ablation analysis:

Lang       MLP    Only A  Only-β  Only-σ  w/o t-RNN  all
hu szeged  66.21  66.87   66.94   67.03   66.12      68.18
sv lines   71.12  72.05   72.17   74.04   72.17      75.46
tr imst    57.12  56.87   57.02   57.12   58.12      58.75
ar padt    67.83  66.67   66.89   66.92   68.04      68.14
cs cac     83.89  82.23   83.13   83.17   82.89      83.57
en ewt     75.54  75.43   75.56   75.67   74.87      75.77

Tree-stack LSTM beats other model variations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of Ablation Experiments

t-RNN's performance contribution increases as the training size decreases.

σ-LSTM provides more useful information, independent of dataset size.

Interconnecting the model's components with t-RNN makes Tree-stack LSTM more powerful for low-resource languages (ranked 10th overall and 2nd among transition based parsers).

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What does Morphological Feature Embedding provide

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental Settings: We divide the CoNLL18 UD v2.2 dataset into 4 parts based on the number of training tokens for each language, to better understand our contributions:

Languages having less than 20k tokens

Languages having more than 20k less than 50k tokens

Languages having more than 50k less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123
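The four-way split above can be expressed as a simple bucketing over per-language training-token counts. A small sketch; the sample counts come from the tables on the following slides:

```python
# Bucket languages by training-token count, mirroring the four groups
# used in the analysis. Sample counts come from the slides' tables.
token_counts = {"no nynorsklia": 3583, "sv lines": 48325,
                "id gsd": 97531, "cs pdt": 1173282}

def bucket(n_tokens):
    if n_tokens < 20_000:
        return "<20k"
    if n_tokens < 50_000:
        return "20k-50k"
    if n_tokens < 100_000:
        return "50k-100k"
    return ">=100k"

buckets = {lang: bucket(n) for lang, n in token_counts.items()}
```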

Contribution of Morph-feat embeddings

Morph-feat experiments for languages with fewer than 20k training tokens:

Lang code      Morph-Feats  no Morph-Feats  # of tokens
no nynorsklia  51.13        53.33           3583
ru taiga       58.32        60.55           10479
sme giella     52.78        53.39           16385
la perseus     49.93        51.6            18184
ug udt         52.78        53.39           19262
sl sst         46.72        48.77           19473
hu szeged      66.23        68.18           20166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages with 50k to 100k training tokens:

Lang code      Morph-Feats  no Morph-Feats  # of tokens
sv lines       72.18        74.81           48325
fr sequoia     84.36        82.17           50543
en gum         76.44        75.34           53686
ko gsd         73.74        72.54           56687
eu bdt         74.55        73.32           72974
nl lassysmall  76.7         75.8            75134
gl ctg         79.02        79.018          79327
lv lvtb        72.33        72.24           80666
id gsd         75.76        73.97           97531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages with more than 100k training tokens:

Lang code  Morph-Feats  no Morph-Feats  # of tokens
fa seraji  81.18        81.12           121064
bg btb     84.53        84.55           124336
en ewt     75.77        75.682          204585
ar padt    68.02        68.14           223881
de gsd     71.59        71.32           263804
ca ancora  85.89        85.874          417587
es ancora  84.99        84.78           444617
cs cac     83.57        83.63           472608
cs pdt     81.43        82.12           1173282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle: transitions follow gold moves.
Dynamic oracle: transitions follow predicted moves.

In both cases, the log-probability of gold moves is maximized.

Figure: Tree-stack LSTM architecture.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123
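The distinction can be sketched as one training loop with two rollout policies: both accumulate the negative log-probability of the gold move, but the state advances with either the gold move (static) or the predicted move (dynamic). The toy state and uniform model below are hypothetical stand-ins, not the thesis implementation:

```python
import math

# Static vs. dynamic oracle training: both accumulate -log p(gold move),
# but the state advances with the gold move (static) or the model's
# predicted move (dynamic). ToyState and the uniform model are hypothetical.

class ToyState:
    def __init__(self, n):
        self.remaining = n
    def is_final(self):
        return self.remaining == 0
    def apply(self, move):
        return ToyState(self.remaining - 1)

def train_sentence(state, log_prob, predict, gold_move, mode="static"):
    loss = 0.0
    while not state.is_final():
        gold = gold_move(state)
        loss -= log_prob(state, gold)                  # -log p(gold) in both modes
        move = gold if mode == "static" else predict(state)
        state = state.apply(move)                      # gold vs. predicted rollout
    return loss

# A uniform 3-move model contributes -log(1/3) per step over 4 steps:
loss = train_sentence(ToyState(4), lambda s, m: math.log(1 / 3),
                      lambda s: "shift", lambda s: "shift", mode="dynamic")
```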

Static vs Dynamic Oracle Training

Figure: Results are very close for training sets with fewer than 20k tokens.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure: Results are very close for training sets between 20k and 50k tokens.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure: Results are very close for training sets with more than 50k tokens.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

What about languages with fewer than 20k training tokens?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning:

1. Using very limited data to train an LM for word and context vectors, and using them to train a parser from scratch
2. Using Facebook's word vectors to train a parser [Bojanowski et al. 2017]
3. Using my own word and context vectors, trained on a different language from the same language family
4. Applying transfer learning with a pre-trained parser

Language      (1)           (2)    (3)    (4)
af afribooms  not provided  75.46  77.43  78.12
kk ktb        20.19         22.31  21.96  23.86
bxr bdt       7.64          9.76   9.93   8.98
kmr mg        20.12         22.57  22.78  23.39

Table: LAS values for strategies (1), (2), (3), and (4).

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the most beneficial.

Training an LM from scratch does not yield useful word and context vectors.

Our word and context vectors are still more useful than Facebook's [Bojanowski et al. 2017].

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition based parsers can only build projective trees.6

6Figure fromhttpstplingfiluuse sarakurser5LN455-2014lectures5LN455-F8pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123
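Projectivity can be checked directly: a dependency tree is projective iff no two arcs cross when drawn above the sentence. A minimal sketch over (head, dependent) position pairs, assuming 1-based word positions with 0 for the root:

```python
# A tree is projective iff no two arcs cross. Arcs are (head, dependent)
# pairs over word positions (0 = root); the convention is illustrative.

def is_projective(arcs):
    spans = [tuple(sorted(a)) for a in arcs]
    for a1, b1 in spans:
        for a2, b2 in spans:
            if a1 < a2 < b1 < b2:   # strictly interleaved endpoints: crossing
                return False
    return True

projective = is_projective([(0, 2), (2, 1), (2, 3)])       # simple chain
non_projective = is_projective([(0, 1), (1, 3), (2, 4)])   # arcs (1,3),(2,4) cross
```

Transition systems like the ones above can only derive arc sets that pass this check, which motivates the projectivity comparison on the next slide.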

Projective vs Non-projective

We compared our model with the best model at different projectivity ratios.

Language     Projectivity (%)  Best (LAS)  Our (LAS)
grc perseus  90.7              79.39       55.03 (20)
eu bdt       95.13             84.22       74.13 (17)
hu szeged    97.8              82.66       68.18 (14)
da ddt       98.26             86.28       76.40 (17)
en gum       99.6              85.05       76.44 (15)
gl treegal   100               74.25       70.45 (10)
gl ctg       100               82.12       79.45 (14)

Table: Our model's performance gap decreases as the projectivity ratio increases.7

7 From the official results page and our projectivity table.
Omer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusion:
We introduced "Context, Word, and Morph-feat" embeddings and showed their contribution to transition based dependency parsing.

Our Tree-stack LSTM outperformed the MLP while removing hand-crafted feature engineering.

Tree-stack LSTM performed better on low-resource languages.

As the training dataset size increases, Tree-stack LSTM loses its advantage.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization, morphological tagging, and dependency parsing performed better. Some also jointly train a language model together with pre-trained embeddings.

Attention Mechanism

Applying attention between σ-LSTM, β-LSTM, or Action-LSTM states may bring performance improvements.

Morphological Features

Finding a different way to represent morphological features.

Dynamic Oracle vs Beam Training

Although I tried both of them, I could not obtain a performance improvement. There may be convergence problems with our loss function, and other losses (e.g. CRF) may solve this problem.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Ömer Kırnap, Erenay Dayanık, and Deniz Yuret. 2018. Tree-stack LSTM in Transition Based Dependency Parsing. In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Ömer Kırnap, Berkay Furkan Önder, and Deniz Yuret. 2017. Parsing with Context Embeddings. In Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann, Carlos Gómez-Rodríguez, and Giorgio Satta. 2011. Dynamic programming algorithms for transition-based dependency parsers. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Volume 1, Association for Computational Linguistics, pages 673-682.

S. Kübler, R. McDonald, and J. Nivre. 2009. Dependency Parsing. Morgan & Claypool, US.

Chris Dyer, Miguel Ballesteros, Wang Ling, Austin Matthews, and Noah A. Smith. 2015. Transition-based dependency parsing with stack long short-term memory. CoRR abs/1505.08075.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123

Page 73: Transition Based Dependency Parsing with Deep LearningTransition Based Dependency Parsing with Deep Learning Omer K rnap Ko˘c University okirnap@ku.edu.tr September 27, 2018 Omer

Tree-RNN with

1 Left Transition2 Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 73 123

Left Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 74 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM LSTM

Left transition

t-RNN

Dependency Relation

HeadDependent

Figure Each embedding initiated by concatenating POS language andmorph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 75 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

Figure Stackrsquos top LSTM is reducedOmer Kırnap (Koc University) MSc Thesis September 27 2018 76 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 77 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure β-LSTM recalculates its hidden based on new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 78 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure Tree-stack LSTM is ready to give new transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 79 123

Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 80 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure Each embedding initiated by concatenating POS language andmorph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 81 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure Stackrsquos top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 82 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 83 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure σ-LSTM recalculates its hidden from new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 84 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure Tree-stack LSTM is ready to give new transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 85 123

Final overview of Tree-stack LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 86 123

Overview

1 IntroductionOverview of Dependency ParsingTransition Based Dependency Parsing

2 Related WorkLinear Models and their DrawbacksNeural Network Models

3 ModelLanguage ModelMLP ParserTree-stack LSTM Parser

4 ResultsMLP vs Tree-stack LSTMMorphological Feature EmbeddingsStatic vs Dynamic Oracle TrainingTransfer Learning

5 Conclusion6 Future Work amp Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 87 123

4 Results amp Comparisons

Omer Kırnap (Koc University) MSc Thesis September 27 2018 88 123

Results & Comparisons

Dataset

CoNLL17:
Dependency parsing of 81 treebanks in 49 languages.
All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations.
Koc-University ranked 7th out of 33 participants (1st among transition based parsers).

CoNLL18:
Dependency parsing of 82 treebanks in 57 languages.
All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations.
Koc-University ranked 16th out of 30 participants (2nd among transition based parsers).

Differences between CoNLL17 and CoNLL18: 1. Train/test split change 2. Annotation

MLP vs Tree-stack LSTM

The CoNLL 2018 committee released comparison results of CoNLL17 and CoNLL18 systems tested on the same test sets


MLP vs Tree-stack LSTM

Two possible problems with the official comparison:

1. If the annotation of the treebank is improved, the older parser is handicapped.

2. If the training-test split has changed and the old training data are now in the test data, the old parser is favored undeservedly.

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code         MLP    Tree-stack
ru taiga (10k)    58.89  60.55
hu szeged (20k)   66.21  68.18
tr imst (50k)     56.78  58.75
ar padt (120k)    67.83  68.14
en ewt (205k)     74.87  75.77
cs cac (473k)     83.39  83.57

Tree-stack LSTM outperforms MLP


Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM


MLP Parser

Figure: Initial model (MLP)

Only Action LSTM

Figure: Only action LSTM

Only β-LSTM

Figure: Only β-LSTM

Only σ-LSTM

Figure: Only σ-LSTM

Ablation Analysis Results

Lang Code   MLP    Only Action  Only-β  Only-σ
hu szeged   66.21  66.87        66.94   67.03
sv lines    71.12  72.05        72.17   72.45
tr imst     57.12  56.87        57.02   57.12
ar padt     67.83  66.67        66.89   66.92
cs cac      83.89  82.23        83.13   83.17
en ewt      75.54  75.43        75.56   75.67

Table: Comparison between MLP and "Only" models

Ablation of t-RNN

Figure: Tree-stack LSTM architecture with the t-RNN that composes head word, dependent word, and dependency relation embeddings

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang Code           without t-RNN  with t-RNN
no nynorsklia (3k)  51.78          53.33
ru taiga (11k)      59.13          60.55
gl treegal (15k)    69.76          70.45
hu szeged (20k)     66.12          68.18
sv lines (49k)      74.04          75.46
tr imst (50k)       58.12          58.75
ar padt (120k)      68.04          68.14
en ewt (204k)       74.87          75.77
cs cac (473k)       82.89          83.57
cs pdt (1M)         81.17          81.16

t-RNN provides a comparative advantage for low-resource languages


Ablation Analysis

Overall results of ablation analysis

Lang        MLP    Only A  Only-β  Only-σ  w/o t-RNN  all
hu szeged   66.21  66.87   66.94   67.03   66.12      68.18
sv lines    71.12  72.05   72.17   74.04   72.17      75.46
tr imst     57.12  56.87   57.02   57.12   58.12      58.75
ar padt     67.83  66.67   66.89   66.92   68.04      68.14
cs cac      83.89  82.23   83.13   83.17   82.89      83.57
en ewt      75.54  75.43   75.56   75.67   74.87      75.77

Tree-stack LSTM beats other model variations


Ablation Analysis

Conclusions of Ablation Experiments

t-RNN's performance contribution increases when the training size decreases

σ-LSTM provides more useful information independent of dataset size

Interconnecting the model's components with t-RNN makes tree-stack LSTM more powerful for low-resource languages (ranked 10th overall and 2nd among transition based parsers)

What does Morphological Feature Embedding provide?

Contribution of Morph-feat Embeddings

Experimental Settings: we divide the CoNLL18 UD 2.2 dataset into 4 parts based on the number of training tokens per language, to better understand our contributions:

Languages having less than 20k tokens

Languages having more than 20k, less than 50k tokens

Languages having more than 50k, less than 100k tokens

Languages having 100k tokens or more

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having less than 20k training tokens

Lang code      Morph-Feats  no Morph-Feats  # of tokens
no nynorsklia  51.13        53.33           3,583
ru taiga       58.32        60.55           10,479
sme giella     52.78        53.39           16,385
la perseus     49.93        51.60           18,184
ug udt         52.78        53.39           19,262
sl sst         46.72        48.77           19,473
hu szeged      66.23        68.18           20,166

Not useful for languages having less than 20k training tokens

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having between 50k and 100k training tokens

Lang code      Morph-Feats  no Morph-Feats  # of tokens
sv lines       72.18        74.81           48,325
fr sequoia     84.36        82.17           50,543
en gum         76.44        75.34           53,686
ko gsd         73.74        72.54           56,687
eu bdt         74.55        73.32           72,974
nl lassysmall  76.70        75.80           75,134
gl ctg         79.02        79.018          79,327
lv lvtb        72.33        72.24           80,666
id gsd         75.76        73.97           97,531

Beneficial for languages with 50k-100k training tokens

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having more than 100k training tokens

Lang code   Morph-Feats  no Morph-Feats  # of tokens
fa seraji   81.18        81.12           121,064
bg btb      84.53        84.55           124,336
en ewt      75.77        75.682          204,585
ar padt     68.02        68.14           223,881
de gsd      71.59        71.32           263,804
ca ancora   85.89        85.874          417,587
es ancora   84.99        84.78           444,617
cs cac      83.57        83.63           472,608
cs pdt      81.43        82.12           1,173,282

Neutral for languages having more than 100k training tokens

Static vs Dynamic Oracle Training

Static oracle: transitions follow the gold moves. Dynamic oracle: transitions follow the model's predicted moves.

In both cases, log p of the gold moves is maximized.
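The two regimes can be contrasted in a toy training loop: both accumulate -log p of the gold move, and they differ only in which move the parser actually follows. The uniform scorer below stands in for the real model and is purely illustrative.

```python
import math

def train_step(state, gold_moves, score, dynamic):
    """One pass over a toy parse; returns the summed -log p of gold moves.

    Static oracle: the parser follows the gold move at every step.
    Dynamic oracle: the parser follows its own predicted move, but the
    loss is still the -log p of the gold move for the current state.
    """
    loss = 0.0
    for gold in gold_moves:
        probs = score(state)                     # dict: move -> probability
        loss -= math.log(probs[gold])
        move = max(probs, key=probs.get) if dynamic else gold
        state = state + (move,)                  # toy state transition
    return loss

# Toy state-independent scorer over three moves (placeholder for the model).
score = lambda state: {"shift": 0.5, "left": 0.25, "right": 0.25}
gold = ["shift", "shift", "right"]
static_loss = train_step((), gold, score, dynamic=False)
dynamic_loss = train_step((), gold, score, dynamic=True)
```

With a real model the two regimes visit different states, so the dynamic oracle teaches the parser to recover from its own mistakes.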

Figure: Tree-stack LSTM architecture (σ-LSTM, β-LSTM, Action-LSTM, and t-RNN feeding the MLP)

Static vs Dynamic Oracle Training

Figure: Results are very close for training tokens less than 20k

Static vs Dynamic Oracle Training

Figure: Results are very close for training tokens between 20k and 50k

Static vs Dynamic Oracle Training

Figure: Results are very close for training tokens more than 50k

How about languages with less than 20k training tokens?

Transfer Learning

There are 4 possible types of transfer learning:

1. Using very limited data to train an LM for word and context vectors, and using them to train a parser from scratch

2. Using Facebook's word vectors to train a parser [Bojanowski et al. 2017]

3. Using my own word and context vectors trained on a different language from the same language family

4. Applying transfer learning with a pre-trained parser

Language      (1)           (2)    (3)    (4)
af afribooms  not provided  75.46  77.43  78.12
kk ktb        20.19         22.31  21.96  23.86
bxr bdt       7.64          9.76   9.93   8.98
kmr mg        20.12         22.57  22.78  23.39

Table: LAS values for strategies (1), (2), (3), and (4)

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the most beneficial

From-scratch LM training does not produce useful word and context vectors

Our word and context vectors are still more useful than Facebook's [Bojanowski et al. 2017]

Projectivity

Transition based parsers can only build projective trees.

Figure from https://stp.lingfil.uu.se/~sara/kurser/5LN455-2014/lectures/5LN455-F8.pdf
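Projectivity can be checked with a simple crossing-arcs test, treating ROOT as position 0. This is a generic sketch, not the thesis code; `heads` maps each 1-based token position to the position of its head.

```python
def is_projective(heads):
    """heads[i] is the head of token i (1-based positions, 0 = ROOT).

    A dependency tree is projective iff no two arcs cross when drawn
    above the sentence.
    """
    arcs = [tuple(sorted((h, d))) for d, h in heads.items()]
    for i, (a, b) in enumerate(arcs):
        for c, d in arcs[i + 1:]:
            # Two arcs cross if exactly one endpoint of one arc lies
            # strictly inside the span of the other.
            if a < c < b < d or c < a < d < b:
                return False
    return True

projective = is_projective({1: 2, 2: 0, 3: 2})       # small head-in-the-middle tree
crossing = is_projective({1: 3, 2: 3, 3: 0, 4: 2})   # arc (2,4) crosses arc (1,3)
```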

Projective vs Non-projective

We compared our model with the best model for different projectivity ratios (from the official results page and our projectivity table).

Language      Projectivity (%)  Best (LAS)  Our (LAS)
grc perseus   90.7              79.39       55.03 (20)
eu bdt        95.13             84.22       74.13 (17)
hu szeged     97.8              82.66       68.18 (14)
da ddt        98.26             86.28       76.40 (17)
en gum        99.6              85.05       76.44 (15)
gl treegal    100               74.25       70.45 (10)
gl ctg        100               82.12       79.45 (14)

Table: Our model's performance gap decreases as the projectivity ratio increases

Conclusions


Conclusion

In conclusion:

We introduced "Context, Word and Morph-feat" embeddings and showed their contribution in transition based dependency parsing.

Our Tree-stack LSTM outperformed the MLP by removing hand-crafted feature engineering.

Tree-stack LSTM performed better on low-resource languages.

When the training dataset size increases, tree-stack LSTM loses its advantage.

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization, morphological tagging, and dependency parsing performed better. Some also jointly train a language model together with pre-trained embeddings.

Attention Mechanism

Applying attention between σ-LSTM, β-LSTM, or Action-LSTM states may bring performance improvements.

Morphological Features

Finding different ways to represent morphological features.

Dynamic Oracle vs Beam Training

Although I tried both of them, I could not obtain a performance improvement. There may be convergence problems with our loss function, and other losses (e.g. CRF) may solve this problem.

Publications

Omer Kırnap, Erenay Dayanık, and Deniz Yuret. 2018. Tree-stack LSTM in Transition Based Dependency Parsing. In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap, Berkay Furkan Onder, and Deniz Yuret. 2017. Parsing with Context Embeddings. In Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

References

Marco Kuhlmann, Carlos Gómez-Rodríguez, and Giorgio Satta. 2011. Dynamic programming algorithms for transition-based dependency parsers. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies - Volume 1, Association for Computational Linguistics, pages 673-682.

S. Kübler, R. McDonald, and J. Nivre. 2009. Dependency Parsing. Morgan & Claypool, US.

Chris Dyer, Miguel Ballesteros, Wang Ling, Austin Matthews, and Noah A. Smith. 2015. Transition-based dependency parsing with stack long short-term memory. CoRR abs/1505.08075.

Thank you for your attention

Questions?


Left Transition


Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: Each embedding is initialized by concatenating POS, language, and morph-feat embeddings

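The initialization described in the caption can be sketched as follows. The lookup tables, the dimensions, and the choice to average multiple morph-feat vectors are assumptions for illustration, not the thesis configuration.

```python
import numpy as np

rng = np.random.default_rng(2)

# Hypothetical lookup tables; sizes are illustrative.
pos_emb = {"NOUN": rng.normal(size=4), "VERB": rng.normal(size=4)}
lang_emb = {"en": rng.normal(size=2)}
feat_emb = {"Number=Sing": rng.normal(size=3), "Tense=Past": rng.normal(size=3)}

def init_embedding(pos, lang, feats):
    """Concatenate POS, language, and (averaged) morph-feat embeddings."""
    feat_vec = np.mean([feat_emb[f] for f in feats], axis=0)
    return np.concatenate([pos_emb[pos], lang_emb[lang], feat_vec])

vec = init_embedding("VERB", "en", ["Tense=Past"])  # 4 + 2 + 3 = 9 dims
```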

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: Stack's top LSTM is reduced

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: t-RNN calculates the new head embedding

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: β-LSTM recalculates its hidden state based on the new input

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: Tree-stack LSTM is ready to predict the next transition

Right Transition


Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: Each embedding is initialized by concatenating POS, language, and morph-feat embeddings

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: Stack's top LSTM is reduced

Page 75: Transition Based Dependency Parsing with Deep LearningTransition Based Dependency Parsing with Deep Learning Omer K rnap Ko˘c University okirnap@ku.edu.tr September 27, 2018 Omer

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM LSTM

Left transition

t-RNN

Dependency Relation

HeadDependent

Figure Each embedding initiated by concatenating POS language andmorph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 75 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

Figure Stackrsquos top LSTM is reducedOmer Kırnap (Koc University) MSc Thesis September 27 2018 76 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 77 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure β-LSTM recalculates its hidden based on new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 78 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure Tree-stack LSTM is ready to give new transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 79 123

Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 80 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure Each embedding initiated by concatenating POS language andmorph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 81 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure Stackrsquos top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 82 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 83 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure σ-LSTM recalculates its hidden from new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 84 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure Tree-stack LSTM is ready to give new transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 85 123

Final overview of Tree-stack LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 86 123

Overview

1 IntroductionOverview of Dependency ParsingTransition Based Dependency Parsing

2 Related WorkLinear Models and their DrawbacksNeural Network Models

3 ModelLanguage ModelMLP ParserTree-stack LSTM Parser

4 ResultsMLP vs Tree-stack LSTMMorphological Feature EmbeddingsStatic vs Dynamic Oracle TrainingTransfer Learning

5 Conclusion6 Future Work amp Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 87 123

4 Results amp Comparisons

Omer Kırnap (Koc University) MSc Thesis September 27 2018 88 123

Results amp Comparisons

Dataset

Dependency parsing of 81

treebanks in 49 languages

All treebanks use standardized

annotation

17 universal

part-of-speech tags

37 universal dependency

relations

Koc-University ranked 7th out

of 33 participants (1st among

transition based parsers)

Dependency parsing of 82

treebanks in 57 languages

All treebanks use

standardized annotation

17 universal

part-of-speech tags

37 universal dependency

relations

Koc-University ranked 16th

out of 30 participants (2nd

among transition based

parsers)

CoNLL17 CoNLL181 Traintest split change 2 Annotation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 89 123

MLP vs Tree-stack LSTM

CoNLL 2018 committee released comparison results of CoNLL17 andCoNLL18 systems tested under the same test sets

Omer Kırnap (Koc University) MSc Thesis September 27 2018 90 123

MLP vs Tree-stack LSTM

2 possible problems of official comparison

1 If the annotation of the tree bank is improved the older parser ishandicapped

2 If the training-test split has changed and the old training data arenow in test data the old parser is favored undeservedly

Omer Kırnap (Koc University) MSc Thesis September 27 2018 91 123

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code        MLP    Tree-stack
ru taiga (10k)   58.89  60.55
hu szeged (20k)  66.21  68.18
tr imst (50k)    56.78  58.75
ar padt (120k)   67.83  68.14
en ewt (205k)    74.87  75.77
cs cac (473k)    83.39  83.57

Tree-stack LSTM outperforms MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 92 123

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 93 123

MLP Parser

MLP

Figure Initial model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 94 123

Only Action LSTM

LSTM LSTM

Figure Only action LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 95 123

Only β-LSTM

LSTM LSTM LSTM

LSTM LSTM MLP

Figure Only β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 96 123

Only σ-LSTM

LSTM LSTM

LSTM LSTM MLP

Figure Only σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang Code  MLP    Only Action  Only-β  Only-σ
hu szeged  66.21  66.87        66.94   67.03
sv lines   71.12  72.05        72.17   72.45
tr imst    57.12  56.87        57.02   57.12
ar padt    67.83  66.67        66.89   66.92
cs cac     83.89  82.23        83.13   83.17
en ewt     75.54  75.43        75.56   75.67

Table Comparison between MLP and "Only" models

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 99 123

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang Code           without t-RNN  with t-RNN
no nynorsklia (3k)  51.78          53.33
ru taiga (11k)      59.13          60.55
gl treegal (15k)    69.76          70.45
hu szeged (20k)     66.12          68.18
sv lines (49k)      74.04          75.46
tr imst (50k)       58.12          58.75
ar padt (120k)      68.04          68.14
en ewt (204k)       74.87          75.77
cs cac (473k)       82.89          83.57
cs pdt (1M)         81.17          81.16

t-RNN provides a comparative advantage for low-resource languages

Omer Kırnap (Koc University) MSc Thesis September 27 2018 100 123

Ablation Analysis

Overall results of ablation analysis

Lang       MLP    Only A  Only-β  Only-σ  w/o t-RNN  all
hu szeged  66.21  66.87   66.94   67.03   66.12      68.18
sv lines   71.12  72.05   72.17   72.45   74.04      75.46
tr imst    57.12  56.87   57.02   57.12   58.12      58.75
ar padt    67.83  66.67   66.89   66.92   68.04      68.14
cs cac     83.89  82.23   83.13   83.17   82.89      83.57
en ewt     75.54  75.43   75.56   75.67   74.87      75.77

Tree-stack LSTM beats the other model variants

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of Ablation Experiments

t-RNN's performance contribution increases as the training size decreases

σ-LSTM provides more useful information, independent of dataset size

Interconnecting the model's components with t-RNN makes tree-stack LSTM more powerful for low-resource languages (ranked 10th overall and 2nd among transition based parsers)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What does Morphological Feature Embedding provide

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental Settings: We divide the CoNLL18 UD v2.2 dataset into 4 parts based on the number of training tokens for each language, to better understand our contributions

Languages having less than 20k tokens

Languages having more than 20k and less than 50k tokens

Languages having more than 50k and less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123
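The four-way split above can be written as a small helper (thresholds as stated on the slide; the slides' own tables occasionally straddle a boundary):

```python
def size_bucket(n_tokens):
    # Token-count partition used for the morph-feat experiments
    if n_tokens < 20_000:
        return "<20k"
    if n_tokens < 50_000:
        return "20k-50k"
    if n_tokens < 100_000:
        return "50k-100k"
    return ">=100k"
```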

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having less than 20k training tokens

Lang code      Morph-Feats  no Morph-Feats  # of tokens
no nynorsklia  51.13        53.33           3,583
ru taiga       58.32        60.55           10,479
sme giella     52.78        53.39           16,385
la perseus     49.93        51.60           18,184
ug udt         52.78        53.39           19,262
sl sst         46.72        48.77           19,473
hu szeged      66.23        68.18           20,166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having between 50k and 100k training tokens

Lang code      Morph-Feats  no Morph-Feats  # of tokens
sv lines       72.18        74.81           48,325
fr sequoia     84.36        82.17           50,543
en gum         76.44        75.34           53,686
ko gsd         73.74        72.54           56,687
eu bdt         74.55        73.32           72,974
nl lassysmall  76.7         75.8            75,134
gl ctg         79.02        79.018          79,327
lv lvtb        72.33        72.24           80,666
id gsd         75.76        73.97           97,531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having more than 100k training tokens

Lang code  Morph-Feats  no Morph-Feats  # of tokens
fa seraji  81.18        81.12           121,064
bg btb     84.53        84.55           124,336
en ewt     75.77        75.682          204,585
ar padt    68.02        68.14           223,881
de gsd     71.59        71.32           263,804
ca ancora  85.89        85.874          417,587
es ancora  84.99        84.78           444,617
cs cac     83.57        83.63           472,608
cs pdt     81.43        82.12           1,173,282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle: transitions using gold moves.
Dynamic oracle: transitions using predicted moves.

In both cases, log p of the gold moves is maximized.

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123
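The two regimes differ only in which move advances the parser state; both maximize log p of the gold move. A schematic training step over a toy probability table (all names illustrative, not the thesis implementation):

```python
import math

def oracle_step(probs, gold_move, dynamic):
    # probs: dict mapping each transition to its predicted probability
    loss = -math.log(probs[gold_move])     # maximize log p of the gold move
    predicted = max(probs, key=probs.get)
    # Static oracle advances with the gold move; dynamic with the predicted one
    next_move = predicted if dynamic else gold_move
    return loss, next_move
```

The loss is identical under both oracles; only the state the parser explores next changes, which is why the dynamic oracle can teach the model to recover from its own mistakes.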

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning:

1. Using very limited data to train an LM for word and context vectors, and using them to train a parser from scratch
2. Using Facebook's word vectors to train a parser [Bojanowski et al., 2017]
3. Using my own word and context vectors trained on a different language from the same language family
4. Applying transfer learning with a pre-trained parser

Language      (1)           (2)    (3)    (4)
af afribooms  not provided  75.46  77.43  78.12
kk ktb        20.19         22.31  21.96  23.86
bxr bdt       7.64          9.76   9.93   8.98
kmr mg        20.12         22.57  22.78  23.39

Table LAS values for strategies (1), (2), (3) and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123
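All scores in these tables are LAS (labeled attachment score): the percentage of tokens whose predicted head and dependency label are both correct. A minimal computation:

```python
def las(gold, pred):
    # gold, pred: one (head, label) pair per token, in sentence order
    correct = sum(1 for g, p in zip(gold, pred) if g == p)
    return 100.0 * correct / len(gold)
```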

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the most beneficial

From-scratch LM training does not yield useful word and context vectors

Our word and context vectors are still more useful than Facebook's [Bojanowski et al., 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

A transition based parser can only build projective trees.

Figure from https://stp.lingfil.uu.se/~sara/kurser/5LN455-2014/lectures/5LN455-F8.pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123
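A tree is projective exactly when no two dependency arcs cross, which can be checked directly from the head indices. A small helper (1-based tokens, head 0 = root; names illustrative):

```python
def is_projective(heads):
    # heads[i-1] is the head of token i (tokens numbered 1..n, head 0 = root)
    n = len(heads)
    arcs = [tuple(sorted((i, heads[i - 1]))) for i in range(1, n + 1)]
    for a, b in arcs:
        for c, d in arcs:
            if a < c < b < d:  # one endpoint inside (a, b), the other outside
                return False   # crossing arcs => non-projective
    return True
```

For example, heads `[3, 4, 0, 3]` encode the crossing arcs (1, 3) and (2, 4), so the tree is non-projective and unreachable for this parser.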

Projective vs Non-projective

We compared our model with the best model for different projectivity ratios

Language     Projectivity  Best (LAS)  Our (LAS)
grc perseus  90.7          79.39       55.03 (20)
eu bdt       95.13         84.22       74.13 (17)
hu szeged    97.8          82.66       68.18 (14)
da ddt       98.26         86.28       76.40 (17)
en gum       99.6          85.05       76.44 (15)
gl treegal   100           74.25       70.45 (10)
gl ctg       100           82.12       79.45 (14)

Table Our model's performance gap decreases as the projectivity ratio increases

7 From the official results page and our projectivity table

Omer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusion: We introduced "Context, Word and Morph-feat" embeddings and showed their contribution to transition based dependency parsing.

Our Tree-stack LSTM outperformed the MLP by removing hand-crafted feature engineering.

Tree-stack LSTM performed better on low-resource languages.

As the training dataset size increases, tree-stack LSTM loses its advantage.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization, morphological tagging, and dependency parsing performed better. Some also jointly train a language model together with pre-trained embeddings.

Attention Mechanism

Applying attention between σ-LSTM, β-LSTM, or Action-LSTM states may bring a performance improvement.

Morphological Features

Finding different ways to represent morphological features.

Dynamic Oracle vs Beam Training

Although I tried both of them, I could not obtain a performance improvement. There may be convergence problems with our loss function, and other losses (e.g., CRF) may solve this problem.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Ömer Kırnap, Erenay Dayanık, and Deniz Yuret. 2018. Tree-stack LSTM in Transition Based Dependency Parsing. In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Ömer Kırnap, Berkay Furkan Önder, and Deniz Yuret. 2017. Parsing with Context Embeddings. In Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann, Carlos Gómez-Rodríguez, and Giorgio Satta. 2011. Dynamic programming algorithms for transition-based dependency parsers. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies - Volume 1, Association for Computational Linguistics, pages 673-682.

S. Kübler, R. McDonald, and J. Nivre. 2009. Dependency Parsing. Morgan & Claypool, US.

Chris Dyer, Miguel Ballesteros, Wang Ling, Austin Matthews, and Noah A. Smith. 2015. Transition-based dependency parsing with stack long short-term memory. CoRR abs/1505.08075.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123

  • Introduction
    • Overview of Dependency Parsing
    • Transition Based Dependency Parsing
      • Related Work
        • Linear Models and their Drawbacks
        • Neural Network Models
          • Model
            • Language Model
            • MLP Parser
            • Tree-stack LSTM Parser
              • Results
                • MLP vs Tree-stack LSTM
                • Morphological Feature Embeddings
                • Static vs Dynamic Oracle Training
                • Transfer Learning
                  • Conclusion
                  • Future Work amp Discussions

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

Figure Stack's top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 76 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 77 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure β-LSTM recalculates its hidden state based on the new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 78 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure Tree-stack LSTM is ready to produce the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 79 123

Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 80 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

LSTM LSTM LSTM LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure Each embedding is initialized by concatenating POS, language, and morph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 81 123
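As the caption says, each token embedding starts out as the concatenation of POS, language, and morph-feat embeddings. With plain lists standing in for vectors (a sketch, not the thesis code):

```python
def init_embedding(pos_vec, lang_vec, morph_vec):
    # Token representation = [POS ; language ; morph-feat] concatenation
    return pos_vec + lang_vec + morph_vec  # list concatenation
```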

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure Stack's top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 82 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 83 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure σ-LSTM recalculates its hidden state from the new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 84 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure Tree-stack LSTM is ready to produce the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 85 123
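The left/right rules above match the arc-hybrid transition system (Kuhlmann et al., 2011, cited in the references): left-arc attaches the stack top to the buffer front, right-arc attaches it to the next item on the stack. A runnable sketch on plain lists, with the state as (stack, buffer, arc set) and arcs stored as (head, label, dependent) triples:

```python
def shift(stack, buf, arcs):
    stack.append(buf.pop(0))

def left_arc(stack, buf, arcs, d):
    s = stack.pop()              # dependent: top of stack
    arcs.add((buf[0], d, s))     # head: front of buffer

def right_arc(stack, buf, arcs, d):
    t = stack.pop()              # dependent: top of stack
    arcs.add((stack[-1], d, t))  # head: new top of stack

# Example derivation for tokens 1..3 with root 0 on the stack:
# shift; left_arc("nsubj"); shift; shift; right_arc("obj"); right_arc("root")
```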


  • Introduction
    • Overview of Dependency Parsing
    • Transition Based Dependency Parsing
      • Related Work
        • Linear Models and their Drawbacks
        • Neural Network Models
          • Model
            • Language Model
            • MLP Parser
            • Tree-stack LSTM Parser
              • Results
                • MLP vs Tree-stack LSTM
                • Morphological Feature Embeddings
                • Static vs Dynamic Oracle Training
                • Transfer Learning
                  • Conclusion
                  • Future Work amp Discussions
Page 77: Transition Based Dependency Parsing with Deep LearningTransition Based Dependency Parsing with Deep Learning Omer K rnap Ko˘c University okirnap@ku.edu.tr September 27, 2018 Omer

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 77 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure β-LSTM recalculates its hidden based on new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 78 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure Tree-stack LSTM is ready to give new transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 79 123

Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 80 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure Each embedding initiated by concatenating POS language andmorph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 81 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure Stackrsquos top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 82 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 83 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure σ-LSTM recalculates its hidden from new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 84 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure Tree-stack LSTM is ready to give new transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 85 123

Final overview of Tree-stack LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 86 123

Overview

1 IntroductionOverview of Dependency ParsingTransition Based Dependency Parsing

2 Related WorkLinear Models and their DrawbacksNeural Network Models

3 ModelLanguage ModelMLP ParserTree-stack LSTM Parser

4 ResultsMLP vs Tree-stack LSTMMorphological Feature EmbeddingsStatic vs Dynamic Oracle TrainingTransfer Learning

5 Conclusion6 Future Work amp Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 87 123

4 Results amp Comparisons

Omer Kırnap (Koc University) MSc Thesis September 27 2018 88 123

Results amp Comparisons

Dataset

Dependency parsing of 81

treebanks in 49 languages

All treebanks use standardized

annotation

17 universal

part-of-speech tags

37 universal dependency

relations

Koc-University ranked 7th out

of 33 participants (1st among

transition based parsers)

Dependency parsing of 82

treebanks in 57 languages

All treebanks use

standardized annotation

17 universal

part-of-speech tags

37 universal dependency

relations

Koc-University ranked 16th

out of 30 participants (2nd

among transition based

parsers)

CoNLL17 CoNLL181 Traintest split change 2 Annotation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 89 123

MLP vs Tree-stack LSTM

CoNLL 2018 committee released comparison results of CoNLL17 andCoNLL18 systems tested under the same test sets

Omer Kırnap (Koc University) MSc Thesis September 27 2018 90 123

MLP vs Tree-stack LSTM

2 possible problems of official comparison

1 If the annotation of the tree bank is improved the older parser ishandicapped

2 If the training-test split has changed and the old training data arenow in test data the old parser is favored undeservedly

Omer Kırnap (Koc University) MSc Thesis September 27 2018 91 123

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code MLP Tree-stackru taiga (10k) 5889 6055hu szeged (20k) 6621 6818tr imst (50k) 5678 5875ar padt (120k) 6783 6814en ewt (205k) 7487 7577cs cac (473k) 8339 8357

Tree-stack LSTM outperforms MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 92 123

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 93 123

MLP Parser

MLP

Figure Initial model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 94 123

Only Action LSTM

LSTM LSTM

Figure Only action LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 95 123

Only β-LSTM

LSTM LSTM LSTM

LSTM LSTM MLP

Figure Only β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 96 123

Only σ-LSTM

LSTM LSTM

LSTM LSTM MLP

Figure Only σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang Code MLP Only Action Only-β Only-σhu szeged 6621 6687 6694 6703sv lines 7112 7205 7217 7245tr imst 5712 5687 5702 5712ar padt 6783 6667 6689 6692

cs cac 8389 8223 8313 8317

en ewt 7554 7543 7556 7567

Table Comparison between MLP and rdquoOnlyrdquo models

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN

Figure: Tree-stack LSTM with t-RNN: head word, dependent word, and dependency relation embeddings are composed by the t-RNN; the σ-, β-, and action-LSTM outputs are concatenated and fed to an MLP.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 99 123

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN:

Lang Code            without t-RNN  with t-RNN
no_nynorsklia (3k)   51.78          53.33
ru_taiga (11k)       59.13          60.55
gl_treegal (15k)     69.76          70.45
hu_szeged (20k)      66.12          68.18
sv_lines (49k)       74.04          75.46
tr_imst (50k)        58.12          58.75
ar_padt (120k)       68.04          68.14
en_ewt (204k)        74.87          75.77
cs_cac (473k)        82.89          83.57
cs_pdt (1M)          81.17          81.16

t-RNN provides a comparative advantage for low-resource languages.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 100 123

Ablation Analysis

Overall results of the ablation analysis:

Lang        MLP    Only A  Only-β  Only-σ  w/o t-RNN  all
hu_szeged   66.21  66.87   66.94   67.03   66.12      68.18
sv_lines    71.12  72.05   72.17   74.04   72.17      75.46
tr_imst     57.12  56.87   57.02   57.12   58.12      58.75
ar_padt     67.83  66.67   66.89   66.92   68.04      68.14
cs_cac      83.89  82.23   83.13   83.17   82.89      83.57
en_ewt      75.54  75.43   75.56   75.67   74.87      75.77

Tree-stack LSTM beats the other model variations.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of the ablation experiments:

t-RNN's performance contribution increases as the training size decreases.

σ-LSTM provides more useful information independent of dataset size.

Interconnecting the model's components with t-RNN makes tree-stack LSTM more powerful for low-resource languages (ranked 10th overall and 2nd among transition-based parsers).

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What does Morphological Feature Embedding provide

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental Settings: We divide the CoNLL18 UD 2.2 dataset into 4 parts based on the number of training tokens for each language, to better understand our contributions:

Languages having less than 20k tokens

Languages having more than 20k, less than 50k tokens

Languages having more than 50k, less than 100k tokens

Languages having 100k tokens or more
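The grouping above can be sketched as a small helper; the thresholds follow the four groups listed, while the function name and labels are illustrative:

```python
def size_bucket(n_tokens: int) -> str:
    """Map a treebank's training-token count to one of the four groups."""
    if n_tokens < 20_000:
        return "<20k"
    if n_tokens < 50_000:
        return "20k-50k"
    if n_tokens < 100_000:
        return "50k-100k"
    return ">=100k"
```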

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having less than 20k training tokens:

Lang code       Morph-Feats  no Morph-Feats  # of tokens
no_nynorsklia   51.13        53.33           3,583
ru_taiga        58.32        60.55           10,479
sme_giella      52.78        53.39           16,385
la_perseus      49.93        51.60           18,184
ug_udt          52.78        53.39           19,262
sl_sst          46.72        48.77           19,473
hu_szeged       66.23        68.18           20,166

Not useful for languages having less than 20k training tokens.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having between 50k and 100k training tokens:

Lang code      Morph-Feats  no Morph-Feats  # of tokens
sv_lines       72.18        74.81           48,325
fr_sequoia     84.36        82.17           50,543
en_gum         76.44        75.34           53,686
ko_gsd         73.74        72.54           56,687
eu_bdt         74.55        73.32           72,974
nl_lassysmall  76.7         75.8            75,134
gl_ctg         79.02        79.018          79,327
lv_lvtb        72.33        72.24           80,666
id_gsd         75.76        73.97           97,531

Beneficial for languages with 50k-100k training tokens.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having more than 100k training tokens:

Lang code   Morph-Feats  no Morph-Feats  # of tokens
fa_seraji   81.18        81.12           121,064
bg_btb      84.53        84.55           124,336
en_ewt      75.77        75.682          204,585
ar_padt     68.02        68.14           223,881
de_gsd      71.59        71.32           263,804
ca_ancora   85.89        85.874          417,587
es_ancora   84.99        84.78           444,617
cs_cac      83.57        83.63           472,608
cs_pdt      81.43        82.12           1,173,282

Neutral for languages having more than 100k training tokens.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle: transitions follow the gold moves.
Dynamic oracle: transitions follow the predicted moves.

In both cases, the log-probability of the gold moves is maximized.

Figure: Tree-stack LSTM with t-RNN: head word, dependent word, and dependency relation embeddings are composed by the t-RNN; the σ-, β-, and action-LSTM outputs are concatenated and fed to an MLP.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123
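The training-regime difference can be sketched as a small helper; this is an illustrative reduction, not the thesis code, and `gold_moves`/`predicted_moves` are hypothetical stand-ins for the oracle output and the parser's argmax prediction:

```python
def training_trace(gold_moves, predicted_moves, dynamic):
    """Pair each step's loss target with the move used to advance the parser.

    Static oracle: the parser state follows the gold move.
    Dynamic oracle: the parser state follows the predicted move.
    In both regimes the loss maximizes log p(gold move).
    """
    trace = []
    for gold, pred in zip(gold_moves, predicted_moves):
        loss_target = gold                 # -log p(gold) in both cases
        next_move = pred if dynamic else gold
        trace.append((loss_target, next_move))
    return trace
```

With dynamic training the parser thus visits configurations produced by its own (possibly wrong) moves, while the loss still pushes probability toward the gold move.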

Static vs Dynamic Oracle Training

Figure: Results are very close for training sets with fewer than 20k tokens.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure: Results are very close for training sets between 20k and 50k tokens.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure: Results are very close for training sets with more than 50k tokens.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning:

1. Using very limited data to train an LM for word and context vectors, and using them to train a parser from scratch
2. Using Facebook's word vectors to train a parser [Bojanowski et al. 2017]
3. Using my own word and context vectors trained on a different language from the same language family
4. Applying transfer learning with a pre-trained parser

Language      (1)           (2)    (3)    (4)
af_afribooms  not provided  75.46  77.43  78.12
kk_ktb        20.19         22.31  21.96  23.86
bxr_bdt       7.64          9.76   9.93   8.98
kmr_mg        20.12         22.57  22.78  23.39

Table: LAS values for strategies (1), (2), (3), and (4).

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the most beneficial.

From-scratch LM training does not yield useful word and context vectors.

Our word and context vectors are still more useful than Facebook's [Bojanowski et al. 2017].

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

A transition-based parser can only build projective trees.

Figure from http://stp.lingfil.uu.se/~sara/kurser/5LN455-2014/lectures/5LN455-F8.pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123
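Projectivity can be checked by testing whether any two dependency arcs cross; a minimal sketch under the usual convention (1-based token indices, `heads[i-1]` giving the head of token `i`, 0 for the root):

```python
def is_projective(heads):
    """Return True iff no two arcs (drawn above the sentence) cross.

    heads[i-1] is the head of token i (1-based); 0 marks the root.
    """
    spans = [(min(h, d), max(h, d)) for d, h in enumerate(heads, start=1)]
    for l1, r1 in spans:
        for l2, r2 in spans:
            # two arcs cross iff exactly one endpoint of one span lies
            # strictly inside the other span
            if l1 < l2 < r1 < r2:
                return False
    return True
```

For example, "Economic news had little effect" with heads [2, 3, 0, 5, 3] is projective, so a transition-based parser can produce it.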

Projective vs Non-projective

We compared our model with the best model for different projectivity ratios:

Language     Projectivity  Best (LAS)  Our (LAS)
grc_perseus  90.7          79.39       55.03 (20)
eu_bdt       95.13         84.22       74.13 (17)
hu_szeged    97.8          82.66       68.18 (14)
da_ddt       98.26         86.28       76.40 (17)
en_gum       99.6          85.05       76.44 (15)
gl_treegal   100           74.25       70.45 (10)
gl_ctg       100           82.12       79.45 (14)

Table: Our model's performance gap decreases as the projectivity ratio increases.

From the official results page and our projectivity table.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusion:
We introduced "Context, Word, and Morph-feat" embeddings and showed their contribution to transition-based dependency parsing.

Our Tree-stack LSTM outperformed the MLP while removing hand-crafted feature engineering.

Tree-stack LSTM performed better on low-resource languages.

When the training dataset size increases, tree-stack LSTM loses its advantage.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization, morphological tagging, and dependency parsing performed better. Some also jointly train a language model together with pre-trained embeddings.

Attention Mechanism

Applying attention between σ-LSTM states, or within the β-LSTM or Action-LSTM, may bring a performance improvement.

Morphological Features

Finding a different way to represent morphological features.

Dynamic Oracle vs Beam Training

Although I tried both of them, I could not obtain a performance improvement. There may be convergence problems with our loss function, and other losses (e.g. CRF) may solve this problem.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap, Erenay Dayanık, and Deniz Yuret. 2018. Tree-stack LSTM in Transition Based Dependency Parsing. In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap, Berkay Furkan Onder, and Deniz Yuret. 2017. Parsing with Context Embeddings. In Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann, Carlos Gomez-Rodriguez, and Giorgio Satta. 2011. Dynamic programming algorithms for transition-based dependency parsers. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies - Volume 1. Association for Computational Linguistics, pages 673-682.

S. Kübler, R. McDonald, and J. Nivre. 2009. Dependency Parsing. Morgan & Claypool, US.

Chris Dyer, Miguel Ballesteros, Wang Ling, Austin Matthews, and Noah A. Smith. 2015. Transition-based dependency parsing with stack long short-term memory. CoRR abs/1505.08075.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123


Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: β-LSTM recalculates its hidden state based on the new input.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 78 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: Tree-stack LSTM is ready for a new transition.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 79 123
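The left and right transitions above can be sketched on a plain (stack, buffer, arc-set) configuration; this is a minimal illustration of the transition definitions, not the thesis implementation (the LSTM and t-RNN updates are omitted, and the function names are illustrative):

```python
def shift(stack, buffer, arcs):
    """shift: move the front of the buffer onto the stack."""
    return stack + [buffer[0]], buffer[1:], arcs

def left(stack, buffer, arcs, d="dep"):
    """left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)}): pop s, attach it to b."""
    s, b = stack[-1], buffer[0]
    return stack[:-1], buffer, arcs | {(b, d, s)}

def right(stack, buffer, arcs, d="dep"):
    """right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)}): pop t, attach it to s."""
    t, s = stack[-1], stack[-2]
    return stack[:-1], buffer, arcs | {(s, d, t)}
```

For tokens [1, 2, 3] with 3 as root, the sequence shift, left, shift, left, shift yields the arcs {(2, 'dep', 1), (3, 'dep', 2)}.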

Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 80 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: Each embedding is initialized by concatenating POS, language, and morph-feat embeddings.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 81 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: The stack's top LSTM is reduced.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 82 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: t-RNN computes the new head embedding.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 83 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: σ-LSTM recalculates its hidden state from the new input.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 84 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: Tree-stack LSTM is ready for a new transition.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 85 123

Final overview of Tree-stack LSTM

Figure: Tree-stack LSTM overview: head word, dependent word, and dependency relation embeddings are composed by the t-RNN; the σ-, β-, and action-LSTM outputs are concatenated and fed to an MLP.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 86 123
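The t-RNN composition step in the overview — producing a new head embedding from the head, dependent, and relation embeddings — can be sketched with a single tanh layer. The weight layout and the function name are illustrative assumptions, not the thesis code:

```python
import math

def trnn_compose(head, dependent, relation, W):
    """New head vector = tanh(W · [head; dependent; relation]).

    head, dependent, relation: lists of floats of equal length dim.
    W: dim rows, each of length 3*dim, given as nested lists.
    """
    x = head + dependent + relation  # concatenation of the three embeddings
    return [math.tanh(sum(w * xi for w, xi in zip(row, x))) for row in W]
```

Each time an arc is added, the composed vector replaces the head's embedding in the stack or buffer, so subtrees are summarized recursively.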

Overview

1. Introduction: Overview of Dependency Parsing; Transition Based Dependency Parsing

2. Related Work: Linear Models and their Drawbacks; Neural Network Models

3. Model: Language Model; MLP Parser; Tree-stack LSTM Parser

4. Results: MLP vs Tree-stack LSTM; Morphological Feature Embeddings; Static vs Dynamic Oracle Training; Transfer Learning

5. Conclusion

6. Future Work & Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 87 123

4 Results amp Comparisons

Omer Kırnap (Koc University) MSc Thesis September 27 2018 88 123

Results amp Comparisons

Dataset

CoNLL17:
- Dependency parsing of 81 treebanks in 49 languages
- All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations
- Koc University ranked 7th out of 33 participants (1st among transition-based parsers)

CoNLL18:
- Dependency parsing of 82 treebanks in 57 languages
- All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations
- Koc University ranked 16th out of 30 participants (2nd among transition-based parsers)

Changes from CoNLL17 to CoNLL18: 1. Train/test split change 2. Annotation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 89 123

MLP vs Tree-stack LSTM

CoNLL 2018 committee released comparison results of CoNLL17 andCoNLL18 systems tested under the same test sets

Omer Kırnap (Koc University) MSc Thesis September 27 2018 90 123

MLP vs Tree-stack LSTM

2 possible problems of official comparison

1 If the annotation of the tree bank is improved the older parser ishandicapped

2 If the training-test split has changed and the old training data arenow in test data the old parser is favored undeservedly

Omer Kırnap (Koc University) MSc Thesis September 27 2018 91 123

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code MLP Tree-stackru taiga (10k) 5889 6055hu szeged (20k) 6621 6818tr imst (50k) 5678 5875ar padt (120k) 6783 6814en ewt (205k) 7487 7577cs cac (473k) 8339 8357

Tree-stack LSTM outperforms MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 92 123

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 93 123

MLP Parser

MLP

Figure Initial model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 94 123

Only Action LSTM

LSTM LSTM

Figure Only action LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 95 123

Only β-LSTM

LSTM LSTM LSTM

LSTM LSTM MLP

Figure Only β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 96 123

Only σ-LSTM

LSTM LSTM

LSTM LSTM MLP

Figure Only σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang Code MLP Only Action Only-β Only-σhu szeged 6621 6687 6694 6703sv lines 7112 7205 7217 7245tr imst 5712 5687 5702 5712ar padt 6783 6667 6689 6692

cs cac 8389 8223 8313 8317

en ewt 7554 7543 7556 7567

Table Comparison between MLP and rdquoOnlyrdquo models

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 99 123

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang Code without t-RNN with t-RNNno nynorsklia (3k) 5178 5333ru taiga (11k) 5913 6055gl treegal (15k) 6976 7045hu szeged (20k) 6612 6818sv lines (49k) 7404 7546tr imst (50k) 5812 5875

ar padt (120k) 6804 6814

en ewt (204k) 7487 7577

cs cac (473k) 8289 8357

cs pdt (1M) 8117 81164

t-RNN provides comparative advantage for low-resourcelanguages

Omer Kırnap (Koc University) MSc Thesis September 27 2018 100 123

Ablation Analysis

Overall results of ablation analysis

Lang MLP Only A Only-β Only-σ wot-RNN allhu szeged 6621 6687 6694 6703 6612 6818sv lines 7112 7205 7217 7404 7217 7546tr imst 5712 5687 5702 5712 5812 5875ar padt 6783 6667 6689 6692 6804 6814cs cac 8389 8223 8313 8317 8289 8357en ewt 7554 7543 7556 7567 7487 7577

Tree-stack LSTM beats other model variations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of Ablation Experiments

t-RNNrsquos performance contribution increases when the training sizedecreases

σ-LSTM provides more useful information independent from datasetsize

Interconnecting modelrsquos component with t-RNN makes tree-stackLSTM more powerful for low-resource languages (ranked 10th of alland 2nd among transition based parsers)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What does Morphological Feature Embedding provide

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental SettingsWe divide Conll18 UD dataset 22 into 4 parts based on number oftraining tokens for each language to better understand our contributions

Languages having less than 20k tokens

Languages having more than 20k less than 50k tokens

Languages having more than 50k less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having less than 20k training tokens

Lang code Morph-Feats no Morph-Feats of tokensno nynorsklia 5113 5333 3583

ru taiga 5832 6055 10479

sme giella 5278 5339 16385

la perseus 4993 516 18184

ug udt 5278 5339 19262

sl sst 4672 4877 19473

hu szeged 6623 6818 20166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having tokens in between 50k and100k

Lang code Morph-Feats no Morph-Feats of tokenssv lines 7218 7481 48325

fr sequoia 8436 8217 50543

en gum 7644 7534 53686

ko gsd 7374 7254 56687

eu bdt 7455 7332 72974

nl lassymal 767 758 75134

gl ctg 7902 79018 79327

lv lvtb 7233 7224 80666

id gsd 7576 7397 97531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having more than 100k trainingtokens

Lang code Morph-Feats no Morph-Feats of tokensfa seraji 8118 8112 121064

bg btb 8453 8455 124336

en ewt 7577 75682 204585

ar padt 6802 6814 223881

de gsd 7159 7132 263804

ca ancora 8589 85874 417587

es ancora 8499 8478 444617

cs cac 8357 8363 472608

cs pdt 8143 8212 1173282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle transitions using gold movesDynamic oracle transitions using predicted moves

In both cases logp of gold moves maximized

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning1 Using very limited data to train LM for word and context vectors and

use them to train a parser from scratch2 Using Facebookrsquos word vectors to train a parser [Bojanowski et al

2017]3 Using my own word and context vectors trained with different

language but from the same language family4 Applying transfer learning with a pre-trained parser

Language (1) (2) (3) (4)af afribooms not provided 7546 7743 7812kk ktb 2019 2231 2196 2386bxr bdt 764 976 993 898

kmr mg 2012 2257 2278 2339

Table LAS values for strategies (1) (2) (3) and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the mostbeneficial

From scratch LM training does not bring useful word and contextvectors

Our word and context vectors are still more useful than Facebookrsquos[Bojanowski et al 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition Based Parser can only build projective trees 6

6Figure fromhttpstplingfiluuse sarakurser5LN455-2014lectures5LN455-F8pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123

Projective vs Non-projective

We compared our model with the best model for different projectivityratios

Language Projectiviy Best (LAS) Our (LAS)

grc perseus 907 7939 5503 (20)

eu bdt 9513 8422 7413 (17)

hu szeged 978 8266 6818 (14)

da ddt 9826 8628 7640 (17)

en gum 996 8505 7644 (15)

gl treegal 100 7425 7045 (10)

gl ctg 100 8212 7945 (14)

Table Our models performance gap decreases as the projectivity ratio increases

7

7From official results page and our projectivity tableOmer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusionWe introduced ldquoContext Word and Morph-featrdquo embeddings and showedtheir contribution in transition based dependency parsing

Our Tree-stack LSTM outperformed MLP by removing hand-craftedfeature engineering

Tree-stack LSTM performed better with low resource languages

When the training dataset size increases tree-stack LSTM losses itsadvantage

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization morphological taggingand dependency parsing performed better Some are also jointly trained alanguage model together with pre-trained embeddings

Attention Mechanism

Applying attention in between σ-LSTM states or β-LSTM orAction-LSTM may bring performance improvement

Morphological Features

Finding different way to represent morphological features

Dynamic Oracle vs Beam Training

Although I tried both of them I could not obtain performanceimprovement There may be convergence problems with our loss functionand another losses (CRF) may solve this problem

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap Erenay Dayanık and Deniz Yuret 2018 Tree-stackLSTM in Transition Based Dependency Parsing In Proceedings ofthe CoNLL 2018 Shared Task Multilingual Parsing from Raw Text toUniversal Dependencies

Omer Kırnap Berkay Furkan Onder and Deniz Yuret 2017 Parsingwith Context Embeddings In Proceedings of the CoNLL 2017 SharedTask Multilingual Parsing from Raw Text to Universal Dependencies

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann Carlos Gomez-Rodriguez and Giorgio Satta 2011Dynamic programming algorithms for transition-based dependencyparsers In Proceedings of the 49th Annual Meeting of theAssociation for Computational Linguistics Human LanguageTechnologies-Volume 1 Association for Computational Linguisticspages 673682

S Kbler R McDonald and J Nivre 2009 Dependency parsingMorgan amp Claypool US

Chris Dyer Miguel Ballesteros Wang Ling Austin Matthews andNoah A Smith 2015 Transition based dependency parsing withstack long-short term memory CoRR abs150508075

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123

  • Introduction
    • Overview of Dependency Parsing
    • Transition Based Dependency Parsing
      • Related Work
        • Linear Models and their Drawbacks
        • Neural Network Models
          • Model
            • Language Model
            • MLP Parser
            • Tree-stack LSTM Parser
              • Results
                • MLP vs Tree-stack LSTM
                • Morphological Feature Embeddings
                • Static vs Dynamic Oracle Training
                • Transfer Learning
                  • Conclusion
                  • Future Work amp Discussions
Page 79: Transition Based Dependency Parsing with Deep LearningTransition Based Dependency Parsing with Deep Learning Omer K rnap Ko˘c University okirnap@ku.edu.tr September 27, 2018 Omer

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure Tree-stack LSTM is ready to give new transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 79 123

Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 80 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure Each embedding initiated by concatenating POS language andmorph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 81 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure Stackrsquos top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 82 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 83 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure σ-LSTM recalculates its hidden from new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 84 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure Tree-stack LSTM is ready to give new transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 85 123

Final overview of Tree-stack LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 86 123

Overview

1 IntroductionOverview of Dependency ParsingTransition Based Dependency Parsing

2 Related WorkLinear Models and their DrawbacksNeural Network Models

3 ModelLanguage ModelMLP ParserTree-stack LSTM Parser

4 ResultsMLP vs Tree-stack LSTMMorphological Feature EmbeddingsStatic vs Dynamic Oracle TrainingTransfer Learning

5 Conclusion6 Future Work amp Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 87 123

4 Results amp Comparisons

Omer Kırnap (Koc University) MSc Thesis September 27 2018 88 123

Results amp Comparisons

Dataset

Dependency parsing of 81

treebanks in 49 languages

All treebanks use standardized

annotation

17 universal

part-of-speech tags

37 universal dependency

relations

Koc-University ranked 7th out

of 33 participants (1st among

transition based parsers)

Dependency parsing of 82

treebanks in 57 languages

All treebanks use

standardized annotation

17 universal

part-of-speech tags

37 universal dependency

relations

Koc-University ranked 16th

out of 30 participants (2nd

among transition based

parsers)

CoNLL17 CoNLL181 Traintest split change 2 Annotation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 89 123

MLP vs Tree-stack LSTM

CoNLL 2018 committee released comparison results of CoNLL17 andCoNLL18 systems tested under the same test sets

Omer Kırnap (Koc University) MSc Thesis September 27 2018 90 123

MLP vs Tree-stack LSTM

2 possible problems of official comparison

1 If the annotation of the tree bank is improved the older parser ishandicapped

2 If the training-test split has changed and the old training data arenow in test data the old parser is favored undeservedly

Omer Kırnap (Koc University) MSc Thesis September 27 2018 91 123

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code MLP Tree-stackru taiga (10k) 5889 6055hu szeged (20k) 6621 6818tr imst (50k) 5678 5875ar padt (120k) 6783 6814en ewt (205k) 7487 7577cs cac (473k) 8339 8357

Tree-stack LSTM outperforms MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 92 123

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 93 123

MLP Parser

MLP

Figure Initial model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 94 123

Only Action LSTM

LSTM LSTM

Figure Only action LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 95 123

Only β-LSTM

LSTM LSTM LSTM

LSTM LSTM MLP

Figure Only β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 96 123

Only σ-LSTM

LSTM LSTM

LSTM LSTM MLP

Figure Only σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang Code MLP Only Action Only-β Only-σhu szeged 6621 6687 6694 6703sv lines 7112 7205 7217 7245tr imst 5712 5687 5702 5712ar padt 6783 6667 6689 6692

cs cac 8389 8223 8313 8317

en ewt 7554 7543 7556 7567

Table Comparison between MLP and rdquoOnlyrdquo models

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 99 123

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang Code without t-RNN with t-RNNno nynorsklia (3k) 5178 5333ru taiga (11k) 5913 6055gl treegal (15k) 6976 7045hu szeged (20k) 6612 6818sv lines (49k) 7404 7546tr imst (50k) 5812 5875

ar padt (120k) 6804 6814

en ewt (204k) 7487 7577

cs cac (473k) 8289 8357

cs pdt (1M) 8117 81164

t-RNN provides comparative advantage for low-resourcelanguages

Omer Kırnap (Koc University) MSc Thesis September 27 2018 100 123

Ablation Analysis

Overall results of ablation analysis

Lang MLP Only A Only-β Only-σ wot-RNN allhu szeged 6621 6687 6694 6703 6612 6818sv lines 7112 7205 7217 7404 7217 7546tr imst 5712 5687 5702 5712 5812 5875ar padt 6783 6667 6689 6692 6804 6814cs cac 8389 8223 8313 8317 8289 8357en ewt 7554 7543 7556 7567 7487 7577

Tree-stack LSTM beats other model variations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of Ablation Experiments

t-RNNrsquos performance contribution increases when the training sizedecreases

σ-LSTM provides more useful information independent from datasetsize

Interconnecting modelrsquos component with t-RNN makes tree-stackLSTM more powerful for low-resource languages (ranked 10th of alland 2nd among transition based parsers)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What does Morphological Feature Embedding provide

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental SettingsWe divide Conll18 UD dataset 22 into 4 parts based on number oftraining tokens for each language to better understand our contributions

Languages having less than 20k tokens

Languages having more than 20k less than 50k tokens

Languages having more than 50k less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having less than 20k training tokens

Lang code Morph-Feats no Morph-Feats of tokensno nynorsklia 5113 5333 3583

ru taiga 5832 6055 10479

sme giella 5278 5339 16385

la perseus 4993 516 18184

ug udt 5278 5339 19262

sl sst 4672 4877 19473

hu szeged 6623 6818 20166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having between 50k and 100k training tokens

Lang code      Morph-Feats  no Morph-Feats  # of tokens
sv lines       72.18        74.81           48,325
fr sequoia     84.36        82.17           50,543
en gum         76.44        75.34           53,686
ko gsd         73.74        72.54           56,687
eu bdt         74.55        73.32           72,974
nl lassysmall  76.70        75.80           75,134
gl ctg         79.02        79.018          79,327
lv lvtb        72.33        72.24           80,666
id gsd         75.76        73.97           97,531

Beneficial for languages with 50k-100k training tokens


Contribution of Morph-feat embeddings

Morph-feat experiments for languages having more than 100k training tokens

Lang code  Morph-Feats  no Morph-Feats  # of tokens
fa seraji  81.18        81.12           121,064
bg btb     84.53        84.55           124,336
en ewt     75.77        75.682          204,585
ar padt    68.02        68.14           223,881
de gsd     71.59        71.32           263,804
ca ancora  85.89        85.874          417,587
es ancora  84.99        84.78           444,617
cs cac     83.57        83.63           472,608
cs pdt     81.43        82.12           1,173,282

Neutral for languages having more than 100k training tokens


Static vs Dynamic Oracle Training

Static oracle: transitions using gold moves. Dynamic oracle: transitions using predicted moves.

In both cases, the log-probability of the gold moves is maximized.
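The two regimes differ only in which move the parser follows during training; a minimal sketch of the loop (`score_moves`, `oracle_gold`, `apply_move`, and `is_final` are hypothetical stand-ins, not the thesis implementation):

```python
def train_sentence(score_moves, oracle_gold, apply_move, is_final, state,
                   mode="static"):
    """One training pass over a sentence.

    mode='static':  follow gold transitions while training.
    mode='dynamic': follow the model's own predicted transitions, so
                    training visits states the parser actually reaches
                    at test time.
    In both modes the loss maximizes log p of the gold move.
    """
    loss = 0.0
    while not is_final(state):
        scores = score_moves(state)       # dict: move -> log-probability
        gold = oracle_gold(state)         # best move w.r.t. the gold tree
        loss -= scores[gold]              # accumulate -log p(gold move)
        move = gold if mode == "static" else max(scores, key=scores.get)
        state = apply_move(state, move)
    return loss
```

With a static oracle the loop only ever visits gold-derived states; with a dynamic oracle it also learns to recover from its own mistakes.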

[Figure: Tree-stack LSTM architecture: the σ-, β-, and action-LSTM outputs and the t-RNN head word, dependent word, and dependency relation embeddings are concatenated and fed to an MLP]


Static vs Dynamic Oracle Training

Figure: Results are very close for training tokens less than 20k


Static vs Dynamic Oracle Training

Figure: Results are very close for training tokens between 20k and 50k


Static vs Dynamic Oracle Training

Figure: Results are very close for training tokens above 50k


How about languages with less than 20k training tokens?


Transfer Learning

There are 4 possible types of transfer learning:
1. Using very limited data to train an LM for word and context vectors, and using them to train a parser from scratch
2. Using Facebook's word vectors to train a parser [Bojanowski et al., 2017]
3. Using my own word and context vectors trained on a different language from the same language family
4. Applying transfer learning with a pre-trained parser

Language      (1)           (2)    (3)    (4)
af afribooms  not provided  75.46  77.43  78.12
kk ktb        20.19         22.31  21.96  23.86
bxr bdt       7.64          9.76   9.93   8.98
kmr mg        20.12         22.57  22.78  23.39

Table: LAS values for strategies (1), (2), (3), and (4)


Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the most beneficial

Training an LM from scratch does not produce useful word and context vectors

Our word and context vectors are still more useful than Facebook's [Bojanowski et al., 2017]


Projectivity

Transition-based parsers can only build projective trees

Figure from https://stp.lingfil.uu.se/~sara/kurser/5LN455-2014/lectures/5LN455-F8.pdf
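A dependency tree is projective exactly when no two arcs cross; this can be checked directly. A small self-contained sketch (not tied to any particular parser):

```python
def is_projective(heads):
    """heads[i] is the head index of token i+1 (0 = artificial root),
    tokens numbered 1..n. Returns True iff no two arcs cross."""
    arcs = [(min(h, d), max(h, d)) for d, h in enumerate(heads, start=1)]
    for (l1, r1) in arcs:
        for (l2, r2) in arcs:
            # two arcs cross when exactly one endpoint of one arc lies
            # strictly inside the span of the other
            if l1 < l2 < r1 < r2:
                return False
    return True
```

A transition-based parser of the kind above can only produce head lists for which this check returns True.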


Projective vs Non-projective

We compared our model with the best model for different projectivity ratios

Language     Projectivity (%)  Best (LAS)  Our (LAS)
grc perseus  90.7              79.39       55.03 (20)
eu bdt       95.13             84.22       74.13 (17)
hu szeged    97.8              82.66       68.18 (14)
da ddt       98.26             86.28       76.40 (17)
en gum       99.6              85.05       76.44 (15)
gl treegal   100               74.25       70.45 (10)
gl ctg       100               82.12       79.45 (14)

Table: Our model's performance gap decreases as the projectivity ratio increases

From the official results page and our projectivity table.

Conclusions


Conclusion

In conclusion: we introduced "Context", "Word", and "Morph-feat" embeddings and showed their contribution to transition-based dependency parsing

Our Tree-stack LSTM outperformed the MLP while removing hand-crafted feature engineering

Tree-stack LSTM performed better on low-resource languages

As the training dataset size increases, the tree-stack LSTM loses its advantage


Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization, morphological tagging, and dependency parsing performed better. Some also jointly train a language model together with pre-trained embeddings.

Attention Mechanism

Applying attention across σ-LSTM, β-LSTM, or action-LSTM states may bring a performance improvement.

Morphological Features

Finding different ways to represent morphological features.

Dynamic Oracle vs Beam Training

Although I tried both, I could not obtain a performance improvement. There may be convergence problems with our loss function, and other losses (e.g., CRF) may solve this problem.


Publications

Omer Kırnap, Erenay Dayanık, and Deniz Yuret. 2018. Tree-stack LSTM in Transition Based Dependency Parsing. In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap, Berkay Furkan Onder, and Deniz Yuret. 2017. Parsing with Context Embeddings. In Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.


References

Marco Kuhlmann, Carlos Gómez-Rodríguez, and Giorgio Satta. 2011. Dynamic programming algorithms for transition-based dependency parsers. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Volume 1. Association for Computational Linguistics, pages 673-682.

S. Kübler, R. McDonald, and J. Nivre. 2009. Dependency Parsing. Morgan & Claypool, US.

Chris Dyer, Miguel Ballesteros, Wang Ling, Austin Matthews, and Noah A. Smith. 2015. Transition-based dependency parsing with stack long short-term memory. CoRR abs/1505.08075.


Thank you for your attention


Questions



Right Transition


Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})


Figure: Each embedding is initialized by concatenating POS, language, and morph-feat embeddings
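The initialization described in the caption can be sketched as follows (vocabularies, dimensions, and the summing of multiple morphological features are illustrative assumptions, not the thesis settings):

```python
import random

random.seed(0)
POS_DIM, LANG_DIM, FEAT_DIM = 16, 4, 8

def embed(dim):
    return [random.gauss(0.0, 1.0) for _ in range(dim)]

# Hypothetical lookup tables with toy vocabularies.
pos_table  = {p: embed(POS_DIM)  for p in ["NOUN", "VERB", "ADJ"]}
lang_table = {l: embed(LANG_DIM) for l in ["en", "hu", "tr"]}
feat_table = {f: embed(FEAT_DIM) for f in ["Number=Sing", "Case=Nom"]}

def token_embedding(pos, lang, feats):
    """Concatenate POS, language, and (summed) morph-feat embeddings."""
    feat_vec = [0.0] * FEAT_DIM
    for f in feats:                              # summing several features
        for i, x in enumerate(feat_table[f]):    # is an assumption here
            feat_vec[i] += x
    return pos_table[pos] + lang_table[lang] + feat_vec

v = token_embedding("NOUN", "hu", ["Number=Sing", "Case=Nom"])
assert len(v) == POS_DIM + LANG_DIM + FEAT_DIM
```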


Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})


Figure: The stack's top LSTM is reduced


Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})


Figure: t-RNN calculates the new head embedding


Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})


Figure: σ-LSTM recalculates its hidden state from the new input


Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})


Figure: The tree-stack LSTM is ready to predict the next transition

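The right_d transition above can be sketched as a pure function on the parser state (σ, β, A); the `compose` argument stands in for the t-RNN that recomputes the head representation (a sketch under assumptions, not the thesis code):

```python
def right_arc(state, d, compose=lambda head, dep, rel: head):
    """right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)}).

    state is (sigma, beta, arcs); sigma is a list with the top at the end.
    `compose` is a trivial placeholder for the t-RNN that builds the new
    head embedding from head, dependent, and relation.
    """
    sigma, beta, arcs = state
    t = sigma[-1]                      # dependent: topmost stack item
    s = sigma[-2]                      # head: second stack item
    new_arcs = arcs | {(s, d, t)}      # add the arc (head, relation, dependent)
    new_head = compose(s, t, d)        # t-RNN recomputes the head representation
    new_sigma = sigma[:-2] + [new_head]
    return (new_sigma, beta, new_arcs)
```

For example, applying a right arc labeled "obj" to stack [0, 1, 2] pops 2, attaches it to 1, and leaves 1 (recomposed) on top of the stack.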

Final overview of Tree-stack LSTM

[Figure: Final tree-stack LSTM architecture: the σ-, β-, and action-LSTM outputs and the t-RNN head word, dependent word, and dependency relation embeddings are concatenated and fed to an MLP]


Overview

1 Introduction
   Overview of Dependency Parsing
   Transition Based Dependency Parsing

2 Related Work
   Linear Models and their Drawbacks
   Neural Network Models

3 Model
   Language Model
   MLP Parser
   Tree-stack LSTM Parser

4 Results
   MLP vs Tree-stack LSTM
   Morphological Feature Embeddings
   Static vs Dynamic Oracle Training
   Transfer Learning

5 Conclusion

6 Future Work & Discussions


4 Results & Comparisons


Results & Comparisons

Dataset

CoNLL17:
Dependency parsing of 81 treebanks in 49 languages
All treebanks use standardized annotation
17 universal part-of-speech tags
37 universal dependency relations
Koç University ranked 7th out of 33 participants (1st among transition-based parsers)

CoNLL18:
Dependency parsing of 82 treebanks in 57 languages
All treebanks use standardized annotation
17 universal part-of-speech tags
37 universal dependency relations
Koç University ranked 16th out of 30 participants (2nd among transition-based parsers)

Changes from CoNLL17 to CoNLL18: 1. Train/test split change 2. Annotation


MLP vs Tree-stack LSTM

The CoNLL 2018 committee released comparison results of CoNLL17 and CoNLL18 systems tested on the same test sets


MLP vs Tree-stack LSTM

Two possible problems with the official comparison:

1. If the annotation of the treebank was improved, the older parser is handicapped.

2. If the train-test split has changed and old training data are now in the test data, the old parser is favored undeservedly.


MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code        MLP    Tree-stack
ru taiga (10k)   58.89  60.55
hu szeged (20k)  66.21  68.18
tr imst (50k)    56.78  58.75
ar padt (120k)   67.83  68.14
en ewt (205k)    74.87  75.77
cs cac (473k)    83.39  83.57

Tree-stack LSTM outperforms MLP


Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM


MLP Parser


Figure: Initial model


Only Action LSTM


Figure: Only action LSTM


Only β-LSTM


Figure: Only β-LSTM


Only σ-LSTM


Figure: Only σ-LSTM


Ablation Analysis Results

Lang Code  MLP    Only-Action  Only-β  Only-σ
hu szeged  66.21  66.87        66.94   67.03
sv lines   71.12  72.05        72.17   72.45
tr imst    57.12  56.87        57.02   57.12
ar padt    67.83  66.67        66.89   66.92
cs cac     83.89  82.23        83.13   83.17
en ewt     75.54  75.43        75.56   75.67

Table: Comparison between MLP and "Only" models


Ablation of t-RNN

[Figure: Tree-stack LSTM with t-RNN: the σ-, β-, and action-LSTM outputs and the t-RNN head word, dependent word, and dependency relation embeddings are concatenated and fed to an MLP]


Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang Code           without t-RNN  with t-RNN
no nynorsklia (3k)  51.78          53.33
ru taiga (11k)      59.13          60.55
gl treegal (15k)    69.76          70.45
hu szeged (20k)     66.12          68.18
sv lines (49k)      74.04          75.46
tr imst (50k)       58.12          58.75
ar padt (120k)      68.04          68.14
en ewt (204k)       74.87          75.77
cs cac (473k)       82.89          83.57
cs pdt (1M)         81.17          81.164

t-RNN provides a comparative advantage for low-resource languages


Page 81: Transition Based Dependency Parsing with Deep LearningTransition Based Dependency Parsing with Deep Learning Omer K rnap Ko˘c University okirnap@ku.edu.tr September 27, 2018 Omer

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure Each embedding initiated by concatenating POS language andmorph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 81 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure Stackrsquos top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 82 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 83 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure σ-LSTM recalculates its hidden from new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 84 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure Tree-stack LSTM is ready to give new transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 85 123

Final overview of Tree-stack LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 86 123

Overview

1 IntroductionOverview of Dependency ParsingTransition Based Dependency Parsing

2 Related WorkLinear Models and their DrawbacksNeural Network Models

3 ModelLanguage ModelMLP ParserTree-stack LSTM Parser

4 ResultsMLP vs Tree-stack LSTMMorphological Feature EmbeddingsStatic vs Dynamic Oracle TrainingTransfer Learning

5 Conclusion6 Future Work amp Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 87 123

4 Results amp Comparisons

Omer Kırnap (Koc University) MSc Thesis September 27 2018 88 123

Results amp Comparisons

Dataset

Dependency parsing of 81

treebanks in 49 languages

All treebanks use standardized

annotation

17 universal

part-of-speech tags

37 universal dependency

relations

Koc-University ranked 7th out

of 33 participants (1st among

transition based parsers)

Dependency parsing of 82

treebanks in 57 languages

All treebanks use

standardized annotation

17 universal

part-of-speech tags

37 universal dependency

relations

Koc-University ranked 16th

out of 30 participants (2nd

among transition based

parsers)

CoNLL17 CoNLL181 Traintest split change 2 Annotation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 89 123

MLP vs Tree-stack LSTM

CoNLL 2018 committee released comparison results of CoNLL17 andCoNLL18 systems tested under the same test sets

Omer Kırnap (Koc University) MSc Thesis September 27 2018 90 123

MLP vs Tree-stack LSTM

2 possible problems of official comparison

1 If the annotation of the tree bank is improved the older parser ishandicapped

2 If the training-test split has changed and the old training data arenow in test data the old parser is favored undeservedly

Omer Kırnap (Koc University) MSc Thesis September 27 2018 91 123

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code MLP Tree-stackru taiga (10k) 5889 6055hu szeged (20k) 6621 6818tr imst (50k) 5678 5875ar padt (120k) 6783 6814en ewt (205k) 7487 7577cs cac (473k) 8339 8357

Tree-stack LSTM outperforms MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 92 123

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 93 123

MLP Parser

MLP

Figure Initial model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 94 123

Only Action LSTM

LSTM LSTM

Figure Only action LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 95 123

Only β-LSTM

LSTM LSTM LSTM

LSTM LSTM MLP

Figure Only β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 96 123

Only σ-LSTM

LSTM LSTM

LSTM LSTM MLP

Figure Only σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang Code MLP Only Action Only-β Only-σhu szeged 6621 6687 6694 6703sv lines 7112 7205 7217 7245tr imst 5712 5687 5702 5712ar padt 6783 6667 6689 6692

cs cac 8389 8223 8313 8317

en ewt 7554 7543 7556 7567

Table Comparison between MLP and rdquoOnlyrdquo models

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 99 123

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang Code without t-RNN with t-RNNno nynorsklia (3k) 5178 5333ru taiga (11k) 5913 6055gl treegal (15k) 6976 7045hu szeged (20k) 6612 6818sv lines (49k) 7404 7546tr imst (50k) 5812 5875

ar padt (120k) 6804 6814

en ewt (204k) 7487 7577

cs cac (473k) 8289 8357

cs pdt (1M) 8117 81164

t-RNN provides comparative advantage for low-resourcelanguages

Omer Kırnap (Koc University) MSc Thesis September 27 2018 100 123

Ablation Analysis

Overall results of ablation analysis

Lang MLP Only A Only-β Only-σ wot-RNN allhu szeged 6621 6687 6694 6703 6612 6818sv lines 7112 7205 7217 7404 7217 7546tr imst 5712 5687 5702 5712 5812 5875ar padt 6783 6667 6689 6692 6804 6814cs cac 8389 8223 8313 8317 8289 8357en ewt 7554 7543 7556 7567 7487 7577

Tree-stack LSTM beats other model variations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of Ablation Experiments

t-RNNrsquos performance contribution increases when the training sizedecreases

σ-LSTM provides more useful information independent from datasetsize

Interconnecting modelrsquos component with t-RNN makes tree-stackLSTM more powerful for low-resource languages (ranked 10th of alland 2nd among transition based parsers)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What does Morphological Feature Embedding provide

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental SettingsWe divide Conll18 UD dataset 22 into 4 parts based on number oftraining tokens for each language to better understand our contributions

Languages having less than 20k tokens

Languages having more than 20k less than 50k tokens

Languages having more than 50k less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having less than 20k training tokens

Lang code Morph-Feats no Morph-Feats of tokensno nynorsklia 5113 5333 3583

ru taiga 5832 6055 10479

sme giella 5278 5339 16385

la perseus 4993 516 18184

ug udt 5278 5339 19262

sl sst 4672 4877 19473

hu szeged 6623 6818 20166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having tokens in between 50k and100k

Lang code Morph-Feats no Morph-Feats of tokenssv lines 7218 7481 48325

fr sequoia 8436 8217 50543

en gum 7644 7534 53686

ko gsd 7374 7254 56687

eu bdt 7455 7332 72974

nl lassymal 767 758 75134

gl ctg 7902 79018 79327

lv lvtb 7233 7224 80666

id gsd 7576 7397 97531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having more than 100k trainingtokens

Lang code Morph-Feats no Morph-Feats of tokensfa seraji 8118 8112 121064

bg btb 8453 8455 124336

en ewt 7577 75682 204585

ar padt 6802 6814 223881

de gsd 7159 7132 263804

ca ancora 8589 85874 417587

es ancora 8499 8478 444617

cs cac 8357 8363 472608

cs pdt 8143 8212 1173282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle transitions using gold movesDynamic oracle transitions using predicted moves

In both cases logp of gold moves maximized

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning1 Using very limited data to train LM for word and context vectors and

use them to train a parser from scratch2 Using Facebookrsquos word vectors to train a parser [Bojanowski et al

2017]3 Using my own word and context vectors trained with different

language but from the same language family4 Applying transfer learning with a pre-trained parser

Language (1) (2) (3) (4)af afribooms not provided 7546 7743 7812kk ktb 2019 2231 2196 2386bxr bdt 764 976 993 898

kmr mg 2012 2257 2278 2339

Table LAS values for strategies (1) (2) (3) and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the mostbeneficial

From scratch LM training does not bring useful word and contextvectors

Our word and context vectors are still more useful than Facebookrsquos[Bojanowski et al 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition Based Parser can only build projective trees 6

6Figure fromhttpstplingfiluuse sarakurser5LN455-2014lectures5LN455-F8pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123

Projective vs Non-projective

We compared our model with the best model at different projectivity ratios.

Language      Projectivity (%)  Best (LAS)  Our (LAS)
grc perseus    90.7              79.39      55.03 (20)
eu bdt         95.13             84.22      74.13 (17)
hu szeged      97.8              82.66      68.18 (14)
da ddt         98.26             86.28      76.40 (17)
en gum         99.6              85.05      76.44 (15)
gl treegal    100                74.25      70.45 (10)
gl ctg        100                82.12      79.45 (14)

Table: Our model's performance gap decreases as the projectivity ratio increases.

7. From the official results page and our projectivity table.

Conclusions


Conclusion

In conclusion:
We introduced "Context, Word and Morph-feat" embeddings and showed their contribution to transition-based dependency parsing.

Our Tree-stack LSTM outperformed the MLP while removing hand-crafted feature engineering.

Tree-stack LSTM performed better on low-resource languages.

As the training dataset size increases, Tree-stack LSTM loses its advantage.


Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization, morphological tagging, and dependency parsing performed better. Some also jointly train a language model together with pre-trained embeddings.

Attention Mechanism

Applying attention over the σ-LSTM, β-LSTM, or action-LSTM states may bring a performance improvement.

Morphological Features

Finding different ways to represent morphological features.

Dynamic Oracle vs Beam Training

Although I tried both of them, I could not obtain a performance improvement. There may be convergence problems with our loss function, and other losses (e.g., CRF) may solve this problem.


Publications

Ömer Kırnap, Erenay Dayanık, and Deniz Yuret. 2018. Tree-stack LSTM in Transition Based Dependency Parsing. In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Ömer Kırnap, Berkay Furkan Önder, and Deniz Yuret. 2017. Parsing with Context Embeddings. In Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.


References

Marco Kuhlmann, Carlos Gómez-Rodríguez, and Giorgio Satta. 2011. Dynamic programming algorithms for transition-based dependency parsers. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies - Volume 1, Association for Computational Linguistics, pages 673-682.

S. Kübler, R. McDonald, and J. Nivre. 2009. Dependency Parsing. Morgan & Claypool, US.

Chris Dyer, Miguel Ballesteros, Wang Ling, Austin Matthews, and Noah A. Smith. 2015. Transition-based dependency parsing with stack long short-term memory. CoRR abs/1505.08075.


Thank you for your attention


Questions


  • Introduction
    • Overview of Dependency Parsing
    • Transition Based Dependency Parsing
      • Related Work
        • Linear Models and their Drawbacks
        • Neural Network Models
          • Model
            • Language Model
            • MLP Parser
            • Tree-stack LSTM Parser
              • Results
                • MLP vs Tree-stack LSTM
                • Morphological Feature Embeddings
                • Static vs Dynamic Oracle Training
                • Transfer Learning
                  • Conclusion
                  • Future Work amp Discussions
Page 82: Transition Based Dependency Parsing with Deep LearningTransition Based Dependency Parsing with Deep Learning Omer K rnap Ko˘c University okirnap@ku.edu.tr September 27, 2018 Omer

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure Stackrsquos top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 82 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 83 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure σ-LSTM recalculates its hidden from new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 84 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure Tree-stack LSTM is ready to give new transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 85 123

Final overview of Tree-stack LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 86 123

Overview

1 IntroductionOverview of Dependency ParsingTransition Based Dependency Parsing

2 Related WorkLinear Models and their DrawbacksNeural Network Models

3 ModelLanguage ModelMLP ParserTree-stack LSTM Parser

4 ResultsMLP vs Tree-stack LSTMMorphological Feature EmbeddingsStatic vs Dynamic Oracle TrainingTransfer Learning

5 Conclusion6 Future Work amp Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 87 123

4 Results amp Comparisons

Omer Kırnap (Koc University) MSc Thesis September 27 2018 88 123

Results amp Comparisons

Dataset

Dependency parsing of 81

treebanks in 49 languages

All treebanks use standardized

annotation

17 universal

part-of-speech tags

37 universal dependency

relations

Koc-University ranked 7th out

of 33 participants (1st among

transition based parsers)

Dependency parsing of 82

treebanks in 57 languages

All treebanks use

standardized annotation

17 universal

part-of-speech tags

37 universal dependency

relations

Koc-University ranked 16th

out of 30 participants (2nd

among transition based

parsers)

CoNLL17 CoNLL181 Traintest split change 2 Annotation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 89 123

MLP vs Tree-stack LSTM

CoNLL 2018 committee released comparison results of CoNLL17 andCoNLL18 systems tested under the same test sets

Omer Kırnap (Koc University) MSc Thesis September 27 2018 90 123

MLP vs Tree-stack LSTM

2 possible problems of official comparison

1 If the annotation of the tree bank is improved the older parser ishandicapped

2 If the training-test split has changed and the old training data arenow in test data the old parser is favored undeservedly

Omer Kırnap (Koc University) MSc Thesis September 27 2018 91 123

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code MLP Tree-stackru taiga (10k) 5889 6055hu szeged (20k) 6621 6818tr imst (50k) 5678 5875ar padt (120k) 6783 6814en ewt (205k) 7487 7577cs cac (473k) 8339 8357

Tree-stack LSTM outperforms MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 92 123

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 93 123

MLP Parser

MLP

Figure Initial model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 94 123

Only Action LSTM

LSTM LSTM

Figure Only action LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 95 123

Only β-LSTM

LSTM LSTM LSTM

LSTM LSTM MLP

Figure Only β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 96 123

Only σ-LSTM

LSTM LSTM

LSTM LSTM MLP

Figure Only σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang Code MLP Only Action Only-β Only-σhu szeged 6621 6687 6694 6703sv lines 7112 7205 7217 7245tr imst 5712 5687 5702 5712ar padt 6783 6667 6689 6692

cs cac 8389 8223 8313 8317

en ewt 7554 7543 7556 7567

Table Comparison between MLP and rdquoOnlyrdquo models

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 99 123

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang Code without t-RNN with t-RNNno nynorsklia (3k) 5178 5333ru taiga (11k) 5913 6055gl treegal (15k) 6976 7045hu szeged (20k) 6612 6818sv lines (49k) 7404 7546tr imst (50k) 5812 5875

ar padt (120k) 6804 6814

en ewt (204k) 7487 7577

cs cac (473k) 8289 8357

cs pdt (1M) 8117 81164

t-RNN provides comparative advantage for low-resourcelanguages

Omer Kırnap (Koc University) MSc Thesis September 27 2018 100 123

Ablation Analysis

Overall results of ablation analysis

Lang MLP Only A Only-β Only-σ wot-RNN allhu szeged 6621 6687 6694 6703 6612 6818sv lines 7112 7205 7217 7404 7217 7546tr imst 5712 5687 5702 5712 5812 5875ar padt 6783 6667 6689 6692 6804 6814cs cac 8389 8223 8313 8317 8289 8357en ewt 7554 7543 7556 7567 7487 7577

Tree-stack LSTM beats other model variations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of Ablation Experiments

t-RNNrsquos performance contribution increases when the training sizedecreases

σ-LSTM provides more useful information independent from datasetsize

Interconnecting modelrsquos component with t-RNN makes tree-stackLSTM more powerful for low-resource languages (ranked 10th of alland 2nd among transition based parsers)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What does Morphological Feature Embedding provide

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental SettingsWe divide Conll18 UD dataset 22 into 4 parts based on number oftraining tokens for each language to better understand our contributions

Languages having less than 20k tokens

Languages having more than 20k less than 50k tokens

Languages having more than 50k less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having less than 20k training tokens

Lang code Morph-Feats no Morph-Feats of tokensno nynorsklia 5113 5333 3583

ru taiga 5832 6055 10479

sme giella 5278 5339 16385

la perseus 4993 516 18184

ug udt 5278 5339 19262

sl sst 4672 4877 19473

hu szeged 6623 6818 20166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having tokens in between 50k and100k

Lang code Morph-Feats no Morph-Feats of tokenssv lines 7218 7481 48325

fr sequoia 8436 8217 50543

en gum 7644 7534 53686

ko gsd 7374 7254 56687

eu bdt 7455 7332 72974

nl lassymal 767 758 75134

gl ctg 7902 79018 79327

lv lvtb 7233 7224 80666

id gsd 7576 7397 97531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having more than 100k trainingtokens

Lang code Morph-Feats no Morph-Feats of tokensfa seraji 8118 8112 121064

bg btb 8453 8455 124336

en ewt 7577 75682 204585

ar padt 6802 6814 223881

de gsd 7159 7132 263804

ca ancora 8589 85874 417587

es ancora 8499 8478 444617

cs cac 8357 8363 472608

cs pdt 8143 8212 1173282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle transitions using gold movesDynamic oracle transitions using predicted moves

In both cases logp of gold moves maximized

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning1 Using very limited data to train LM for word and context vectors and

use them to train a parser from scratch2 Using Facebookrsquos word vectors to train a parser [Bojanowski et al

2017]3 Using my own word and context vectors trained with different

language but from the same language family4 Applying transfer learning with a pre-trained parser

Language (1) (2) (3) (4)af afribooms not provided 7546 7743 7812kk ktb 2019 2231 2196 2386bxr bdt 764 976 993 898

kmr mg 2012 2257 2278 2339

Table LAS values for strategies (1) (2) (3) and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the mostbeneficial

From scratch LM training does not bring useful word and contextvectors

Our word and context vectors are still more useful than Facebookrsquos[Bojanowski et al 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition Based Parser can only build projective trees 6

6Figure fromhttpstplingfiluuse sarakurser5LN455-2014lectures5LN455-F8pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123

Projective vs Non-projective

We compared our model with the best model for different projectivityratios

Language Projectiviy Best (LAS) Our (LAS)

grc perseus 907 7939 5503 (20)

eu bdt 9513 8422 7413 (17)

hu szeged 978 8266 6818 (14)

da ddt 9826 8628 7640 (17)

en gum 996 8505 7644 (15)

gl treegal 100 7425 7045 (10)

gl ctg 100 8212 7945 (14)

Table Our models performance gap decreases as the projectivity ratio increases

7

7From official results page and our projectivity tableOmer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusionWe introduced ldquoContext Word and Morph-featrdquo embeddings and showedtheir contribution in transition based dependency parsing

Our Tree-stack LSTM outperformed MLP by removing hand-craftedfeature engineering

Tree-stack LSTM performed better with low resource languages

When the training dataset size increases tree-stack LSTM losses itsadvantage

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization morphological taggingand dependency parsing performed better Some are also jointly trained alanguage model together with pre-trained embeddings

Attention Mechanism

Applying attention in between σ-LSTM states or β-LSTM orAction-LSTM may bring performance improvement

Morphological Features

Finding different way to represent morphological features

Dynamic Oracle vs Beam Training

Although I tried both of them I could not obtain performanceimprovement There may be convergence problems with our loss functionand another losses (CRF) may solve this problem

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap Erenay Dayanık and Deniz Yuret 2018 Tree-stackLSTM in Transition Based Dependency Parsing In Proceedings ofthe CoNLL 2018 Shared Task Multilingual Parsing from Raw Text toUniversal Dependencies

Omer Kırnap Berkay Furkan Onder and Deniz Yuret 2017 Parsingwith Context Embeddings In Proceedings of the CoNLL 2017 SharedTask Multilingual Parsing from Raw Text to Universal Dependencies

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann Carlos Gomez-Rodriguez and Giorgio Satta 2011Dynamic programming algorithms for transition-based dependencyparsers In Proceedings of the 49th Annual Meeting of theAssociation for Computational Linguistics Human LanguageTechnologies-Volume 1 Association for Computational Linguisticspages 673682

S Kbler R McDonald and J Nivre 2009 Dependency parsingMorgan amp Claypool US

Chris Dyer Miguel Ballesteros Wang Ling Austin Matthews andNoah A Smith 2015 Transition based dependency parsing withstack long-short term memory CoRR abs150508075

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123

  • Introduction
    • Overview of Dependency Parsing
    • Transition Based Dependency Parsing
      • Related Work
        • Linear Models and their Drawbacks
        • Neural Network Models
          • Model
            • Language Model
            • MLP Parser
            • Tree-stack LSTM Parser
              • Results
                • MLP vs Tree-stack LSTM
                • Morphological Feature Embeddings
                • Static vs Dynamic Oracle Training
                • Transfer Learning
                  • Conclusion
                  • Future Work amp Discussions
Page 83: Transition Based Dependency Parsing with Deep LearningTransition Based Dependency Parsing with Deep Learning Omer K rnap Ko˘c University okirnap@ku.edu.tr September 27, 2018 Omer

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 83 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure σ-LSTM recalculates its hidden from new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 84 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure Tree-stack LSTM is ready to give new transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 85 123

Final overview of Tree-stack LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 86 123

Overview

1 IntroductionOverview of Dependency ParsingTransition Based Dependency Parsing

2 Related WorkLinear Models and their DrawbacksNeural Network Models

3 ModelLanguage ModelMLP ParserTree-stack LSTM Parser

4 ResultsMLP vs Tree-stack LSTMMorphological Feature EmbeddingsStatic vs Dynamic Oracle TrainingTransfer Learning

5 Conclusion6 Future Work amp Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 87 123

4 Results amp Comparisons

Omer Kırnap (Koc University) MSc Thesis September 27 2018 88 123

Results amp Comparisons

Dataset

Dependency parsing of 81

treebanks in 49 languages

All treebanks use standardized

annotation

17 universal

part-of-speech tags

37 universal dependency

relations

Koc-University ranked 7th out

of 33 participants (1st among

transition based parsers)

Dependency parsing of 82

treebanks in 57 languages

All treebanks use

standardized annotation

17 universal

part-of-speech tags

37 universal dependency

relations

Koc-University ranked 16th

out of 30 participants (2nd

among transition based

parsers)

CoNLL17 CoNLL181 Traintest split change 2 Annotation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 89 123

MLP vs Tree-stack LSTM

CoNLL 2018 committee released comparison results of CoNLL17 andCoNLL18 systems tested under the same test sets

Omer Kırnap (Koc University) MSc Thesis September 27 2018 90 123

MLP vs Tree-stack LSTM

2 possible problems of official comparison

1 If the annotation of the tree bank is improved the older parser ishandicapped

2 If the training-test split has changed and the old training data arenow in test data the old parser is favored undeservedly

Omer Kırnap (Koc University) MSc Thesis September 27 2018 91 123

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code MLP Tree-stackru taiga (10k) 5889 6055hu szeged (20k) 6621 6818tr imst (50k) 5678 5875ar padt (120k) 6783 6814en ewt (205k) 7487 7577cs cac (473k) 8339 8357

Tree-stack LSTM outperforms MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 92 123

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 93 123

MLP Parser

MLP

Figure Initial model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 94 123

Only Action LSTM

LSTM LSTM

Figure Only action LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 95 123

Only β-LSTM

LSTM LSTM LSTM

LSTM LSTM MLP

Figure Only β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 96 123

Only σ-LSTM

LSTM LSTM

LSTM LSTM MLP

Figure Only σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang Code MLP Only Action Only-β Only-σhu szeged 6621 6687 6694 6703sv lines 7112 7205 7217 7245tr imst 5712 5687 5702 5712ar padt 6783 6667 6689 6692

cs cac 8389 8223 8313 8317

en ewt 7554 7543 7556 7567

Table Comparison between MLP and rdquoOnlyrdquo models

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 99 123

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang Code without t-RNN with t-RNNno nynorsklia (3k) 5178 5333ru taiga (11k) 5913 6055gl treegal (15k) 6976 7045hu szeged (20k) 6612 6818sv lines (49k) 7404 7546tr imst (50k) 5812 5875

ar padt (120k) 6804 6814

en ewt (204k) 7487 7577

cs cac (473k) 8289 8357

cs pdt (1M) 8117 81164

t-RNN provides comparative advantage for low-resourcelanguages

Omer Kırnap (Koc University) MSc Thesis September 27 2018 100 123

Ablation Analysis

Overall results of ablation analysis

Lang MLP Only A Only-β Only-σ wot-RNN allhu szeged 6621 6687 6694 6703 6612 6818sv lines 7112 7205 7217 7404 7217 7546tr imst 5712 5687 5702 5712 5812 5875ar padt 6783 6667 6689 6692 6804 6814cs cac 8389 8223 8313 8317 8289 8357en ewt 7554 7543 7556 7567 7487 7577

Tree-stack LSTM beats other model variations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of Ablation Experiments

t-RNNrsquos performance contribution increases when the training sizedecreases

σ-LSTM provides more useful information independent from datasetsize

Interconnecting modelrsquos component with t-RNN makes tree-stackLSTM more powerful for low-resource languages (ranked 10th of alland 2nd among transition based parsers)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What does Morphological Feature Embedding provide

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental SettingsWe divide Conll18 UD dataset 22 into 4 parts based on number oftraining tokens for each language to better understand our contributions

Languages having less than 20k tokens

Languages having more than 20k less than 50k tokens

Languages having more than 50k less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having less than 20k training tokens

Lang code Morph-Feats no Morph-Feats of tokensno nynorsklia 5113 5333 3583

ru taiga 5832 6055 10479

sme giella 5278 5339 16385

la perseus 4993 516 18184

ug udt 5278 5339 19262

sl sst 4672 4877 19473

hu szeged 6623 6818 20166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having tokens in between 50k and100k

Lang code Morph-Feats no Morph-Feats of tokenssv lines 7218 7481 48325

fr sequoia 8436 8217 50543

en gum 7644 7534 53686

ko gsd 7374 7254 56687

eu bdt 7455 7332 72974

nl lassymal 767 758 75134

gl ctg 7902 79018 79327

lv lvtb 7233 7224 80666

id gsd 7576 7397 97531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having more than 100k trainingtokens

Lang code Morph-Feats no Morph-Feats of tokensfa seraji 8118 8112 121064

bg btb 8453 8455 124336

en ewt 7577 75682 204585

ar padt 6802 6814 223881

de gsd 7159 7132 263804

ca ancora 8589 85874 417587

es ancora 8499 8478 444617

cs cac 8357 8363 472608

cs pdt 8143 8212 1173282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle transitions using gold movesDynamic oracle transitions using predicted moves

In both cases logp of gold moves maximized

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning1 Using very limited data to train LM for word and context vectors and

use them to train a parser from scratch2 Using Facebookrsquos word vectors to train a parser [Bojanowski et al

2017]3 Using my own word and context vectors trained with different

language but from the same language family4 Applying transfer learning with a pre-trained parser

Language (1) (2) (3) (4)af afribooms not provided 7546 7743 7812kk ktb 2019 2231 2196 2386bxr bdt 764 976 993 898

kmr mg 2012 2257 2278 2339

Table LAS values for strategies (1) (2) (3) and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the mostbeneficial

From scratch LM training does not bring useful word and contextvectors

Our word and context vectors are still more useful than Facebookrsquos[Bojanowski et al 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition Based Parser can only build projective trees 6

6Figure fromhttpstplingfiluuse sarakurser5LN455-2014lectures5LN455-F8pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123

Projective vs Non-projective

We compared our model with the best model for different projectivityratios

Language Projectiviy Best (LAS) Our (LAS)

grc perseus 907 7939 5503 (20)

eu bdt 9513 8422 7413 (17)

hu szeged 978 8266 6818 (14)

da ddt 9826 8628 7640 (17)

en gum 996 8505 7644 (15)

gl treegal 100 7425 7045 (10)

gl ctg 100 8212 7945 (14)

Table Our models performance gap decreases as the projectivity ratio increases

7

7From official results page and our projectivity tableOmer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusionWe introduced ldquoContext Word and Morph-featrdquo embeddings and showedtheir contribution in transition based dependency parsing

Our Tree-stack LSTM outperformed MLP by removing hand-craftedfeature engineering

Tree-stack LSTM performed better with low resource languages

When the training dataset size increases tree-stack LSTM losses itsadvantage

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization morphological taggingand dependency parsing performed better Some are also jointly trained alanguage model together with pre-trained embeddings

Attention Mechanism

Applying attention in between σ-LSTM states or β-LSTM orAction-LSTM may bring performance improvement

Morphological Features

Finding different way to represent morphological features

Dynamic Oracle vs Beam Training

Although I tried both of them I could not obtain performanceimprovement There may be convergence problems with our loss functionand another losses (CRF) may solve this problem

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap Erenay Dayanık and Deniz Yuret 2018 Tree-stackLSTM in Transition Based Dependency Parsing In Proceedings ofthe CoNLL 2018 Shared Task Multilingual Parsing from Raw Text toUniversal Dependencies

Omer Kırnap Berkay Furkan Onder and Deniz Yuret 2017 Parsingwith Context Embeddings In Proceedings of the CoNLL 2017 SharedTask Multilingual Parsing from Raw Text to Universal Dependencies

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann Carlos Gomez-Rodriguez and Giorgio Satta 2011Dynamic programming algorithms for transition-based dependencyparsers In Proceedings of the 49th Annual Meeting of theAssociation for Computational Linguistics Human LanguageTechnologies-Volume 1 Association for Computational Linguisticspages 673682

S Kbler R McDonald and J Nivre 2009 Dependency parsingMorgan amp Claypool US

Chris Dyer Miguel Ballesteros Wang Ling Austin Matthews andNoah A Smith 2015 Transition based dependency parsing withstack long-short term memory CoRR abs150508075

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123

  • Introduction
    • Overview of Dependency Parsing
    • Transition Based Dependency Parsing
      • Related Work
        • Linear Models and their Drawbacks
        • Neural Network Models
          • Model
            • Language Model
            • MLP Parser
            • Tree-stack LSTM Parser
              • Results
                • MLP vs Tree-stack LSTM
                • Morphological Feature Embeddings
                • Static vs Dynamic Oracle Training
                • Transfer Learning
                  • Conclusion
                  • Future Work amp Discussions
Page 84: Transition Based Dependency Parsing with Deep LearningTransition Based Dependency Parsing with Deep Learning Omer K rnap Ko˘c University okirnap@ku.edu.tr September 27, 2018 Omer

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure σ-LSTM recalculates its hidden from new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 84 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: Tree-stack LSTM is ready to predict the next transition (diagram: stack LSTM cells and t-RNN after the Right transition, with the new head in place)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 85 123

Final overview of Tree-stack LSTM

Figure: Full architecture: the σ-, β-, and action-LSTM hidden states, together with t-RNN encodings of the head word, dependent word, and dependency relation, are concatenated and fed to an MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 86 123
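The Concat + MLP step in the figure can be sketched as follows. The dimensions, transition count, and random weights here are illustrative assumptions, not the thesis configuration:

```python
# Illustrative sketch: the final hidden states of the sigma-LSTM, beta-LSTM,
# action-LSTM and t-RNN are concatenated and scored by a small MLP that
# outputs a softmax distribution over candidate transitions.

import math
import random

random.seed(0)

def rand_vec(n):
    return [random.gauss(0.0, 0.1) for _ in range(n)]

h_sigma, h_beta, h_action, h_tree = rand_vec(8), rand_vec(8), rand_vec(4), rand_vec(4)
x = h_sigma + h_beta + h_action + h_tree             # the Concat step

n_hidden, n_transitions = 16, 5                      # assumed sizes
W1 = [rand_vec(len(x)) for _ in range(n_hidden)]
W2 = [rand_vec(n_hidden) for _ in range(n_transitions)]

hidden = [max(0.0, sum(w * v for w, v in zip(row, x))) for row in W1]  # ReLU layer
scores = [sum(w * h for w, h in zip(row, hidden)) for row in W2]

m = max(scores)                                      # softmax over transitions
probs = [math.exp(s - m) for s in scores]
total = sum(probs)
probs = [p / total for p in probs]
print(len(probs), round(sum(probs), 6))
```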

Overview

1 Introduction: Overview of Dependency Parsing; Transition Based Dependency Parsing

2 Related Work: Linear Models and their Drawbacks; Neural Network Models

3 Model: Language Model; MLP Parser; Tree-stack LSTM Parser

4 Results: MLP vs Tree-stack LSTM; Morphological Feature Embeddings; Static vs Dynamic Oracle Training; Transfer Learning

5 Conclusion

6 Future Work & Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 87 123

4 Results & Comparisons

Omer Kırnap (Koc University) MSc Thesis September 27 2018 88 123

Results & Comparisons

Dataset

CoNLL17:

• Dependency parsing of 81 treebanks in 49 languages

• All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations

• Koç University ranked 7th out of 33 participants (1st among transition based parsers)

CoNLL18:

• Dependency parsing of 82 treebanks in 57 languages

• All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations

• Koç University ranked 16th out of 30 participants (2nd among transition based parsers)

Differences between the two tasks: 1. Train/test split change 2. Annotation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 89 123

MLP vs Tree-stack LSTM

The CoNLL 2018 committee released comparison results of CoNLL17 and CoNLL18 systems tested on the same test sets

Omer Kırnap (Koc University) MSc Thesis September 27 2018 90 123

MLP vs Tree-stack LSTM

Two possible problems with the official comparison:

1. If the annotation of the treebank has improved, the older parser is handicapped

2. If the training-test split has changed and old training data are now in the test data, the old parser is favored undeservedly

Omer Kırnap (Koc University) MSc Thesis September 27 2018 91 123

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models:

Lang Code        MLP    Tree-stack
ru taiga (10k)   58.89  60.55
hu szeged (20k)  66.21  68.18
tr imst (50k)    56.78  58.75
ar padt (120k)   67.83  68.14
en ewt (205k)    74.87  75.77
cs cac (473k)    83.39  83.57

Tree-stack LSTM outperforms MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 92 123

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 93 123

MLP Parser

Figure: Initial model (MLP only)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 94 123

Only Action LSTM

Figure: Only action LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 95 123

Only β-LSTM

Figure: Only β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 96 123

Only σ-LSTM

Figure: Only σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang Code  MLP    Only Action  Only-β  Only-σ
hu szeged  66.21  66.87        66.94   67.03
sv lines   71.12  72.05        72.17   72.45
tr imst    57.12  56.87        57.02   57.12
ar padt    67.83  66.67        66.89   66.92
cs cac     83.89  82.23        83.13   83.17
en ewt     75.54  75.43        75.56   75.67

Table: Comparison between MLP and "Only" models

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN

Figure: Tree-stack LSTM architecture with the t-RNN component (encoding head word, dependent word, and dependency relation) feeding the Concat + MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 99 123

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN:

Lang Code           without t-RNN  with t-RNN
no nynorsklia (3k)  51.78          53.33
ru taiga (11k)      59.13          60.55
gl treegal (15k)    69.76          70.45
hu szeged (20k)     66.12          68.18
sv lines (49k)      74.04          75.46
tr imst (50k)       58.12          58.75
ar padt (120k)      68.04          68.14
en ewt (204k)       74.87          75.77
cs cac (473k)       82.89          83.57
cs pdt (1M)         81.17          81.16

t-RNN provides a comparative advantage for low-resource languages

Omer Kırnap (Koc University) MSc Thesis September 27 2018 100 123

Ablation Analysis

Overall results of the ablation analysis:

Lang       MLP    Only A  Only-β  Only-σ  w/o t-RNN  all
hu szeged  66.21  66.87   66.94   67.03   66.12      68.18
sv lines   71.12  72.05   72.17   74.04   72.17      75.46
tr imst    57.12  56.87   57.02   57.12   58.12      58.75
ar padt    67.83  66.67   66.89   66.92   68.04      68.14
cs cac     83.89  82.23   83.13   83.17   82.89      83.57
en ewt     75.54  75.43   75.56   75.67   74.87      75.77

Tree-stack LSTM beats the other model variations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of the Ablation Experiments

t-RNN's performance contribution increases as the training size decreases

σ-LSTM provides more useful information, independent of dataset size

Interconnecting the model's components with t-RNN makes tree-stack LSTM more powerful for low-resource languages (ranked 10th overall and 2nd among transition based parsers)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What does Morphological Feature Embedding provide?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental Settings

We divide the CoNLL18 UD v2.2 dataset into 4 parts based on the number of training tokens for each language, to better understand our contributions:

Languages having less than 20k tokens

Languages having more than 20k, less than 50k tokens

Languages having more than 50k, less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123
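The four size buckets above can be sketched as a small helper. The thresholds are taken from this slide; the example token counts come from the result tables that follow:

```python
# Bucket a treebank by its number of training tokens, using the four
# thresholds from the experimental setting above.

def size_bucket(n_tokens):
    if n_tokens < 20_000:
        return "<20k"
    if n_tokens < 50_000:
        return "20k-50k"
    if n_tokens < 100_000:
        return "50k-100k"
    return ">=100k"

# Token counts for a few treebanks, as reported in the result tables
for name, n in [("ru_taiga", 10_479), ("sv_lines", 48_325),
                ("eu_bdt", 72_974), ("en_ewt", 204_585)]:
    print(name, size_bucket(n))
```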

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having less than 20k training tokens:

Lang code      Morph-Feats  no Morph-Feats  # of tokens
no nynorsklia  51.13        53.33           3,583
ru taiga       58.32        60.55           10,479
sme giella     52.78        53.39           16,385
la perseus     49.93        51.60           18,184
ug udt         52.78        53.39           19,262
sl sst         46.72        48.77           19,473
hu szeged      66.23        68.18           20,166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having between 50k and 100k training tokens:

Lang code      Morph-Feats  no Morph-Feats  # of tokens
sv lines       72.18        74.81           48,325
fr sequoia     84.36        82.17           50,543
en gum         76.44        75.34           53,686
ko gsd         73.74        72.54           56,687
eu bdt         74.55        73.32           72,974
nl lassysmall  76.70        75.80           75,134
gl ctg         79.02        79.018          79,327
lv lvtb        72.33        72.24           80,666
id gsd         75.76        73.97           97,531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having more than 100k training tokens:

Lang code  Morph-Feats  no Morph-Feats  # of tokens
fa seraji  81.18        81.12           121,064
bg btb     84.53        84.55           124,336
en ewt     75.77        75.682          204,585
ar padt    68.02        68.14           223,881
de gsd     71.59        71.32           263,804
ca ancora  85.89        85.874          417,587
es ancora  84.99        84.78           444,617
cs cac     83.57        83.63           472,608
cs pdt     81.43        82.12           1,173,282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle: transitions follow gold moves
Dynamic oracle: transitions follow predicted moves

In both cases, the log-probability of the gold moves is maximized

Figure: Tree-stack LSTM architecture (σ-, β-, and action-LSTM states and t-RNN encodings feeding the Concat + MLP)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123
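The contrast between the two regimes can be sketched as follows. The parser, oracle, and model interfaces here are toy stand-ins, not the thesis code: both regimes maximize log p(gold move), but they differ in which move advances the parser state:

```python
# Toy contrast of static vs dynamic oracle training: the loss is the same
# (negative log-probability of the gold move); only the applied move differs.

import math

class ToyState:
    """Hypothetical 3-step parse used only to illustrate the two regimes."""
    def __init__(self):
        self.steps = []
    def is_final(self):
        return len(self.steps) == 3
    def apply(self, move):
        self.steps.append(move)

class ToyOracle:
    def gold_move(self, state):
        return "shift"                      # gold is always 'shift' here

class ToyModel:
    def predict(self, state):               # fixed, slightly wrong model
        return {"shift": 0.4, "right": 0.6}

def train_sentence(state, oracle, model, dynamic=False):
    loss = 0.0
    while not state.is_final():
        probs = model.predict(state)
        gold = oracle.gold_move(state)
        loss -= math.log(probs[gold])       # maximize log p(gold) in both cases
        move = max(probs, key=probs.get) if dynamic else gold
        state.apply(move)                   # dynamic: follow the prediction;
    return loss                             # static: follow the gold move

static_state, dynamic_state = ToyState(), ToyState()
train_sentence(static_state, ToyOracle(), ToyModel(), dynamic=False)
train_sentence(dynamic_state, ToyOracle(), ToyModel(), dynamic=True)
print(static_state.steps, dynamic_state.steps)
# ['shift', 'shift', 'shift'] ['right', 'right', 'right']
```

With a dynamic oracle the model is trained on states produced by its own (possibly wrong) predictions, which is intended to reduce error propagation at test time.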

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning:

1. Using very limited data to train an LM for word and context vectors, and using them to train a parser from scratch

2. Using Facebook's word vectors to train a parser [Bojanowski et al. 2017]

3. Using my own word and context vectors trained on a different language from the same language family

4. Applying transfer learning with a pre-trained parser

Language      (1)           (2)    (3)    (4)
af afribooms  not provided  75.46  77.43  78.12
kk ktb        20.19         22.31  21.96  23.86
bxr bdt       7.64          9.76   9.93   8.98
kmr mg        20.12         22.57  22.78  23.39

Table: LAS values for strategies (1), (2), (3), and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of the Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the most beneficial

Training an LM from scratch does not yield useful word and context vectors

Our word and context vectors are still more useful than Facebook's [Bojanowski et al. 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition based parsers can only build projective trees 6

6 Figure from https://stp.lingfil.uu.se/~sara/kurser/5LN455-2014/lectures/5LN455-F8.pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123
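Since the parser builds only projective trees, it helps to check projectivity directly: a tree is projective iff no two dependency arcs cross. A minimal sketch (my own encoding: heads[i-1] is the head of word i, using 1-based word positions and 0 for ROOT):

```python
# A dependency tree is projective iff no two arcs cross, i.e. there are no
# arcs (l1, r1) and (l2, r2) with l1 < l2 < r1 < r2.

def is_projective(heads):
    arcs = [(min(h, d), max(h, d)) for d, h in enumerate(heads, start=1)]
    for (l1, r1) in arcs:
        for (l2, r2) in arcs:
            if l1 < l2 < r1 < r2:      # the two arcs cross
                return False
    return True

print(is_projective([2, 0, 2]))        # True: arcs 2->1, ROOT->2, 2->3
print(is_projective([3, 4, 0, 3]))     # False: arcs (1,3) and (2,4) cross
```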

Projective vs Non-projective

We compared our model with the best model for different projectivity ratios:

Language     Projectivity (%)  Best (LAS)  Our (LAS)
grc perseus  90.7              79.39       55.03 (20)
eu bdt       95.13             84.22       74.13 (17)
hu szeged    97.8              82.66       68.18 (14)
da ddt       98.26             86.28       76.40 (17)
en gum       99.6              85.05       76.44 (15)
gl treegal   100               74.25       70.45 (10)
gl ctg       100               82.12       79.45 (14)

Table: Our model's performance gap decreases as the projectivity ratio increases 7

7 From the official results page and our projectivity table

Omer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusion:

We introduced "Context, Word, and Morph-feat" embeddings and showed their contribution to transition based dependency parsing

Our Tree-stack LSTM outperformed the MLP by removing hand-crafted feature engineering

Tree-stack LSTM performed better on low-resource languages

As the training dataset size increases, tree-stack LSTM loses its advantage

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems jointly trained for tokenization, morphological tagging, and dependency parsing performed better. Some also jointly train a language model together with pre-trained embeddings.

Attention Mechanism

Applying attention between σ-LSTM, β-LSTM, or Action-LSTM states may bring a performance improvement.

Morphological Features

Finding different ways to represent morphological features.

Dynamic Oracle vs Beam Training

Although I tried both of them, I could not obtain a performance improvement. There may be convergence problems with our loss function, and other losses (e.g. CRF) may solve this problem.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap, Erenay Dayanık, and Deniz Yuret. 2018. Tree-stack LSTM in Transition Based Dependency Parsing. In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap, Berkay Furkan Onder, and Deniz Yuret. 2017. Parsing with Context Embeddings. In Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann, Carlos Gómez-Rodríguez, and Giorgio Satta. 2011. Dynamic programming algorithms for transition-based dependency parsers. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Volume 1. Association for Computational Linguistics, pages 673-682.

S. Kübler, R. McDonald, and J. Nivre. 2009. Dependency Parsing. Morgan & Claypool, US.

Chris Dyer, Miguel Ballesteros, Wang Ling, Austin Matthews, and Noah A. Smith. 2015. Transition-based dependency parsing with stack long short-term memory. CoRR abs/1505.08075.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123


Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning1 Using very limited data to train LM for word and context vectors and

use them to train a parser from scratch2 Using Facebookrsquos word vectors to train a parser [Bojanowski et al

2017]3 Using my own word and context vectors trained with different

language but from the same language family4 Applying transfer learning with a pre-trained parser

Language (1) (2) (3) (4)af afribooms not provided 7546 7743 7812kk ktb 2019 2231 2196 2386bxr bdt 764 976 993 898

kmr mg 2012 2257 2278 2339

Table LAS values for strategies (1) (2) (3) and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the mostbeneficial

From scratch LM training does not bring useful word and contextvectors

Our word and context vectors are still more useful than Facebookrsquos[Bojanowski et al 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition Based Parser can only build projective trees 6

6Figure fromhttpstplingfiluuse sarakurser5LN455-2014lectures5LN455-F8pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123

Projective vs Non-projective

We compared our model with the best model for different projectivityratios

Language Projectiviy Best (LAS) Our (LAS)

grc perseus 907 7939 5503 (20)

eu bdt 9513 8422 7413 (17)

hu szeged 978 8266 6818 (14)

da ddt 9826 8628 7640 (17)

en gum 996 8505 7644 (15)

gl treegal 100 7425 7045 (10)

gl ctg 100 8212 7945 (14)

Table Our models performance gap decreases as the projectivity ratio increases

7

7From official results page and our projectivity tableOmer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusionWe introduced ldquoContext Word and Morph-featrdquo embeddings and showedtheir contribution in transition based dependency parsing

Our Tree-stack LSTM outperformed MLP by removing hand-craftedfeature engineering

Tree-stack LSTM performed better with low resource languages

When the training dataset size increases tree-stack LSTM losses itsadvantage

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization morphological taggingand dependency parsing performed better Some are also jointly trained alanguage model together with pre-trained embeddings

Attention Mechanism

Applying attention in between σ-LSTM states or β-LSTM orAction-LSTM may bring performance improvement

Morphological Features

Finding different way to represent morphological features

Dynamic Oracle vs Beam Training

Although I tried both of them I could not obtain performanceimprovement There may be convergence problems with our loss functionand another losses (CRF) may solve this problem

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap Erenay Dayanık and Deniz Yuret 2018 Tree-stackLSTM in Transition Based Dependency Parsing In Proceedings ofthe CoNLL 2018 Shared Task Multilingual Parsing from Raw Text toUniversal Dependencies

Omer Kırnap Berkay Furkan Onder and Deniz Yuret 2017 Parsingwith Context Embeddings In Proceedings of the CoNLL 2017 SharedTask Multilingual Parsing from Raw Text to Universal Dependencies

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann Carlos Gomez-Rodriguez and Giorgio Satta 2011Dynamic programming algorithms for transition-based dependencyparsers In Proceedings of the 49th Annual Meeting of theAssociation for Computational Linguistics Human LanguageTechnologies-Volume 1 Association for Computational Linguisticspages 673682

S Kbler R McDonald and J Nivre 2009 Dependency parsingMorgan amp Claypool US

Chris Dyer Miguel Ballesteros Wang Ling Austin Matthews andNoah A Smith 2015 Transition based dependency parsing withstack long-short term memory CoRR abs150508075

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123

  • Introduction
    • Overview of Dependency Parsing
    • Transition Based Dependency Parsing
      • Related Work
        • Linear Models and their Drawbacks
        • Neural Network Models
          • Model
            • Language Model
            • MLP Parser
            • Tree-stack LSTM Parser
              • Results
                • MLP vs Tree-stack LSTM
                • Morphological Feature Embeddings
                • Static vs Dynamic Oracle Training
                • Transfer Learning
                  • Conclusion
                  • Future Work amp Discussions
Page 87: Transition Based Dependency Parsing with Deep LearningTransition Based Dependency Parsing with Deep Learning Omer K rnap Ko˘c University okirnap@ku.edu.tr September 27, 2018 Omer

Overview

1 IntroductionOverview of Dependency ParsingTransition Based Dependency Parsing

2 Related WorkLinear Models and their DrawbacksNeural Network Models

3 ModelLanguage ModelMLP ParserTree-stack LSTM Parser

4 ResultsMLP vs Tree-stack LSTMMorphological Feature EmbeddingsStatic vs Dynamic Oracle TrainingTransfer Learning

5 Conclusion6 Future Work amp Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 87 123

4 Results & Comparisons

Omer Kırnap (Koc University) MSc Thesis September 27 2018 88 123

Results & Comparisons

Dataset

CoNLL17: Dependency parsing of 81 treebanks in 49 languages. All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations. Koc-University ranked 7th out of 33 participants (1st among transition based parsers).

CoNLL18: Dependency parsing of 82 treebanks in 57 languages. All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations. Koc-University ranked 16th out of 30 participants (2nd among transition based parsers).

Differences between CoNLL17 and CoNLL18: 1. Train/test split change 2. Annotation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 89 123

MLP vs Tree-stack LSTM

The CoNLL 2018 committee released comparison results of CoNLL17 and CoNLL18 systems tested on the same test sets

Omer Kırnap (Koc University) MSc Thesis September 27 2018 90 123

MLP vs Tree-stack LSTM

Two possible problems with the official comparison:

1. If the annotation of the treebank is improved, the older parser is handicapped

2. If the training-test split has changed and the old training data are now in the test data, the old parser is favored undeservedly

Omer Kırnap (Koc University) MSc Thesis September 27 2018 91 123

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code        MLP    Tree-stack
ru taiga (10k)   58.89  60.55
hu szeged (20k)  66.21  68.18
tr imst (50k)    56.78  58.75
ar padt (120k)   67.83  68.14
en ewt (205k)    74.87  75.77
cs cac (473k)    83.39  83.57

Tree-stack LSTM outperforms MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 92 123
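The scores in these tables are LAS (labeled attachment score): the percentage of words whose predicted head and dependency label both match the gold annotation. A minimal sketch of the metric (function and variable names are illustrative, not from the thesis code):

```python
def las(gold_heads, gold_labels, pred_heads, pred_labels):
    """Labeled attachment score: % of words with correct head AND label."""
    assert len(gold_heads) == len(pred_heads)
    correct = sum(
        1
        for gh, gl, ph, pl in zip(gold_heads, gold_labels, pred_heads, pred_labels)
        if gh == ph and gl == pl
    )
    return 100.0 * correct / len(gold_heads)

# Toy 4-word sentence: heads are word indices (0 = root), labels are relations.
gold_h, gold_l = [2, 0, 2, 3], ["amod", "root", "nsubj", "obj"]
pred_h, pred_l = [2, 0, 2, 2], ["amod", "root", "nsubj", "obj"]
print(las(gold_h, gold_l, pred_h, pred_l))  # 75.0: one head is wrong
```

Unlabeled attachment score (UAS) is the same computation with the label test dropped.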

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 93 123

MLP Parser

Figure Initial model (MLP)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 94 123

Only Action LSTM

Figure Only action LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 95 123

Only β-LSTM

Figure Only β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 96 123

Only σ-LSTM

Figure Only σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang Code   MLP    Only Action  Only-β  Only-σ
hu szeged   66.21  66.87        66.94   67.03
sv lines    71.12  72.05        72.17   72.45
tr imst     57.12  56.87        57.02   57.12
ar padt     67.83  66.67        66.89   66.92
cs cac      83.89  82.23        83.13   83.17
en ewt      75.54  75.43        75.56   75.67

Table Comparison between MLP and "Only" models

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN

Figure The t-RNN unit: LSTM encodings of the head word, the dependent word, and the dependency relation are concatenated and fed to an MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 99 123
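The composition in the figure above can be approximated in a few lines. This is a hedged illustration, not the thesis implementation: a single tanh layer stands in for the recurrent cell, and all names and dimensions are made up.

```python
import math
import random

random.seed(0)

def t_rnn_compose(head_vec, dep_vec, rel_vec, weights):
    """Compose head-word, dependent-word, and relation vectors into a single
    representation with one tanh layer (a stand-in for the t-RNN cell)."""
    x = head_vec + dep_vec + rel_vec  # concatenation of the three inputs
    return [math.tanh(sum(w * xi for w, xi in zip(row, x))) for row in weights]

dim = 4
head, dep, rel = ([random.uniform(-1, 1) for _ in range(dim)] for _ in range(3))
# Weight matrix maps the concatenated (3 * dim) input back to dim outputs.
W = [[random.uniform(-0.1, 0.1) for _ in range(3 * dim)] for _ in range(dim)]
composed = t_rnn_compose(head, dep, rel, W)
print(len(composed))  # 4: same dimensionality as the word vectors
```

Keeping the output the same size as a word vector lets the composed subtree representation be pushed back onto the stack in place of its head word.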

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang Code           without t-RNN  with t-RNN
no nynorsklia (3k)  51.78          53.33
ru taiga (11k)      59.13          60.55
gl treegal (15k)    69.76          70.45
hu szeged (20k)     66.12          68.18
sv lines (49k)      74.04          75.46
tr imst (50k)       58.12          58.75
ar padt (120k)      68.04          68.14
en ewt (204k)       74.87          75.77
cs cac (473k)       82.89          83.57
cs pdt (1M)         81.17          81.164

t-RNN provides a comparative advantage for low-resource languages

Omer Kırnap (Koc University) MSc Thesis September 27 2018 100 123

Ablation Analysis

Overall results of ablation analysis

Lang        MLP    Only A  Only-β  Only-σ  w/o t-RNN  all
hu szeged   66.21  66.87   66.94   67.03   66.12      68.18
sv lines    71.12  72.05   72.17   74.04   72.17      75.46
tr imst     57.12  56.87   57.02   57.12   58.12      58.75
ar padt     67.83  66.67   66.89   66.92   68.04      68.14
cs cac      83.89  82.23   83.13   83.17   82.89      83.57
en ewt      75.54  75.43   75.56   75.67   74.87      75.77

Tree-stack LSTM beats other model variations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of Ablation Experiments

t-RNN's performance contribution increases when the training size decreases

σ-LSTM provides more useful information independent of dataset size

Interconnecting the model's components with t-RNN makes tree-stack LSTM more powerful for low-resource languages (ranked 10th of all and 2nd among transition based parsers)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What does Morphological Feature Embedding provide?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental Settings

We divide the CoNLL18 UD dataset (v2.2) into 4 parts based on the number of training tokens for each language, to better understand our contributions:

Languages having less than 20k tokens

Languages having more than 20k, less than 50k tokens

Languages having more than 50k, less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123
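The grouping above is mechanical; a small sketch of it (the helper name is ours, and the token counts are the ones reported in the following slides):

```python
def size_bucket(n_tokens):
    """Assign a language to one of the 4 training-size groups used here."""
    if n_tokens < 20_000:
        return "<20k"
    elif n_tokens < 50_000:
        return "20k-50k"
    elif n_tokens < 100_000:
        return "50k-100k"
    return ">=100k"

counts = {"ru taiga": 10_479, "sv lines": 48_325, "id gsd": 97_531, "en ewt": 204_585}
print({lang: size_bucket(n) for lang, n in counts.items()})
# {'ru taiga': '<20k', 'sv lines': '20k-50k', 'id gsd': '50k-100k', 'en ewt': '>=100k'}
```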

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having less than 20k training tokens

Lang code      Morph-Feats  no Morph-Feats  # of tokens
no nynorsklia  51.13        53.33             3583
ru taiga       58.32        60.55            10479
sme giella     52.78        53.39            16385
la perseus     49.93        51.6             18184
ug udt         52.78        53.39            19262
sl sst         46.72        48.77            19473
hu szeged      66.23        68.18            20166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having between 50k and 100k tokens

Lang code      Morph-Feats  no Morph-Feats  # of tokens
sv lines       72.18        74.81            48325
fr sequoia     84.36        82.17            50543
en gum         76.44        75.34            53686
ko gsd         73.74        72.54            56687
eu bdt         74.55        73.32            72974
nl lassysmall  76.7         75.8             75134
gl ctg         79.02        79.018           79327
lv lvtb        72.33        72.24            80666
id gsd         75.76        73.97            97531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having more than 100k training tokens

Lang code  Morph-Feats  no Morph-Feats  # of tokens
fa seraji  81.18        81.12             121064
bg btb     84.53        84.55             124336
en ewt     75.77        75.682            204585
ar padt    68.02        68.14             223881
de gsd     71.59        71.32             263804
ca ancora  85.89        85.874            417587
es ancora  84.99        84.78             444617
cs cac     83.57        83.63             472608
cs pdt     81.43        82.12            1173282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle: transitions follow gold moves. Dynamic oracle: transitions follow predicted moves.

In both cases, the log-probability of gold moves is maximized.

Figure The t-RNN parser architecture used in both training regimes (head-word, dependent-word, and dependency-relation LSTMs concatenated and fed to an MLP)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123
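The difference between the two regimes can be sketched as a training-loop skeleton. Everything here is illustrative, not the thesis implementation: the scorer is a random stand-in and the move inventory is a simplified arc-standard set.

```python
import math
import random

random.seed(1)

MOVES = ["SHIFT", "LEFT", "RIGHT"]

def scores(state):
    """Stand-in scorer: returns a probability distribution over moves.
    (A real parser would score the state; `state` is ignored here.)"""
    logits = [random.uniform(-1, 1) for _ in MOVES]
    z = sum(math.exp(l) for l in logits)
    return [math.exp(l) / z for l in logits]

def train_step(state, gold_move, dynamic):
    """One oracle-training step: the loss is -log p(gold) in both regimes;
    only the move used to advance the parser state differs."""
    p = scores(state)
    loss = -math.log(p[MOVES.index(gold_move)])
    # Static oracle advances with the gold move; dynamic with the model's argmax.
    taken = MOVES[p.index(max(p))] if dynamic else gold_move
    return loss, taken

loss, taken = train_step(state=0, gold_move="SHIFT", dynamic=False)
print(taken)  # SHIFT: the static oracle always advances with the gold move
```

Because the dynamic oracle exposes the model to states reached by its own (possibly wrong) predictions, it can reduce the train/test mismatch of static training.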

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning:

1. Using very limited data to train a LM for word and context vectors, and using them to train a parser from scratch
2. Using Facebook's word vectors to train a parser [Bojanowski et al. 2017]
3. Using my own word and context vectors trained on a different language but from the same language family
4. Applying transfer learning with a pre-trained parser

Language      (1)           (2)    (3)    (4)
af afribooms  not provided  75.46  77.43  78.12
kk ktb        20.19         22.31  21.96  23.86
bxr bdt        7.64          9.76   9.93   8.98
kmr mg        20.12         22.57  22.78  23.39

Table LAS values for strategies (1), (2), (3), and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the most beneficial

Training a LM from scratch does not bring useful word and context vectors

Our word and context vectors are still more useful than Facebook's [Bojanowski et al. 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition based parsers can only build projective trees 6

6 Figure from https://stp.lingfil.uu.se/~sara/kurser/5LN455-2014/lectures/5LN455-F8.pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123
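Projectivity is easy to check mechanically: a tree is projective iff no two dependency arcs cross when drawn above the sentence. A small sketch (the function name is ours; head indices are 1-based with 0 as the root):

```python
def is_projective(heads):
    """heads[i] is the head of word i+1 (words are 1-based; 0 = root).
    A dependency tree is projective iff no two arcs cross."""
    arcs = [(min(h, d), max(h, d)) for d, h in enumerate(heads, start=1)]
    for (a1, b1) in arcs:
        for (a2, b2) in arcs:
            # Two arcs cross when exactly one endpoint of one interval
            # lies strictly inside the other interval.
            if a1 < a2 < b1 < b2:
                return False
    return True

print(is_projective([2, 0, 2, 2]))     # True: all arcs are nested
print(is_projective([3, 4, 0, 3, 2]))  # False: arcs (1,3) and (2,4) cross
```

Non-projective treebanks therefore cap the attainable score of a purely projective transition system, which motivates the comparison on the next slide.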

Projective vs Non-projective

We compared our model with the best model for different projectivity ratios

Language     Projectivity  Best (LAS)  Our (LAS)
grc perseus   90.7          79.39       55.03 (20)
eu bdt        95.13         84.22       74.13 (17)
hu szeged     97.8          82.66       68.18 (14)
da ddt        98.26         86.28       76.40 (17)
en gum        99.6          85.05       76.44 (15)
gl treegal   100            74.25       70.45 (10)
gl ctg       100            82.12       79.45 (14)

Table Our model's performance gap decreases as the projectivity ratio increases

7 From the official results page and our projectivity table

Omer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusion:

We introduced "Context, Word, and Morph-feat" embeddings and showed their contribution in transition based dependency parsing

Our Tree-stack LSTM outperformed MLP by removing hand-crafted feature engineering

Tree-stack LSTM performed better with low-resource languages

When the training dataset size increases, tree-stack LSTM loses its advantage

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization, morphological tagging, and dependency parsing performed better. Some also jointly train a language model together with pre-trained embeddings.

Attention Mechanism

Applying attention between σ-LSTM, β-LSTM, or Action-LSTM states may bring a performance improvement.

Morphological Features

Finding a different way to represent morphological features.

Dynamic Oracle vs Beam Training

Although I tried both of them, I could not obtain a performance improvement. There may be convergence problems with our loss function, and other losses (e.g., CRF) may solve this problem.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap, Erenay Dayanık, and Deniz Yuret. 2018. Tree-stack LSTM in Transition Based Dependency Parsing. In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap, Berkay Furkan Onder, and Deniz Yuret. 2017. Parsing with Context Embeddings. In Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann, Carlos Gómez-Rodríguez, and Giorgio Satta. 2011. Dynamic programming algorithms for transition-based dependency parsers. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Volume 1. Association for Computational Linguistics, pages 673-682.

S. Kübler, R. McDonald, and J. Nivre. 2009. Dependency Parsing. Morgan & Claypool, US.

Chris Dyer, Miguel Ballesteros, Wang Ling, Austin Matthews, and Noah A. Smith. 2015. Transition-based dependency parsing with stack long short-term memory. CoRR abs/1505.08075.

Piotr Bojanowski, Edouard Grave, Armand Joulin, and Tomas Mikolov. 2017. Enriching word vectors with subword information. Transactions of the Association for Computational Linguistics 5:135-146.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123

  • Introduction
    • Overview of Dependency Parsing
    • Transition Based Dependency Parsing
      • Related Work
        • Linear Models and their Drawbacks
        • Neural Network Models
          • Model
            • Language Model
            • MLP Parser
            • Tree-stack LSTM Parser
              • Results
                • MLP vs Tree-stack LSTM
                • Morphological Feature Embeddings
                • Static vs Dynamic Oracle Training
                • Transfer Learning
                  • Conclusion
                  • Future Work amp Discussions
Page 88: Transition Based Dependency Parsing with Deep LearningTransition Based Dependency Parsing with Deep Learning Omer K rnap Ko˘c University okirnap@ku.edu.tr September 27, 2018 Omer

4 Results amp Comparisons

Omer Kırnap (Koc University) MSc Thesis September 27 2018 88 123

Results amp Comparisons

Dataset

Dependency parsing of 81

treebanks in 49 languages

All treebanks use standardized

annotation

17 universal

part-of-speech tags

37 universal dependency

relations

Koc-University ranked 7th out

of 33 participants (1st among

transition based parsers)

Dependency parsing of 82

treebanks in 57 languages

All treebanks use

standardized annotation

17 universal

part-of-speech tags

37 universal dependency

relations

Koc-University ranked 16th

out of 30 participants (2nd

among transition based

parsers)

CoNLL17 CoNLL181 Traintest split change 2 Annotation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 89 123

MLP vs Tree-stack LSTM

CoNLL 2018 committee released comparison results of CoNLL17 andCoNLL18 systems tested under the same test sets

Omer Kırnap (Koc University) MSc Thesis September 27 2018 90 123

MLP vs Tree-stack LSTM

2 possible problems of official comparison

1 If the annotation of the tree bank is improved the older parser ishandicapped

2 If the training-test split has changed and the old training data arenow in test data the old parser is favored undeservedly

Omer Kırnap (Koc University) MSc Thesis September 27 2018 91 123

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code MLP Tree-stackru taiga (10k) 5889 6055hu szeged (20k) 6621 6818tr imst (50k) 5678 5875ar padt (120k) 6783 6814en ewt (205k) 7487 7577cs cac (473k) 8339 8357

Tree-stack LSTM outperforms MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 92 123

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 93 123

MLP Parser

MLP

Figure Initial model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 94 123

Only Action LSTM

LSTM LSTM

Figure Only action LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 95 123

Only β-LSTM

LSTM LSTM LSTM

LSTM LSTM MLP

Figure Only β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 96 123

Only σ-LSTM

LSTM LSTM

LSTM LSTM MLP

Figure Only σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang Code MLP Only Action Only-β Only-σhu szeged 6621 6687 6694 6703sv lines 7112 7205 7217 7245tr imst 5712 5687 5702 5712ar padt 6783 6667 6689 6692

cs cac 8389 8223 8313 8317

en ewt 7554 7543 7556 7567

Table Comparison between MLP and rdquoOnlyrdquo models

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 99 123

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang Code without t-RNN with t-RNNno nynorsklia (3k) 5178 5333ru taiga (11k) 5913 6055gl treegal (15k) 6976 7045hu szeged (20k) 6612 6818sv lines (49k) 7404 7546tr imst (50k) 5812 5875

ar padt (120k) 6804 6814

en ewt (204k) 7487 7577

cs cac (473k) 8289 8357

cs pdt (1M) 8117 81164

t-RNN provides comparative advantage for low-resourcelanguages

Omer Kırnap (Koc University) MSc Thesis September 27 2018 100 123

Ablation Analysis

Overall results of ablation analysis

Lang MLP Only A Only-β Only-σ wot-RNN allhu szeged 6621 6687 6694 6703 6612 6818sv lines 7112 7205 7217 7404 7217 7546tr imst 5712 5687 5702 5712 5812 5875ar padt 6783 6667 6689 6692 6804 6814cs cac 8389 8223 8313 8317 8289 8357en ewt 7554 7543 7556 7567 7487 7577

Tree-stack LSTM beats other model variations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of Ablation Experiments

t-RNNrsquos performance contribution increases when the training sizedecreases

σ-LSTM provides more useful information independent from datasetsize

Interconnecting modelrsquos component with t-RNN makes tree-stackLSTM more powerful for low-resource languages (ranked 10th of alland 2nd among transition based parsers)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What does Morphological Feature Embedding provide

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental SettingsWe divide Conll18 UD dataset 22 into 4 parts based on number oftraining tokens for each language to better understand our contributions

Languages having less than 20k tokens

Languages having more than 20k less than 50k tokens

Languages having more than 50k less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having less than 20k training tokens

Lang code Morph-Feats no Morph-Feats of tokensno nynorsklia 5113 5333 3583

ru taiga 5832 6055 10479

sme giella 5278 5339 16385

la perseus 4993 516 18184

ug udt 5278 5339 19262

sl sst 4672 4877 19473

hu szeged 6623 6818 20166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having tokens in between 50k and100k

Lang code Morph-Feats no Morph-Feats of tokenssv lines 7218 7481 48325

fr sequoia 8436 8217 50543

en gum 7644 7534 53686

ko gsd 7374 7254 56687

eu bdt 7455 7332 72974

nl lassymal 767 758 75134

gl ctg 7902 79018 79327

lv lvtb 7233 7224 80666

id gsd 7576 7397 97531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having more than 100k trainingtokens

Lang code Morph-Feats no Morph-Feats of tokensfa seraji 8118 8112 121064

bg btb 8453 8455 124336

en ewt 7577 75682 204585

ar padt 6802 6814 223881

de gsd 7159 7132 263804

ca ancora 8589 85874 417587

es ancora 8499 8478 444617

cs cac 8357 8363 472608

cs pdt 8143 8212 1173282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle transitions using gold movesDynamic oracle transitions using predicted moves

In both cases logp of gold moves maximized

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning1 Using very limited data to train LM for word and context vectors and

use them to train a parser from scratch2 Using Facebookrsquos word vectors to train a parser [Bojanowski et al

2017]3 Using my own word and context vectors trained with different

language but from the same language family4 Applying transfer learning with a pre-trained parser

Language (1) (2) (3) (4)af afribooms not provided 7546 7743 7812kk ktb 2019 2231 2196 2386bxr bdt 764 976 993 898

kmr mg 2012 2257 2278 2339

Table LAS values for strategies (1) (2) (3) and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the mostbeneficial

From scratch LM training does not bring useful word and contextvectors

Our word and context vectors are still more useful than Facebookrsquos[Bojanowski et al 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition Based Parser can only build projective trees 6

6Figure fromhttpstplingfiluuse sarakurser5LN455-2014lectures5LN455-F8pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123

Projective vs Non-projective

We compared our model with the best model for different projectivityratios

Language Projectiviy Best (LAS) Our (LAS)

grc perseus 907 7939 5503 (20)

eu bdt 9513 8422 7413 (17)

hu szeged 978 8266 6818 (14)

da ddt 9826 8628 7640 (17)

en gum 996 8505 7644 (15)

gl treegal 100 7425 7045 (10)

gl ctg 100 8212 7945 (14)

Table Our models performance gap decreases as the projectivity ratio increases

7

7From official results page and our projectivity tableOmer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusionWe introduced ldquoContext Word and Morph-featrdquo embeddings and showedtheir contribution in transition based dependency parsing

Our Tree-stack LSTM outperformed MLP by removing hand-craftedfeature engineering

Tree-stack LSTM performed better with low resource languages

When the training dataset size increases tree-stack LSTM losses itsadvantage

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training


Results & Comparisons

Dataset

CoNLL17:
• Dependency parsing of 81 treebanks in 49 languages
• All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations
• Koç University ranked 7th out of 33 participants (1st among transition based parsers)

CoNLL18:
• Dependency parsing of 82 treebanks in 57 languages
• All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations
• Koç University ranked 16th out of 30 participants (2nd among transition based parsers)

Changes from CoNLL17 to CoNLL18: 1. Train/test split change 2. Annotation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 89 123

MLP vs Tree-stack LSTM

The CoNLL 2018 committee released comparison results of CoNLL17 and CoNLL18 systems tested on the same test sets

Omer Kırnap (Koc University) MSc Thesis September 27 2018 90 123

MLP vs Tree-stack LSTM

Two possible problems with the official comparison:

1. If the annotation of the treebank has improved, the older parser is handicapped

2. If the training-test split has changed and old training data are now in the test data, the old parser is favored undeservedly

Omer Kırnap (Koc University) MSc Thesis September 27 2018 91 123

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code        MLP    Tree-stack
ru taiga (10k)   58.89  60.55
hu szeged (20k)  66.21  68.18
tr imst (50k)    56.78  58.75
ar padt (120k)   67.83  68.14
en ewt (205k)    74.87  75.77
cs cac (473k)    83.39  83.57

Tree-stack LSTM outperforms MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 92 123
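The scores in these result tables are LAS (labeled attachment score). As a minimal sketch of the metric (not the official CoNLL evaluation script, which additionally handles tokenization mismatches), LAS can be computed as:

```python
def las(gold, pred):
    """Labeled attachment score: the percentage of tokens whose
    predicted (head, label) pair exactly matches the gold one.
    gold, pred: lists of (head_index, dependency_label), one per token."""
    assert len(gold) == len(pred)
    correct = sum(g == p for g, p in zip(gold, pred))
    return 100.0 * correct / len(gold)
```

For example, if one of two tokens gets the correct head and label, the LAS is 50.0.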

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 93 123

MLP Parser

Figure: Initial model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 94 123

Only Action LSTM

Figure: Only action LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 95 123

Only β-LSTM

Figure: Only β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 96 123

Only σ-LSTM

Figure: Only σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang Code  MLP    Only Action  Only-β  Only-σ
hu szeged  66.21  66.87        66.94   67.03
sv lines   71.12  72.05        72.17   72.45
tr imst    57.12  56.87        57.02   57.12
ar padt    67.83  66.67        66.89   66.92
cs cac     83.89  82.23        83.13   83.17
en ewt     75.54  75.43        75.56   75.67

Table: Comparison between MLP and "Only" models

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN

Figure: t-RNN architecture (head word, dependent word and dependency relation embeddings pass through LSTMs, are concatenated, and feed an MLP)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 99 123

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang Code           without t-RNN  with t-RNN
no nynorsklia (3k)  51.78          53.33
ru taiga (11k)      59.13          60.55
gl treegal (15k)    69.76          70.45
hu szeged (20k)     66.12          68.18
sv lines (49k)      74.04          75.46
tr imst (50k)       58.12          58.75
ar padt (120k)      68.04          68.14
en ewt (204k)       74.87          75.77
cs cac (473k)       82.89          83.57
cs pdt (1M)         81.17          81.164

t-RNN provides a comparative advantage for low-resource languages

Omer Kırnap (Koc University) MSc Thesis September 27 2018 100 123

Ablation Analysis

Overall results of ablation analysis

Lang       MLP    Only A  Only-β  Only-σ  w/o t-RNN  all
hu szeged  66.21  66.87   66.94   67.03   66.12      68.18
sv lines   71.12  72.05   72.17   74.04   72.17      75.46
tr imst    57.12  56.87   57.02   57.12   58.12      58.75
ar padt    67.83  66.67   66.89   66.92   68.04      68.14
cs cac     83.89  82.23   83.13   83.17   82.89      83.57
en ewt     75.54  75.43   75.56   75.67   74.87      75.77

Tree-stack LSTM beats other model variations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of Ablation Experiments

• t-RNN's performance contribution increases as the training size decreases

• σ-LSTM provides more useful information, independent of dataset size

• Interconnecting the model's components with t-RNN makes tree-stack LSTM more powerful for low-resource languages (ranked 10th overall and 2nd among transition based parsers)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What does Morphological Feature Embedding provide?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123
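As background, one common way to realize a morphological feature embedding is to split the UD FEATS string into Feature=Value pairs, embed each pair, and sum the vectors. The sketch below is illustrative only (random vectors instead of learned ones; the thesis's exact scheme may differ):

```python
import random

DIM = 8
table = {}  # one vector per "Feature=Value" pair, created on demand

def embed_feats(feats):
    """Embed a UD feature string such as 'Case=Nom|Number=Sing'
    as the sum of per-feature embeddings; '_' (no features) maps
    to the zero vector."""
    vec = [0.0] * DIM
    if feats == "_":
        return vec
    for fv in feats.split("|"):
        if fv not in table:
            rnd = random.Random(fv)  # deterministic stand-in for a learned embedding
            table[fv] = [rnd.uniform(-1, 1) for _ in range(DIM)]
        vec = [a + b for a, b in zip(vec, table[fv])]
    return vec
```

Because the pair embeddings are summed, the representation is insensitive to the order in which features are listed.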

Contribution of Morph-feat Embeddings

Experimental Settings: We divide the CoNLL18 UD dataset (v2.2) into 4 parts based on the number of training tokens for each language, to better understand our contributions

Languages having less than 20k tokens

Languages having more than 20k and less than 50k tokens

Languages having more than 50k and less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having less than 20k training tokens

Lang code      Morph-Feats  no Morph-Feats  # of tokens
no nynorsklia  51.13        53.33           3583
ru taiga       58.32        60.55           10479
sme giella     52.78        53.39           16385
la perseus     49.93        51.6            18184
ug udt         52.78        53.39           19262
sl sst         46.72        48.77           19473
hu szeged      66.23        68.18           20166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having between 50k and 100k training tokens

Lang code      Morph-Feats  no Morph-Feats  # of tokens
sv lines       72.18        74.81           48325
fr sequoia     84.36        82.17           50543
en gum         76.44        75.34           53686
ko gsd         73.74        72.54           56687
eu bdt         74.55        73.32           72974
nl lassysmall  76.7         75.8            75134
gl ctg         79.02        79.018          79327
lv lvtb        72.33        72.24           80666
id gsd         75.76        73.97           97531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having more than 100k training tokens

Lang code  Morph-Feats  no Morph-Feats  # of tokens
fa seraji  81.18        81.12           121064
bg btb     84.53        84.55           124336
en ewt     75.77        75.682          204585
ar padt    68.02        68.14           223881
de gsd     71.59        71.32           263804
ca ancora  85.89        85.874          417587
es ancora  84.99        84.78           444617
cs cac     83.57        83.63           472608
cs pdt     81.43        82.12           1173282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle: transitions follow gold moves.
Dynamic oracle: transitions follow predicted moves.

In both cases, the log-probability of gold moves is maximized.

Figure: t-RNN architecture (head word, dependent word and dependency relation embeddings pass through LSTMs, are concatenated, and feed an MLP)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123
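The distinction can be sketched as a single training step. The softmax scorer and function names below are illustrative assumptions, not the thesis implementation; the point is that the loss is identical in both regimes, and only the transition actually followed differs:

```python
import math

def softmax(scores):
    """Numerically stable softmax over a list of raw scores."""
    mx = max(scores)
    exps = [math.exp(s - mx) for s in scores]
    z = sum(exps)
    return [e / z for e in exps]

def oracle_step(scores, gold_move, dynamic=False):
    """One oracle-training step. The loss always maximizes
    log p(gold move); the parser then continues from the gold
    move (static oracle) or the model's argmax (dynamic oracle)."""
    probs = softmax(scores)
    loss = -math.log(probs[gold_move])
    if dynamic:
        followed = max(range(len(scores)), key=scores.__getitem__)
    else:
        followed = gold_move
    return loss, followed
```

With scores [2.0, 1.0, 0.1] and gold move 1, both regimes incur the same loss, but static training continues from move 1 while dynamic training continues from the model's prediction, move 0, exposing the model to its own mistakes.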

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning:

1. Using very limited data to train an LM for word and context vectors, and using them to train a parser from scratch
2. Using Facebook's word vectors to train a parser [Bojanowski et al. 2017]
3. Using my own word and context vectors, trained on a different language from the same language family
4. Applying transfer learning with a pre-trained parser

Language      (1)           (2)    (3)    (4)
af afribooms  not provided  75.46  77.43  78.12
kk ktb        20.19         22.31  21.96  23.86
bxr bdt       7.64          9.76   9.93   8.98
kmr mg        20.12         22.57  22.78  23.39

Table: LAS values for strategies (1), (2), (3) and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

• Applying transfer learning with a pre-trained parser is the most beneficial

• From-scratch LM training does not yield useful word and context vectors

• Our word and context vectors are still more useful than Facebook's [Bojanowski et al. 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition based parsers can only build projective trees 6

6 Figure from https://stp.lingfil.uu.se/~sara/kurser/5LN455-2014/lectures/5LN455-F8.pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123
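A tree is projective when no two dependency arcs cross if drawn above the sentence. A minimal sketch of a checker (the `heads` encoding here is an assumption chosen for illustration):

```python
def is_projective(heads):
    """heads[i] is the head of token i+1 (1-based token positions);
    0 denotes the root. A tree is projective iff no two arcs cross."""
    arcs = [(min(i + 1, h), max(i + 1, h)) for i, h in enumerate(heads)]
    for a in arcs:
        for b in arcs:
            # arcs (a0, a1) and (b0, b1) cross iff a0 < b0 < a1 < b1
            if a[0] < b[0] < a[1] < b[1]:
                return False
    return True
```

For example, heads = [3, 4, 0, 3] is non-projective because the arcs 1→3 and 2→4 cross, so such a tree is out of reach for a purely transition based (arc-projective) parser.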

Projective vs Non-projective

We compared our model with the best model for different projectivity ratios

Language     Projectivity  Best (LAS)  Our (LAS)
grc perseus  90.7          79.39       55.03 (20)
eu bdt       95.13         84.22       74.13 (17)
hu szeged    97.8          82.66       68.18 (14)
da ddt       98.26         86.28       76.40 (17)
en gum       99.6          85.05       76.44 (15)
gl treegal   100           74.25       70.45 (10)
gl ctg       100           82.12       79.45 (14)

Table: Our model's performance gap decreases as the projectivity ratio increases 7

7 From the official results page and our projectivity table
Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusion:
• We introduced "Context, Word and Morph-feat" embeddings and showed their contribution in transition based dependency parsing

• Our Tree-stack LSTM outperformed the MLP by removing hand-crafted feature engineering

• Tree-stack LSTM performed better on low-resource languages

• As the training dataset size increases, tree-stack LSTM loses its advantage

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization, morphological tagging and dependency parsing performed better. Some also jointly train a language model together with pre-trained embeddings.

Attention Mechanism

Applying attention between σ-LSTM, β-LSTM or Action-LSTM states may bring a performance improvement

Morphological Features

Finding different ways to represent morphological features

Dynamic Oracle vs Beam Training

Although I tried both of them, I could not obtain a performance improvement. There may be convergence problems with our loss function, and other losses (e.g. CRF) may solve this problem.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap, Erenay Dayanık and Deniz Yuret. 2018. Tree-stack LSTM in Transition Based Dependency Parsing. In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap, Berkay Furkan Önder and Deniz Yuret. 2017. Parsing with Context Embeddings. In Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann, Carlos Gómez-Rodríguez and Giorgio Satta. 2011. Dynamic programming algorithms for transition-based dependency parsers. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Volume 1. Association for Computational Linguistics, pages 673-682.

S. Kübler, R. McDonald and J. Nivre. 2009. Dependency Parsing. Morgan & Claypool, US.

Chris Dyer, Miguel Ballesteros, Wang Ling, Austin Matthews and Noah A. Smith. 2015. Transition-based dependency parsing with stack long short-term memory. CoRR abs/1505.08075.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123

  • Introduction
    • Overview of Dependency Parsing
    • Transition Based Dependency Parsing
  • Related Work
    • Linear Models and their Drawbacks
    • Neural Network Models
  • Model
    • Language Model
    • MLP Parser
    • Tree-stack LSTM Parser
  • Results
    • MLP vs Tree-stack LSTM
    • Morphological Feature Embeddings
    • Static vs Dynamic Oracle Training
    • Transfer Learning
  • Conclusion
  • Future Work & Discussions
Page 90: Transition Based Dependency Parsing with Deep LearningTransition Based Dependency Parsing with Deep Learning Omer K rnap Ko˘c University okirnap@ku.edu.tr September 27, 2018 Omer

MLP vs Tree-stack LSTM

CoNLL 2018 committee released comparison results of CoNLL17 andCoNLL18 systems tested under the same test sets

Omer Kırnap (Koc University) MSc Thesis September 27 2018 90 123

MLP vs Tree-stack LSTM

2 possible problems of official comparison

1 If the annotation of the tree bank is improved the older parser ishandicapped

2 If the training-test split has changed and the old training data arenow in test data the old parser is favored undeservedly

Omer Kırnap (Koc University) MSc Thesis September 27 2018 91 123

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code MLP Tree-stackru taiga (10k) 5889 6055hu szeged (20k) 6621 6818tr imst (50k) 5678 5875ar padt (120k) 6783 6814en ewt (205k) 7487 7577cs cac (473k) 8339 8357

Tree-stack LSTM outperforms MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 92 123

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 93 123

MLP Parser

MLP

Figure Initial model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 94 123

Only Action LSTM

LSTM LSTM

Figure Only action LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 95 123

Only β-LSTM

LSTM LSTM LSTM

LSTM LSTM MLP

Figure Only β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 96 123

Only σ-LSTM

LSTM LSTM

LSTM LSTM MLP

Figure Only σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang Code MLP Only Action Only-β Only-σhu szeged 6621 6687 6694 6703sv lines 7112 7205 7217 7245tr imst 5712 5687 5702 5712ar padt 6783 6667 6689 6692

cs cac 8389 8223 8313 8317

en ewt 7554 7543 7556 7567

Table Comparison between MLP and rdquoOnlyrdquo models

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 99 123

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang Code without t-RNN with t-RNNno nynorsklia (3k) 5178 5333ru taiga (11k) 5913 6055gl treegal (15k) 6976 7045hu szeged (20k) 6612 6818sv lines (49k) 7404 7546tr imst (50k) 5812 5875

ar padt (120k) 6804 6814

en ewt (204k) 7487 7577

cs cac (473k) 8289 8357

cs pdt (1M) 8117 81164

t-RNN provides comparative advantage for low-resourcelanguages

Omer Kırnap (Koc University) MSc Thesis September 27 2018 100 123

Ablation Analysis

Overall results of ablation analysis

Lang MLP Only A Only-β Only-σ wot-RNN allhu szeged 6621 6687 6694 6703 6612 6818sv lines 7112 7205 7217 7404 7217 7546tr imst 5712 5687 5702 5712 5812 5875ar padt 6783 6667 6689 6692 6804 6814cs cac 8389 8223 8313 8317 8289 8357en ewt 7554 7543 7556 7567 7487 7577

Tree-stack LSTM beats other model variations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of Ablation Experiments

t-RNNrsquos performance contribution increases when the training sizedecreases

σ-LSTM provides more useful information independent from datasetsize

Interconnecting modelrsquos component with t-RNN makes tree-stackLSTM more powerful for low-resource languages (ranked 10th of alland 2nd among transition based parsers)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What does Morphological Feature Embedding provide

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental SettingsWe divide Conll18 UD dataset 22 into 4 parts based on number oftraining tokens for each language to better understand our contributions

Languages having less than 20k tokens

Languages having more than 20k less than 50k tokens

Languages having more than 50k less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having less than 20k training tokens

Lang code Morph-Feats no Morph-Feats of tokensno nynorsklia 5113 5333 3583

ru taiga 5832 6055 10479

sme giella 5278 5339 16385

la perseus 4993 516 18184

ug udt 5278 5339 19262

sl sst 4672 4877 19473

hu szeged 6623 6818 20166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having tokens in between 50k and100k

Lang code Morph-Feats no Morph-Feats of tokenssv lines 7218 7481 48325

fr sequoia 8436 8217 50543

en gum 7644 7534 53686

ko gsd 7374 7254 56687

eu bdt 7455 7332 72974

nl lassymal 767 758 75134

gl ctg 7902 79018 79327

lv lvtb 7233 7224 80666

id gsd 7576 7397 97531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having more than 100k trainingtokens

Lang code Morph-Feats no Morph-Feats of tokensfa seraji 8118 8112 121064

bg btb 8453 8455 124336

en ewt 7577 75682 204585

ar padt 6802 6814 223881

de gsd 7159 7132 263804

ca ancora 8589 85874 417587

es ancora 8499 8478 444617

cs cac 8357 8363 472608

cs pdt 8143 8212 1173282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle transitions using gold movesDynamic oracle transitions using predicted moves

In both cases logp of gold moves maximized

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning1 Using very limited data to train LM for word and context vectors and

use them to train a parser from scratch2 Using Facebookrsquos word vectors to train a parser [Bojanowski et al

2017]3 Using my own word and context vectors trained with different

language but from the same language family4 Applying transfer learning with a pre-trained parser

Language (1) (2) (3) (4)af afribooms not provided 7546 7743 7812kk ktb 2019 2231 2196 2386bxr bdt 764 976 993 898

kmr mg 2012 2257 2278 2339

Table LAS values for strategies (1) (2) (3) and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the mostbeneficial

From scratch LM training does not bring useful word and contextvectors

Our word and context vectors are still more useful than Facebookrsquos[Bojanowski et al 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition Based Parser can only build projective trees 6

6Figure fromhttpstplingfiluuse sarakurser5LN455-2014lectures5LN455-F8pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123

Projective vs Non-projective

We compared our model with the best model for different projectivityratios

Language Projectiviy Best (LAS) Our (LAS)

grc perseus 907 7939 5503 (20)

eu bdt 9513 8422 7413 (17)

hu szeged 978 8266 6818 (14)

da ddt 9826 8628 7640 (17)

en gum 996 8505 7644 (15)

gl treegal 100 7425 7045 (10)

gl ctg 100 8212 7945 (14)

Table Our models performance gap decreases as the projectivity ratio increases

7

7From official results page and our projectivity tableOmer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusionWe introduced ldquoContext Word and Morph-featrdquo embeddings and showedtheir contribution in transition based dependency parsing

Our Tree-stack LSTM outperformed MLP by removing hand-craftedfeature engineering

Tree-stack LSTM performed better with low resource languages

When the training dataset size increases tree-stack LSTM losses itsadvantage

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization morphological taggingand dependency parsing performed better Some are also jointly trained alanguage model together with pre-trained embeddings

Attention Mechanism

Applying attention in between σ-LSTM states or β-LSTM orAction-LSTM may bring performance improvement

Morphological Features

Finding different way to represent morphological features

Dynamic Oracle vs Beam Training

Although I tried both of them I could not obtain performanceimprovement There may be convergence problems with our loss functionand another losses (CRF) may solve this problem

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap Erenay Dayanık and Deniz Yuret 2018 Tree-stackLSTM in Transition Based Dependency Parsing In Proceedings ofthe CoNLL 2018 Shared Task Multilingual Parsing from Raw Text toUniversal Dependencies

Omer Kırnap Berkay Furkan Onder and Deniz Yuret 2017 Parsingwith Context Embeddings In Proceedings of the CoNLL 2017 SharedTask Multilingual Parsing from Raw Text to Universal Dependencies

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann Carlos Gomez-Rodriguez and Giorgio Satta 2011Dynamic programming algorithms for transition-based dependencyparsers In Proceedings of the 49th Annual Meeting of theAssociation for Computational Linguistics Human LanguageTechnologies-Volume 1 Association for Computational Linguisticspages 673682

S Kbler R McDonald and J Nivre 2009 Dependency parsingMorgan amp Claypool US

Chris Dyer Miguel Ballesteros Wang Ling Austin Matthews andNoah A Smith 2015 Transition based dependency parsing withstack long-short term memory CoRR abs150508075

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123

  • Introduction
    • Overview of Dependency Parsing
    • Transition Based Dependency Parsing
      • Related Work
        • Linear Models and their Drawbacks
        • Neural Network Models
          • Model
            • Language Model
            • MLP Parser
            • Tree-stack LSTM Parser
              • Results
                • MLP vs Tree-stack LSTM
                • Morphological Feature Embeddings
                • Static vs Dynamic Oracle Training
                • Transfer Learning
                  • Conclusion
                  • Future Work amp Discussions
Page 91: Transition Based Dependency Parsing with Deep LearningTransition Based Dependency Parsing with Deep Learning Omer K rnap Ko˘c University okirnap@ku.edu.tr September 27, 2018 Omer

MLP vs Tree-stack LSTM

2 possible problems of official comparison

1 If the annotation of the tree bank is improved the older parser ishandicapped

2 If the training-test split has changed and the old training data arenow in test data the old parser is favored undeservedly

Omer Kırnap (Koc University) MSc Thesis September 27 2018 91 123

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code MLP Tree-stackru taiga (10k) 5889 6055hu szeged (20k) 6621 6818tr imst (50k) 5678 5875ar padt (120k) 6783 6814en ewt (205k) 7487 7577cs cac (473k) 8339 8357

Tree-stack LSTM outperforms MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 92 123

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 93 123

MLP Parser

MLP

Figure Initial model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 94 123

Only Action LSTM

LSTM LSTM

Figure Only action LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 95 123

Only β-LSTM

LSTM LSTM LSTM

LSTM LSTM MLP

Figure Only β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 96 123

Only σ-LSTM

LSTM LSTM

LSTM LSTM MLP

Figure Only σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang Code MLP Only Action Only-β Only-σhu szeged 6621 6687 6694 6703sv lines 7112 7205 7217 7245tr imst 5712 5687 5702 5712ar padt 6783 6667 6689 6692

cs cac 8389 8223 8313 8317

en ewt 7554 7543 7556 7567

Table Comparison between MLP and rdquoOnlyrdquo models

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 99 123

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang Code without t-RNN with t-RNNno nynorsklia (3k) 5178 5333ru taiga (11k) 5913 6055gl treegal (15k) 6976 7045hu szeged (20k) 6612 6818sv lines (49k) 7404 7546tr imst (50k) 5812 5875

ar padt (120k) 6804 6814

en ewt (204k) 7487 7577

cs cac (473k) 8289 8357

cs pdt (1M) 8117 81164

t-RNN provides comparative advantage for low-resourcelanguages

Omer Kırnap (Koc University) MSc Thesis September 27 2018 100 123

Ablation Analysis

Overall results of ablation analysis

Lang MLP Only A Only-β Only-σ wot-RNN allhu szeged 6621 6687 6694 6703 6612 6818sv lines 7112 7205 7217 7404 7217 7546tr imst 5712 5687 5702 5712 5812 5875ar padt 6783 6667 6689 6692 6804 6814cs cac 8389 8223 8313 8317 8289 8357en ewt 7554 7543 7556 7567 7487 7577

Tree-stack LSTM beats other model variations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of Ablation Experiments

t-RNNrsquos performance contribution increases when the training sizedecreases

σ-LSTM provides more useful information independent from datasetsize

Interconnecting modelrsquos component with t-RNN makes tree-stackLSTM more powerful for low-resource languages (ranked 10th of alland 2nd among transition based parsers)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What does Morphological Feature Embedding provide

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental SettingsWe divide Conll18 UD dataset 22 into 4 parts based on number oftraining tokens for each language to better understand our contributions

Languages having less than 20k tokens

Languages having more than 20k less than 50k tokens

Languages having more than 50k less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having less than 20k training tokens

Lang code Morph-Feats no Morph-Feats of tokensno nynorsklia 5113 5333 3583

ru taiga 5832 6055 10479

sme giella 5278 5339 16385

la perseus 4993 516 18184

ug udt 5278 5339 19262

sl sst 4672 4877 19473

hu szeged 6623 6818 20166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having tokens in between 50k and100k

Lang code Morph-Feats no Morph-Feats of tokenssv lines 7218 7481 48325

fr sequoia 8436 8217 50543

en gum 7644 7534 53686

ko gsd 7374 7254 56687

eu bdt 7455 7332 72974

nl lassymal 767 758 75134

gl ctg 7902 79018 79327

lv lvtb 7233 7224 80666

id gsd 7576 7397 97531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having more than 100k trainingtokens

Lang code Morph-Feats no Morph-Feats of tokensfa seraji 8118 8112 121064

bg btb 8453 8455 124336

en ewt 7577 75682 204585

ar padt 6802 6814 223881

de gsd 7159 7132 263804

ca ancora 8589 85874 417587

es ancora 8499 8478 444617

cs cac 8357 8363 472608

cs pdt 8143 8212 1173282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code        MLP    Tree-stack
ru taiga (10k)   58.89  60.55
hu szeged (20k)  66.21  68.18
tr imst (50k)    56.78  58.75
ar padt (120k)   67.83  68.14
en ewt (205k)    74.87  75.77
cs cac (473k)    83.39  83.57

Tree-stack LSTM outperforms MLP

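All comparisons on these slides use LAS. As a reminder, a minimal sketch of how LAS is computed; the `(head, label)` tuple representation here is an illustrative assumption, not the CoNLL-U format:

```python
def las(gold, pred):
    # Labeled attachment score: percentage of words whose predicted
    # head index AND dependency label both match the gold annotation.
    assert len(gold) == len(pred)
    correct = sum(1 for g, p in zip(gold, pred) if g == p)
    return 100.0 * correct / len(gold)

# Toy 4-word sentence: one predicted head is wrong, so LAS = 3/4.
gold = [(2, "nsubj"), (0, "root"), (2, "obj"), (2, "advmod")]
pred = [(2, "nsubj"), (0, "root"), (2, "obj"), (3, "advmod")]
print(las(gold, pred))  # 75.0
```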

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM


MLP Parser

Figure: Initial model (the MLP parser).

Only Action LSTM

Figure: Only action LSTM.

Only β-LSTM

Figure: Only β-LSTM.

Only σ-LSTM

Figure: Only σ-LSTM.

Ablation Analysis Results

Lang Code   MLP    Only Action  Only-β  Only-σ
hu szeged   66.21  66.87        66.94   67.03
sv lines    71.12  72.05        72.17   72.45
tr imst     57.12  56.87        57.02   57.12
ar padt     67.83  66.67        66.89   66.92
cs cac      83.89  82.23        83.13   83.17
en ewt      75.54  75.43        75.56   75.67

Table: Comparison between MLP and "Only" models.


Ablation of t-RNN

Figure: The t-RNN component: the head word, dependent word, and dependency relation representations are concatenated and passed through an MLP.
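The composition step sketched above can be written in a few lines; the dimensions and the single hidden layer are illustrative assumptions, not the thesis hyper-parameters:

```python
import numpy as np

rng = np.random.default_rng(0)

# Illustrative sizes (assumptions, not the thesis settings).
d_word, d_rel, d_hid = 8, 4, 16

# Stand-ins for the LSTM outputs of the head/dependent words and the
# embedding of the dependency relation connecting them.
head = rng.normal(size=d_word)
dependent = rng.normal(size=d_word)
relation = rng.normal(size=d_rel)

# MLP weights: concat -> hidden (ReLU) -> new head representation.
w1 = rng.normal(size=(d_hid, 2 * d_word + d_rel))
w2 = rng.normal(size=(d_word, d_hid))

x = np.concatenate([head, dependent, relation])
composed = w2 @ np.maximum(0.0, w1 @ x)

# The composed vector replaces the head word's representation,
# so the subtree built so far is summarized recursively.
print(composed.shape)  # (8,)
```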

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang Code            without t-RNN  with t-RNN
no nynorsklia (3k)   51.78          53.33
ru taiga (11k)       59.13          60.55
gl treegal (15k)     69.76          70.45
hu szeged (20k)      66.12          68.18
sv lines (49k)       74.04          75.46
tr imst (50k)        58.12          58.75
ar padt (120k)       68.04          68.14
en ewt (204k)        74.87          75.77
cs cac (473k)        82.89          83.57
cs pdt (1M)          81.17          81.16

t-RNN provides a comparative advantage for low-resource languages.


Ablation Analysis

Overall results of ablation analysis

Lang        MLP    Only A  Only-β  Only-σ  w/o t-RNN  all
hu szeged   66.21  66.87   66.94   67.03   66.12      68.18
sv lines    71.12  72.05   72.17   74.04   72.17      75.46
tr imst     57.12  56.87   57.02   57.12   58.12      58.75
ar padt     67.83  66.67   66.89   66.92   68.04      68.14
cs cac      83.89  82.23   83.13   83.17   82.89      83.57
en ewt      75.54  75.43   75.56   75.67   74.87      75.77

Tree-stack LSTM beats other model variations


Ablation Analysis

Conclusions of Ablation Experiments

t-RNN's performance contribution increases as the training size decreases.

σ-LSTM provides more useful information, independent of dataset size.

Interconnecting the model's components with t-RNN makes tree-stack LSTM more powerful for low-resource languages (ranked 10th overall and 2nd among transition-based parsers).


What does Morphological Feature Embedding provide?


Contribution of Morph-feat Embeddings

Experimental Settings:
We divide the CoNLL18 UD dataset (v2.2) into 4 parts based on the number of training tokens for each language, to better understand our contributions.

Languages having less than 20k tokens

Languages having more than 20k and less than 50k tokens

Languages having more than 50k and less than 100k tokens

Languages having 100k tokens or more

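The split above can be expressed as a small helper; the bucket labels are my own shorthand, while the thresholds come from the slide:

```python
def size_bucket(n_train_tokens):
    # Four dataset-size buckets used to group the CoNLL18 UD treebanks.
    if n_train_tokens < 20_000:
        return "<20k"
    if n_train_tokens < 50_000:
        return "20k-50k"
    if n_train_tokens < 100_000:
        return "50k-100k"
    return ">=100k"

# First and last counts are from the result tables that follow;
# 35_000 is an arbitrary value for the middle bucket.
print(size_bucket(10_479))    # ru taiga -> <20k
print(size_bucket(35_000))    # -> 20k-50k
print(size_bucket(204_585))   # en ewt   -> >=100k
```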

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having less than 20k training tokens

Lang code      Morph-Feats  no Morph-Feats  # of tokens
no nynorsklia  51.13        53.33             3,583
ru taiga       58.32        60.55            10,479
sme giella     52.78        53.39            16,385
la perseus     49.93        51.6             18,184
ug udt         52.78        53.39            19,262
sl sst         46.72        48.77            19,473
hu szeged      66.23        68.18            20,166

Not useful for languages having less than 20k training tokens


Contribution of Morph-feat embeddings

Morph-feat experiments for languages having between 50k and 100k training tokens

Lang code      Morph-Feats  no Morph-Feats  # of tokens
sv lines       72.18        74.81           48,325
fr sequoia     84.36        82.17           50,543
en gum         76.44        75.34           53,686
ko gsd         73.74        72.54           56,687
eu bdt         74.55        73.32           72,974
nl lassysmall  76.7         75.8            75,134
gl ctg         79.02        79.018          79,327
lv lvtb        72.33        72.24           80,666
id gsd         75.76        73.97           97,531

Beneficial for languages with 50k-100k training tokens


Contribution of Morph-feat embeddings

Morph-feat experiments for languages having more than 100k training tokens

Lang code   Morph-Feats  no Morph-Feats  # of tokens
fa seraji   81.18        81.12             121,064
bg btb      84.53        84.55             124,336
en ewt      75.77        75.682            204,585
ar padt     68.02        68.14             223,881
de gsd      71.59        71.32             263,804
ca ancora   85.89        85.874            417,587
es ancora   84.99        84.78             444,617
cs cac      83.57        83.63             472,608
cs pdt      81.43        82.12           1,173,282

Neutral for languages having more than 100k training tokens


Static vs Dynamic Oracle Training

Static oracle: transitions follow gold moves.
Dynamic oracle: transitions follow predicted moves.

In both cases, the log-probability of gold moves is maximized.

Figure: Tree-stack LSTM with t-RNN (head word, dependent word, and dependency relation are fed through LSTMs, concatenated, and passed to an MLP).
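The difference between the two regimes can be sketched as a training loop; the toy `predict` scorer, the transition names, and the list-based `state` are hypothetical stand-ins for a real parser configuration:

```python
import math

def train_sentence(gold_moves, predict, follow_predicted):
    # Schematic oracle training for one sentence. `state` here is just
    # the list of transitions taken so far; a real parser would hold a
    # full configuration (stack, buffer, arcs).
    state, loss = [], 0.0
    for gold in gold_moves:
        probs = predict(state)                 # distribution over moves
        loss -= math.log(probs[gold])          # always maximize log p(gold)
        if follow_predicted:                   # dynamic oracle:
            move = max(probs, key=probs.get)   #   follow the model's move
        else:                                  # static oracle:
            move = gold                        #   follow the gold move
        state.append(move)                     # apply the transition
    return loss

# Toy scorer that ignores the state, so both losses coincide here;
# with a state-dependent model, the states explored (and loss) differ.
def predict(state):
    return {"SHIFT": 0.5, "LEFT-ARC": 0.3, "RIGHT-ARC": 0.2}

gold = ["SHIFT", "SHIFT", "LEFT-ARC"]
print(round(train_sentence(gold, predict, follow_predicted=False), 3))
print(round(train_sentence(gold, predict, follow_predicted=True), 3))
```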

Static vs Dynamic Oracle Training

Figure: Results are very close for training tokens less than 20k.

Static vs Dynamic Oracle Training

Figure: Results are very close for training tokens between 20k and 50k.

Static vs Dynamic Oracle Training

Figure: Results are very close for training tokens more than 50k.

How about languages with less than 20k training tokens?


Transfer Learning

There are 4 possible types of transfer learning:
1. Using very limited data to train an LM for word and context vectors, and using them to train a parser from scratch
2. Using Facebook's word vectors to train a parser [Bojanowski et al., 2017]
3. Using my own word and context vectors, trained on a different language from the same language family
4. Applying transfer learning with a pre-trained parser

Language       (1)           (2)    (3)    (4)
af afribooms   not provided  75.46  77.43  78.12
kk ktb         20.19         22.31  21.96  23.86
bxr bdt         7.64          9.76   9.93   8.98
kmr mg         20.12         22.57  22.78  23.39

Table: LAS values for strategies (1), (2), (3), and (4).


Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the most beneficial.

From-scratch LM training does not yield useful word and context vectors.

Our word and context vectors are still more useful than Facebook's [Bojanowski et al., 2017].


Projectivity

Transition-based parsers can only build projective trees.

Figure from https://stp.lingfil.uu.se/~sara/kurser/5LN455-2014/lectures/5LN455-F8.pdf

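A tree is projective when no arcs cross; equivalently, every word strictly inside an arc's span must be a descendant of that arc's head. A minimal check under that definition, with heads given as a 1-indexed list (0 = root) and the tree assumed cycle-free:

```python
def is_projective(heads):
    # heads[i] is the head of word i+1; 0 denotes the artificial root.
    def ancestors(i):
        seen = set()
        while i != 0:
            i = heads[i - 1]
            seen.add(i)
        return seen

    for d in range(1, len(heads) + 1):
        h = heads[d - 1]
        for k in range(min(h, d) + 1, max(h, d)):
            # Every word strictly inside the arc (h, d) must be
            # dominated by h; otherwise some arc crosses this one.
            if k != h and h not in ancestors(k):
                return False
    return True

print(is_projective([2, 0, 2]))     # True: no crossing arcs
print(is_projective([3, 0, 2, 2]))  # False: arc 3->1 crosses arc 0->2
```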

Projective vs Non-projective

We compared our model with the best model for different projectivity ratios.

Language     Projectivity  Best (LAS)  Our (LAS)
grc perseus   90.7         79.39       55.03 (20)
eu bdt        95.13        84.22       74.13 (17)
hu szeged     97.8         82.66       68.18 (14)
da ddt        98.26        86.28       76.40 (17)
en gum        99.6         85.05       76.44 (15)
gl treegal   100           74.25       70.45 (10)
gl ctg       100           82.12       79.45 (14)

Table: Our model's performance gap decreases as the projectivity ratio increases. (From the official results page and our projectivity table.)

Conclusions


Conclusion

In conclusion:
We introduced "Context, Word, and Morph-feat" embeddings and showed their contribution to transition-based dependency parsing.

Our Tree-stack LSTM outperformed the MLP while removing hand-crafted feature engineering.

Tree-stack LSTM performed better on low-resource languages.

When the training dataset size increases, tree-stack LSTM loses its advantage.


Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization, morphological tagging, and dependency parsing performed better. Some also jointly train a language model together with pre-trained embeddings.

Attention Mechanism

Applying attention between σ-LSTM, β-LSTM, or Action-LSTM states may bring a performance improvement.

Morphological Features

Finding different ways to represent morphological features.

Dynamic Oracle vs Beam Training

Although I tried both of them, I could not obtain a performance improvement. There may be convergence problems with our loss function, and other losses (e.g., CRF) may solve this problem.


Publications

Omer Kırnap, Erenay Dayanık, and Deniz Yuret. 2018. Tree-stack LSTM in Transition Based Dependency Parsing. In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap, Berkay Furkan Onder, and Deniz Yuret. 2017. Parsing with Context Embeddings. In Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.


References

Marco Kuhlmann, Carlos Gómez-Rodríguez, and Giorgio Satta. 2011. Dynamic programming algorithms for transition-based dependency parsers. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Volume 1. Association for Computational Linguistics, pages 673-682.

S. Kübler, R. McDonald, and J. Nivre. 2009. Dependency Parsing. Morgan & Claypool, US.

Chris Dyer, Miguel Ballesteros, Wang Ling, Austin Matthews, and Noah A. Smith. 2015. Transition-based dependency parsing with stack long short-term memory. CoRR abs/1505.08075.


Thank you for your attention


Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123

  • Introduction
    • Overview of Dependency Parsing
    • Transition Based Dependency Parsing
      • Related Work
        • Linear Models and their Drawbacks
        • Neural Network Models
          • Model
            • Language Model
            • MLP Parser
            • Tree-stack LSTM Parser
              • Results
                • MLP vs Tree-stack LSTM
                • Morphological Feature Embeddings
                • Static vs Dynamic Oracle Training
                • Transfer Learning
                  • Conclusion
                  • Future Work amp Discussions
Page 93: Transition Based Dependency Parsing with Deep LearningTransition Based Dependency Parsing with Deep Learning Omer K rnap Ko˘c University okirnap@ku.edu.tr September 27, 2018 Omer

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 93 123

MLP Parser

MLP

Figure Initial model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 94 123

Only Action LSTM

LSTM LSTM

Figure Only action LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 95 123

Only β-LSTM

LSTM LSTM LSTM

LSTM LSTM MLP

Figure Only β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 96 123

Only σ-LSTM

LSTM LSTM

LSTM LSTM MLP

Figure Only σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang Code MLP Only Action Only-β Only-σhu szeged 6621 6687 6694 6703sv lines 7112 7205 7217 7245tr imst 5712 5687 5702 5712ar padt 6783 6667 6689 6692

cs cac 8389 8223 8313 8317

en ewt 7554 7543 7556 7567

Table Comparison between MLP and rdquoOnlyrdquo models

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 99 123

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang Code without t-RNN with t-RNNno nynorsklia (3k) 5178 5333ru taiga (11k) 5913 6055gl treegal (15k) 6976 7045hu szeged (20k) 6612 6818sv lines (49k) 7404 7546tr imst (50k) 5812 5875

ar padt (120k) 6804 6814

en ewt (204k) 7487 7577

cs cac (473k) 8289 8357

cs pdt (1M) 8117 81164

t-RNN provides comparative advantage for low-resourcelanguages

Omer Kırnap (Koc University) MSc Thesis September 27 2018 100 123

Ablation Analysis

Overall results of ablation analysis

Lang MLP Only A Only-β Only-σ wot-RNN allhu szeged 6621 6687 6694 6703 6612 6818sv lines 7112 7205 7217 7404 7217 7546tr imst 5712 5687 5702 5712 5812 5875ar padt 6783 6667 6689 6692 6804 6814cs cac 8389 8223 8313 8317 8289 8357en ewt 7554 7543 7556 7567 7487 7577

Tree-stack LSTM beats other model variations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of Ablation Experiments

t-RNNrsquos performance contribution increases when the training sizedecreases

σ-LSTM provides more useful information independent from datasetsize

Interconnecting modelrsquos component with t-RNN makes tree-stackLSTM more powerful for low-resource languages (ranked 10th of alland 2nd among transition based parsers)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What does Morphological Feature Embedding provide

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental SettingsWe divide Conll18 UD dataset 22 into 4 parts based on number oftraining tokens for each language to better understand our contributions

Languages having less than 20k tokens

Languages having more than 20k less than 50k tokens

Languages having more than 50k less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having less than 20k training tokens

Lang code Morph-Feats no Morph-Feats of tokensno nynorsklia 5113 5333 3583

ru taiga 5832 6055 10479

sme giella 5278 5339 16385

la perseus 4993 516 18184

ug udt 5278 5339 19262

sl sst 4672 4877 19473

hu szeged 6623 6818 20166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having tokens in between 50k and100k

Lang code Morph-Feats no Morph-Feats of tokenssv lines 7218 7481 48325

fr sequoia 8436 8217 50543

en gum 7644 7534 53686

ko gsd 7374 7254 56687

eu bdt 7455 7332 72974

nl lassymal 767 758 75134

gl ctg 7902 79018 79327

lv lvtb 7233 7224 80666

id gsd 7576 7397 97531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having more than 100k trainingtokens

Lang code Morph-Feats no Morph-Feats of tokensfa seraji 8118 8112 121064

bg btb 8453 8455 124336

en ewt 7577 75682 204585

ar padt 6802 6814 223881

de gsd 7159 7132 263804

ca ancora 8589 85874 417587

es ancora 8499 8478 444617

cs cac 8357 8363 472608

cs pdt 8143 8212 1173282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle transitions using gold movesDynamic oracle transitions using predicted moves

In both cases logp of gold moves maximized

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning1 Using very limited data to train LM for word and context vectors and

use them to train a parser from scratch2 Using Facebookrsquos word vectors to train a parser [Bojanowski et al

2017]3 Using my own word and context vectors trained with different

language but from the same language family4 Applying transfer learning with a pre-trained parser

Language (1) (2) (3) (4)af afribooms not provided 7546 7743 7812kk ktb 2019 2231 2196 2386bxr bdt 764 976 993 898

kmr mg 2012 2257 2278 2339

Table LAS values for strategies (1) (2) (3) and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the mostbeneficial

From scratch LM training does not bring useful word and contextvectors

Our word and context vectors are still more useful than Facebookrsquos[Bojanowski et al 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition Based Parser can only build projective trees 6

6Figure fromhttpstplingfiluuse sarakurser5LN455-2014lectures5LN455-F8pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123

Projective vs Non-projective

We compared our model with the best model for different projectivityratios

Language Projectiviy Best (LAS) Our (LAS)

grc perseus 907 7939 5503 (20)

eu bdt 9513 8422 7413 (17)

hu szeged 978 8266 6818 (14)

da ddt 9826 8628 7640 (17)

en gum 996 8505 7644 (15)

gl treegal 100 7425 7045 (10)

gl ctg 100 8212 7945 (14)

Table Our models performance gap decreases as the projectivity ratio increases

7

7From official results page and our projectivity tableOmer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusionWe introduced ldquoContext Word and Morph-featrdquo embeddings and showedtheir contribution in transition based dependency parsing

Our Tree-stack LSTM outperformed MLP by removing hand-craftedfeature engineering

Tree-stack LSTM performed better with low resource languages

When the training dataset size increases tree-stack LSTM losses itsadvantage

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization morphological taggingand dependency parsing performed better Some are also jointly trained alanguage model together with pre-trained embeddings

Attention Mechanism

Applying attention in between σ-LSTM states or β-LSTM orAction-LSTM may bring performance improvement

Morphological Features

Finding different way to represent morphological features

Dynamic Oracle vs Beam Training

Although I tried both of them I could not obtain performanceimprovement There may be convergence problems with our loss functionand another losses (CRF) may solve this problem

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap Erenay Dayanık and Deniz Yuret 2018 Tree-stackLSTM in Transition Based Dependency Parsing In Proceedings ofthe CoNLL 2018 Shared Task Multilingual Parsing from Raw Text toUniversal Dependencies

Omer Kırnap Berkay Furkan Onder and Deniz Yuret 2017 Parsingwith Context Embeddings In Proceedings of the CoNLL 2017 SharedTask Multilingual Parsing from Raw Text to Universal Dependencies

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann Carlos Gomez-Rodriguez and Giorgio Satta 2011Dynamic programming algorithms for transition-based dependencyparsers In Proceedings of the 49th Annual Meeting of theAssociation for Computational Linguistics Human LanguageTechnologies-Volume 1 Association for Computational Linguisticspages 673682

S Kbler R McDonald and J Nivre 2009 Dependency parsingMorgan amp Claypool US

Chris Dyer Miguel Ballesteros Wang Ling Austin Matthews andNoah A Smith 2015 Transition based dependency parsing withstack long-short term memory CoRR abs150508075

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123

  • Introduction
    • Overview of Dependency Parsing
    • Transition Based Dependency Parsing
      • Related Work
        • Linear Models and their Drawbacks
        • Neural Network Models
          • Model
            • Language Model
            • MLP Parser
            • Tree-stack LSTM Parser
              • Results
                • MLP vs Tree-stack LSTM
                • Morphological Feature Embeddings
                • Static vs Dynamic Oracle Training
                • Transfer Learning
                  • Conclusion
                  • Future Work amp Discussions
Page 94: Transition Based Dependency Parsing with Deep LearningTransition Based Dependency Parsing with Deep Learning Omer K rnap Ko˘c University okirnap@ku.edu.tr September 27, 2018 Omer

MLP Parser

MLP

Figure Initial model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 94 123

Only Action LSTM

LSTM LSTM

Figure Only action LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 95 123

Only β-LSTM

LSTM LSTM LSTM

LSTM LSTM MLP

Figure Only β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 96 123

Only σ-LSTM

LSTM LSTM

LSTM LSTM MLP

Figure Only σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang Code MLP Only Action Only-β Only-σhu szeged 6621 6687 6694 6703sv lines 7112 7205 7217 7245tr imst 5712 5687 5702 5712ar padt 6783 6667 6689 6692

cs cac 8389 8223 8313 8317

en ewt 7554 7543 7556 7567

Table Comparison between MLP and rdquoOnlyrdquo models

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 99 123

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang Code without t-RNN with t-RNNno nynorsklia (3k) 5178 5333ru taiga (11k) 5913 6055gl treegal (15k) 6976 7045hu szeged (20k) 6612 6818sv lines (49k) 7404 7546tr imst (50k) 5812 5875

ar padt (120k) 6804 6814

en ewt (204k) 7487 7577

cs cac (473k) 8289 8357

cs pdt (1M) 8117 81164

t-RNN provides comparative advantage for low-resourcelanguages

Omer Kırnap (Koc University) MSc Thesis September 27 2018 100 123

Ablation Analysis

Overall results of ablation analysis

Lang MLP Only A Only-β Only-σ wot-RNN allhu szeged 6621 6687 6694 6703 6612 6818sv lines 7112 7205 7217 7404 7217 7546tr imst 5712 5687 5702 5712 5812 5875ar padt 6783 6667 6689 6692 6804 6814cs cac 8389 8223 8313 8317 8289 8357en ewt 7554 7543 7556 7567 7487 7577

Tree-stack LSTM beats other model variations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of Ablation Experiments

t-RNNrsquos performance contribution increases when the training sizedecreases

σ-LSTM provides more useful information independent from datasetsize

Interconnecting modelrsquos component with t-RNN makes tree-stackLSTM more powerful for low-resource languages (ranked 10th of alland 2nd among transition based parsers)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What does Morphological Feature Embedding provide

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental SettingsWe divide Conll18 UD dataset 22 into 4 parts based on number oftraining tokens for each language to better understand our contributions

Languages having less than 20k tokens

Languages having more than 20k less than 50k tokens

Languages having more than 50k less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having less than 20k training tokens

Lang code Morph-Feats no Morph-Feats of tokensno nynorsklia 5113 5333 3583

ru taiga 5832 6055 10479

sme giella 5278 5339 16385

la perseus 4993 516 18184

ug udt 5278 5339 19262

sl sst 4672 4877 19473

hu szeged 6623 6818 20166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having tokens in between 50k and100k

Lang code Morph-Feats no Morph-Feats of tokenssv lines 7218 7481 48325

fr sequoia 8436 8217 50543

en gum 7644 7534 53686

ko gsd 7374 7254 56687

eu bdt 7455 7332 72974

nl lassymal 767 758 75134

gl ctg 7902 79018 79327

lv lvtb 7233 7224 80666

id gsd 7576 7397 97531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having more than 100k trainingtokens

Lang code Morph-Feats no Morph-Feats of tokensfa seraji 8118 8112 121064

bg btb 8453 8455 124336

en ewt 7577 75682 204585

ar padt 6802 6814 223881

de gsd 7159 7132 263804

ca ancora 8589 85874 417587

es ancora 8499 8478 444617

cs cac 8357 8363 472608

cs pdt 8143 8212 1173282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle transitions using gold movesDynamic oracle transitions using predicted moves

In both cases logp of gold moves maximized

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning1 Using very limited data to train LM for word and context vectors and

use them to train a parser from scratch2 Using Facebookrsquos word vectors to train a parser [Bojanowski et al

2017]3 Using my own word and context vectors trained with different

language but from the same language family4 Applying transfer learning with a pre-trained parser

Language (1) (2) (3) (4)af afribooms not provided 7546 7743 7812kk ktb 2019 2231 2196 2386bxr bdt 764 976 993 898

kmr mg 2012 2257 2278 2339

Table LAS values for strategies (1) (2) (3) and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments:

Applying transfer learning with a pre-trained parser is the most beneficial.

Training an LM from scratch does not yield useful word and context vectors.

Our word and context vectors are still more useful than Facebook's [Bojanowski et al., 2017].

Projectivity

A transition-based parser can only build projective trees.6

6 Figure from https://stp.lingfil.uu.se/~sara/kurser/5LN455-2014/lectures/5LN455-F8.pdf
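A tree is projective when no two dependency arcs cross; equivalently, every word between a head and its dependent is a descendant of that head. A minimal crossing-arcs check (illustrative, not the thesis code):

```python
def is_projective(heads):
    """True iff no two dependency arcs cross.

    heads[i] is the head of word i (1-indexed; 0 is the root).
    """
    # Each arc as an (left, right) position interval, root arcs included.
    arcs = [(min(h, d), max(h, d)) for d, h in enumerate(heads) if d > 0]
    for l1, r1 in arcs:
        for l2, r2 in arcs:
            if l1 < l2 < r1 < r2:    # the two arcs cross
                return False
    return True
```

For example, a tree with arcs over positions (1, 3) and (2, 4) is rejected as non-projective.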

Projective vs Non-projective

We compared our model with the best model at different projectivity ratios.

Language       Projectivity %  Best (LAS)  Our (LAS)
grc perseus    90.7            79.39       55.03 (20)
eu bdt         95.13           84.22       74.13 (17)
hu szeged      97.8            82.66       68.18 (14)
da ddt         98.26           86.28       76.40 (17)
en gum         99.6            85.05       76.44 (15)
gl treegal     100             74.25       70.45 (10)
gl ctg         100             82.12       79.45 (14)

Table: Our model's performance gap decreases as the projectivity ratio increases.7

7 From the official results page and our projectivity table.

Conclusions

Conclusion

In conclusion:

We introduced "Context, Word and Morph-feat" embeddings and showed their contribution to transition-based dependency parsing.

Our tree-stack LSTM outperformed the MLP while removing hand-crafted feature engineering.

Tree-stack LSTM performed better on low-resource languages.

As the training dataset size increases, tree-stack LSTM loses its advantage.

Future Research Directions

End-to-End Training

Systems that are jointly trained for tokenization, morphological tagging and dependency parsing performed better. Some also jointly train a language model together with pre-trained embeddings.

Attention Mechanism

Applying attention between σ-LSTM, β-LSTM or Action-LSTM states may bring performance improvements.

Morphological Features

Finding different ways to represent morphological features.

Dynamic Oracle vs Beam Training

Although I tried both of them, I could not obtain a performance improvement. There may be convergence problems with our loss function, and other losses (e.g., CRF) may solve this problem.

Publications

Ömer Kırnap, Erenay Dayanık and Deniz Yuret. 2018. Tree-stack LSTM in Transition Based Dependency Parsing. In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Ömer Kırnap, Berkay Furkan Önder and Deniz Yuret. 2017. Parsing with Context Embeddings. In Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

References

Marco Kuhlmann, Carlos Gómez-Rodríguez and Giorgio Satta. 2011. Dynamic programming algorithms for transition-based dependency parsers. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Volume 1. Association for Computational Linguistics, pages 673-682.

S. Kübler, R. McDonald and J. Nivre. 2009. Dependency Parsing. Morgan & Claypool, US.

Chris Dyer, Miguel Ballesteros, Wang Ling, Austin Matthews and Noah A. Smith. 2015. Transition-based dependency parsing with stack long short-term memory. CoRR abs/1505.08075.

Thank you for your attention

Questions?

  • Introduction
    • Overview of Dependency Parsing
    • Transition Based Dependency Parsing
      • Related Work
        • Linear Models and their Drawbacks
        • Neural Network Models
          • Model
            • Language Model
            • MLP Parser
            • Tree-stack LSTM Parser
              • Results
                • MLP vs Tree-stack LSTM
                • Morphological Feature Embeddings
                • Static vs Dynamic Oracle Training
                • Transfer Learning
                  • Conclusion
                  • Future Work amp Discussions
Page 95: Transition Based Dependency Parsing with Deep LearningTransition Based Dependency Parsing with Deep Learning Omer K rnap Ko˘c University okirnap@ku.edu.tr September 27, 2018 Omer

Only Action LSTM

LSTM LSTM

Figure Only action LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 95 123

Only β-LSTM

LSTM LSTM LSTM

LSTM LSTM MLP

Figure Only β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 96 123

Only σ-LSTM

LSTM LSTM

LSTM LSTM MLP

Figure Only σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang Code MLP Only Action Only-β Only-σhu szeged 6621 6687 6694 6703sv lines 7112 7205 7217 7245tr imst 5712 5687 5702 5712ar padt 6783 6667 6689 6692

cs cac 8389 8223 8313 8317

en ewt 7554 7543 7556 7567

Table Comparison between MLP and rdquoOnlyrdquo models

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 99 123

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang Code without t-RNN with t-RNNno nynorsklia (3k) 5178 5333ru taiga (11k) 5913 6055gl treegal (15k) 6976 7045hu szeged (20k) 6612 6818sv lines (49k) 7404 7546tr imst (50k) 5812 5875

ar padt (120k) 6804 6814

en ewt (204k) 7487 7577

cs cac (473k) 8289 8357

cs pdt (1M) 8117 81164

t-RNN provides comparative advantage for low-resourcelanguages

Omer Kırnap (Koc University) MSc Thesis September 27 2018 100 123

Ablation Analysis

Overall results of ablation analysis

Lang MLP Only A Only-β Only-σ wot-RNN allhu szeged 6621 6687 6694 6703 6612 6818sv lines 7112 7205 7217 7404 7217 7546tr imst 5712 5687 5702 5712 5812 5875ar padt 6783 6667 6689 6692 6804 6814cs cac 8389 8223 8313 8317 8289 8357en ewt 7554 7543 7556 7567 7487 7577

Tree-stack LSTM beats other model variations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of Ablation Experiments

t-RNNrsquos performance contribution increases when the training sizedecreases

σ-LSTM provides more useful information independent from datasetsize

Interconnecting modelrsquos component with t-RNN makes tree-stackLSTM more powerful for low-resource languages (ranked 10th of alland 2nd among transition based parsers)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What does Morphological Feature Embedding provide

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental SettingsWe divide Conll18 UD dataset 22 into 4 parts based on number oftraining tokens for each language to better understand our contributions

Languages having less than 20k tokens

Languages having more than 20k less than 50k tokens

Languages having more than 50k less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having less than 20k training tokens

Lang code Morph-Feats no Morph-Feats of tokensno nynorsklia 5113 5333 3583

ru taiga 5832 6055 10479

sme giella 5278 5339 16385

la perseus 4993 516 18184

ug udt 5278 5339 19262

sl sst 4672 4877 19473

hu szeged 6623 6818 20166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having tokens in between 50k and100k

Lang code Morph-Feats no Morph-Feats of tokenssv lines 7218 7481 48325

fr sequoia 8436 8217 50543

en gum 7644 7534 53686

ko gsd 7374 7254 56687

eu bdt 7455 7332 72974

nl lassymal 767 758 75134

gl ctg 7902 79018 79327

lv lvtb 7233 7224 80666

id gsd 7576 7397 97531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having more than 100k trainingtokens

Lang code Morph-Feats no Morph-Feats of tokensfa seraji 8118 8112 121064

bg btb 8453 8455 124336

en ewt 7577 75682 204585

ar padt 6802 6814 223881

de gsd 7159 7132 263804

ca ancora 8589 85874 417587

es ancora 8499 8478 444617

cs cac 8357 8363 472608

cs pdt 8143 8212 1173282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle transitions using gold movesDynamic oracle transitions using predicted moves

In both cases logp of gold moves maximized

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning1 Using very limited data to train LM for word and context vectors and

use them to train a parser from scratch2 Using Facebookrsquos word vectors to train a parser [Bojanowski et al

2017]3 Using my own word and context vectors trained with different

language but from the same language family4 Applying transfer learning with a pre-trained parser

Language (1) (2) (3) (4)af afribooms not provided 7546 7743 7812kk ktb 2019 2231 2196 2386bxr bdt 764 976 993 898

kmr mg 2012 2257 2278 2339

Table LAS values for strategies (1) (2) (3) and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the mostbeneficial

From scratch LM training does not bring useful word and contextvectors

Our word and context vectors are still more useful than Facebookrsquos[Bojanowski et al 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition Based Parser can only build projective trees 6

6Figure fromhttpstplingfiluuse sarakurser5LN455-2014lectures5LN455-F8pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123

Projective vs Non-projective

We compared our model with the best model for different projectivityratios

Language Projectiviy Best (LAS) Our (LAS)

grc perseus 907 7939 5503 (20)

eu bdt 9513 8422 7413 (17)

hu szeged 978 8266 6818 (14)

da ddt 9826 8628 7640 (17)

en gum 996 8505 7644 (15)

gl treegal 100 7425 7045 (10)

gl ctg 100 8212 7945 (14)

Table Our models performance gap decreases as the projectivity ratio increases

7

7From official results page and our projectivity tableOmer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusionWe introduced ldquoContext Word and Morph-featrdquo embeddings and showedtheir contribution in transition based dependency parsing

Our Tree-stack LSTM outperformed MLP by removing hand-craftedfeature engineering

Tree-stack LSTM performed better with low resource languages

When the training dataset size increases tree-stack LSTM losses itsadvantage

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization morphological taggingand dependency parsing performed better Some are also jointly trained alanguage model together with pre-trained embeddings

Attention Mechanism

Applying attention in between σ-LSTM states or β-LSTM orAction-LSTM may bring performance improvement

Morphological Features

Finding different way to represent morphological features

Dynamic Oracle vs Beam Training

Although I tried both of them I could not obtain performanceimprovement There may be convergence problems with our loss functionand another losses (CRF) may solve this problem

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap Erenay Dayanık and Deniz Yuret 2018 Tree-stackLSTM in Transition Based Dependency Parsing In Proceedings ofthe CoNLL 2018 Shared Task Multilingual Parsing from Raw Text toUniversal Dependencies

Omer Kırnap Berkay Furkan Onder and Deniz Yuret 2017 Parsingwith Context Embeddings In Proceedings of the CoNLL 2017 SharedTask Multilingual Parsing from Raw Text to Universal Dependencies

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann Carlos Gomez-Rodriguez and Giorgio Satta 2011Dynamic programming algorithms for transition-based dependencyparsers In Proceedings of the 49th Annual Meeting of theAssociation for Computational Linguistics Human LanguageTechnologies-Volume 1 Association for Computational Linguisticspages 673682

S Kbler R McDonald and J Nivre 2009 Dependency parsingMorgan amp Claypool US

Chris Dyer Miguel Ballesteros Wang Ling Austin Matthews andNoah A Smith 2015 Transition based dependency parsing withstack long-short term memory CoRR abs150508075

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123

  • Introduction
    • Overview of Dependency Parsing
    • Transition Based Dependency Parsing
      • Related Work
        • Linear Models and their Drawbacks
        • Neural Network Models
          • Model
            • Language Model
            • MLP Parser
            • Tree-stack LSTM Parser
              • Results
                • MLP vs Tree-stack LSTM
                • Morphological Feature Embeddings
                • Static vs Dynamic Oracle Training
                • Transfer Learning
                  • Conclusion
                  • Future Work amp Discussions
Page 96: Transition Based Dependency Parsing with Deep LearningTransition Based Dependency Parsing with Deep Learning Omer K rnap Ko˘c University okirnap@ku.edu.tr September 27, 2018 Omer

Only β-LSTM

LSTM LSTM LSTM

LSTM LSTM MLP

Figure Only β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 96 123

Only σ-LSTM

LSTM LSTM

LSTM LSTM MLP

Figure Only σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang Code MLP Only Action Only-β Only-σhu szeged 6621 6687 6694 6703sv lines 7112 7205 7217 7245tr imst 5712 5687 5702 5712ar padt 6783 6667 6689 6692

cs cac 8389 8223 8313 8317

en ewt 7554 7543 7556 7567

Table Comparison between MLP and rdquoOnlyrdquo models

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 99 123

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang Code without t-RNN with t-RNNno nynorsklia (3k) 5178 5333ru taiga (11k) 5913 6055gl treegal (15k) 6976 7045hu szeged (20k) 6612 6818sv lines (49k) 7404 7546tr imst (50k) 5812 5875

ar padt (120k) 6804 6814

en ewt (204k) 7487 7577

cs cac (473k) 8289 8357

cs pdt (1M) 8117 81164

t-RNN provides comparative advantage for low-resourcelanguages

Omer Kırnap (Koc University) MSc Thesis September 27 2018 100 123

Ablation Analysis

Overall results of ablation analysis

Lang MLP Only A Only-β Only-σ wot-RNN allhu szeged 6621 6687 6694 6703 6612 6818sv lines 7112 7205 7217 7404 7217 7546tr imst 5712 5687 5702 5712 5812 5875ar padt 6783 6667 6689 6692 6804 6814cs cac 8389 8223 8313 8317 8289 8357en ewt 7554 7543 7556 7567 7487 7577

Tree-stack LSTM beats other model variations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of Ablation Experiments

t-RNNrsquos performance contribution increases when the training sizedecreases

σ-LSTM provides more useful information independent from datasetsize

Interconnecting modelrsquos component with t-RNN makes tree-stackLSTM more powerful for low-resource languages (ranked 10th of alland 2nd among transition based parsers)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What does Morphological Feature Embedding provide

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental SettingsWe divide Conll18 UD dataset 22 into 4 parts based on number oftraining tokens for each language to better understand our contributions

Languages having less than 20k tokens

Languages having more than 20k less than 50k tokens

Languages having more than 50k less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having less than 20k training tokens

Lang code Morph-Feats no Morph-Feats of tokensno nynorsklia 5113 5333 3583

ru taiga 5832 6055 10479

sme giella 5278 5339 16385

la perseus 4993 516 18184

ug udt 5278 5339 19262

sl sst 4672 4877 19473

hu szeged 6623 6818 20166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having tokens in between 50k and100k

Lang code Morph-Feats no Morph-Feats of tokenssv lines 7218 7481 48325

fr sequoia 8436 8217 50543

en gum 7644 7534 53686

ko gsd 7374 7254 56687

eu bdt 7455 7332 72974

nl lassymal 767 758 75134

gl ctg 7902 79018 79327

lv lvtb 7233 7224 80666

id gsd 7576 7397 97531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having more than 100k trainingtokens

Lang code Morph-Feats no Morph-Feats of tokensfa seraji 8118 8112 121064

bg btb 8453 8455 124336

en ewt 7577 75682 204585

ar padt 6802 6814 223881

de gsd 7159 7132 263804

ca ancora 8589 85874 417587

es ancora 8499 8478 444617

cs cac 8357 8363 472608

cs pdt 8143 8212 1173282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle transitions using gold movesDynamic oracle transitions using predicted moves

In both cases logp of gold moves maximized

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning1 Using very limited data to train LM for word and context vectors and

use them to train a parser from scratch2 Using Facebookrsquos word vectors to train a parser [Bojanowski et al

2017]3 Using my own word and context vectors trained with different

language but from the same language family4 Applying transfer learning with a pre-trained parser

Language (1) (2) (3) (4)af afribooms not provided 7546 7743 7812kk ktb 2019 2231 2196 2386bxr bdt 764 976 993 898

kmr mg 2012 2257 2278 2339

Table LAS values for strategies (1) (2) (3) and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the mostbeneficial

From scratch LM training does not bring useful word and contextvectors

Our word and context vectors are still more useful than Facebookrsquos[Bojanowski et al 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition Based Parser can only build projective trees 6

6Figure fromhttpstplingfiluuse sarakurser5LN455-2014lectures5LN455-F8pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123

Projective vs Non-projective

We compared our model with the best model for different projectivityratios

Language Projectiviy Best (LAS) Our (LAS)

grc perseus 907 7939 5503 (20)

eu bdt 9513 8422 7413 (17)

hu szeged 978 8266 6818 (14)

da ddt 9826 8628 7640 (17)

en gum 996 8505 7644 (15)

gl treegal 100 7425 7045 (10)

gl ctg 100 8212 7945 (14)

Table Our models performance gap decreases as the projectivity ratio increases

7

7From official results page and our projectivity tableOmer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusionWe introduced ldquoContext Word and Morph-featrdquo embeddings and showedtheir contribution in transition based dependency parsing

Our Tree-stack LSTM outperformed MLP by removing hand-craftedfeature engineering

Tree-stack LSTM performed better with low resource languages

When the training dataset size increases tree-stack LSTM losses itsadvantage

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization morphological taggingand dependency parsing performed better Some are also jointly trained alanguage model together with pre-trained embeddings

Attention Mechanism

Applying attention in between σ-LSTM states or β-LSTM orAction-LSTM may bring performance improvement

Morphological Features

Finding different way to represent morphological features

Dynamic Oracle vs Beam Training

Although I tried both of them I could not obtain performanceimprovement There may be convergence problems with our loss functionand another losses (CRF) may solve this problem

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap Erenay Dayanık and Deniz Yuret 2018 Tree-stackLSTM in Transition Based Dependency Parsing In Proceedings ofthe CoNLL 2018 Shared Task Multilingual Parsing from Raw Text toUniversal Dependencies

Omer Kırnap Berkay Furkan Onder and Deniz Yuret 2017 Parsingwith Context Embeddings In Proceedings of the CoNLL 2017 SharedTask Multilingual Parsing from Raw Text to Universal Dependencies

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann Carlos Gomez-Rodriguez and Giorgio Satta 2011Dynamic programming algorithms for transition-based dependencyparsers In Proceedings of the 49th Annual Meeting of theAssociation for Computational Linguistics Human LanguageTechnologies-Volume 1 Association for Computational Linguisticspages 673682

S Kbler R McDonald and J Nivre 2009 Dependency parsingMorgan amp Claypool US

Chris Dyer Miguel Ballesteros Wang Ling Austin Matthews andNoah A Smith 2015 Transition based dependency parsing withstack long-short term memory CoRR abs150508075

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123

  • Introduction
    • Overview of Dependency Parsing
    • Transition Based Dependency Parsing
      • Related Work
        • Linear Models and their Drawbacks
        • Neural Network Models
          • Model
            • Language Model
            • MLP Parser
            • Tree-stack LSTM Parser
              • Results
                • MLP vs Tree-stack LSTM
                • Morphological Feature Embeddings
                • Static vs Dynamic Oracle Training
                • Transfer Learning
                  • Conclusion
                  • Future Work amp Discussions
Page 97: Transition Based Dependency Parsing with Deep LearningTransition Based Dependency Parsing with Deep Learning Omer K rnap Ko˘c University okirnap@ku.edu.tr September 27, 2018 Omer

Only σ-LSTM

LSTM LSTM

LSTM LSTM MLP

Figure Only σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang Code MLP Only Action Only-β Only-σhu szeged 6621 6687 6694 6703sv lines 7112 7205 7217 7245tr imst 5712 5687 5702 5712ar padt 6783 6667 6689 6692

cs cac 8389 8223 8313 8317

en ewt 7554 7543 7556 7567

Table Comparison between MLP and rdquoOnlyrdquo models

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 99 123

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang Code without t-RNN with t-RNNno nynorsklia (3k) 5178 5333ru taiga (11k) 5913 6055gl treegal (15k) 6976 7045hu szeged (20k) 6612 6818sv lines (49k) 7404 7546tr imst (50k) 5812 5875

ar padt (120k) 6804 6814

en ewt (204k) 7487 7577

cs cac (473k) 8289 8357

cs pdt (1M) 8117 81164

t-RNN provides comparative advantage for low-resourcelanguages

Omer Kırnap (Koc University) MSc Thesis September 27 2018 100 123

Ablation Analysis

Overall results of ablation analysis

Lang MLP Only A Only-β Only-σ wot-RNN allhu szeged 6621 6687 6694 6703 6612 6818sv lines 7112 7205 7217 7404 7217 7546tr imst 5712 5687 5702 5712 5812 5875ar padt 6783 6667 6689 6692 6804 6814cs cac 8389 8223 8313 8317 8289 8357en ewt 7554 7543 7556 7567 7487 7577

Tree-stack LSTM beats other model variations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of Ablation Experiments

t-RNNrsquos performance contribution increases when the training sizedecreases

σ-LSTM provides more useful information independent from datasetsize

Interconnecting modelrsquos component with t-RNN makes tree-stackLSTM more powerful for low-resource languages (ranked 10th of alland 2nd among transition based parsers)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What does Morphological Feature Embedding provide

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental SettingsWe divide Conll18 UD dataset 22 into 4 parts based on number oftraining tokens for each language to better understand our contributions

Languages having less than 20k tokens

Languages having more than 20k less than 50k tokens

Languages having more than 50k less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having less than 20k training tokens

Lang code Morph-Feats no Morph-Feats of tokensno nynorsklia 5113 5333 3583

ru taiga 5832 6055 10479

sme giella 5278 5339 16385

la perseus 4993 516 18184

ug udt 5278 5339 19262

sl sst 4672 4877 19473

hu szeged 6623 6818 20166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having tokens in between 50k and100k

Lang code Morph-Feats no Morph-Feats of tokenssv lines 7218 7481 48325

fr sequoia 8436 8217 50543

en gum 7644 7534 53686

ko gsd 7374 7254 56687

eu bdt 7455 7332 72974

nl lassymal 767 758 75134

gl ctg 7902 79018 79327

lv lvtb 7233 7224 80666

id gsd 7576 7397 97531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having more than 100k trainingtokens

Lang code Morph-Feats no Morph-Feats of tokensfa seraji 8118 8112 121064

bg btb 8453 8455 124336

en ewt 7577 75682 204585

ar padt 6802 6814 223881

de gsd 7159 7132 263804

ca ancora 8589 85874 417587

es ancora 8499 8478 444617

cs cac 8357 8363 472608

cs pdt 8143 8212 1173282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle transitions using gold movesDynamic oracle transitions using predicted moves

In both cases logp of gold moves maximized

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning1 Using very limited data to train LM for word and context vectors and

use them to train a parser from scratch2 Using Facebookrsquos word vectors to train a parser [Bojanowski et al

2017]3 Using my own word and context vectors trained with different

language but from the same language family4 Applying transfer learning with a pre-trained parser

Language (1) (2) (3) (4)af afribooms not provided 7546 7743 7812kk ktb 2019 2231 2196 2386bxr bdt 764 976 993 898

kmr mg 2012 2257 2278 2339

Table LAS values for strategies (1) (2) (3) and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the mostbeneficial

From scratch LM training does not bring useful word and contextvectors

Our word and context vectors are still more useful than Facebookrsquos[Bojanowski et al 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition Based Parser can only build projective trees 6

6Figure fromhttpstplingfiluuse sarakurser5LN455-2014lectures5LN455-F8pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123

Projective vs Non-projective

We compared our model with the best model for different projectivityratios

Language       Projectivity (%)   Best (LAS)   Our (LAS)
grc_perseus    90.7               79.39        55.03 (20)
eu_bdt         95.13              84.22        74.13 (17)
hu_szeged      97.8               82.66        68.18 (14)
da_ddt         98.26              86.28        76.40 (17)
en_gum         99.6               85.05        76.44 (15)
gl_treegal     100                74.25        70.45 (10)
gl_ctg         100                82.12        79.45 (14)

Table: Our model's performance gap decreases as the projectivity ratio increases.7

7 From the official results page and our projectivity table.

Conclusions


Conclusion

In conclusion:

We introduced "Context, Word and Morph-feat" embeddings and showed their contribution to transition-based dependency parsing.

Our Tree-stack LSTM outperformed the MLP while removing hand-crafted feature engineering.

The Tree-stack LSTM performed better on low-resource languages.

As the training dataset size increases, the tree-stack LSTM loses its advantage.


Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization, morphological tagging, and dependency parsing performed better. Some also jointly train a language model together with pre-trained embeddings.

Attention Mechanism

Applying attention between σ-LSTM states, or over the β-LSTM or Action-LSTM, may bring a performance improvement.

Morphological Features

Finding a different way to represent morphological features.

Dynamic Oracle vs Beam Training

Although I tried both of them, I could not obtain a performance improvement. There may be convergence problems with our loss function, and other losses (e.g., CRF) may solve this problem.


Publications

Ömer Kırnap, Erenay Dayanık, and Deniz Yuret. 2018. Tree-stack LSTM in Transition Based Dependency Parsing. In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Ömer Kırnap, Berkay Furkan Önder, and Deniz Yuret. 2017. Parsing with Context Embeddings. In Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.


References

Marco Kuhlmann, Carlos Gómez-Rodríguez, and Giorgio Satta. 2011. Dynamic programming algorithms for transition-based dependency parsers. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Volume 1, pages 673-682. Association for Computational Linguistics.

Sandra Kübler, Ryan McDonald, and Joakim Nivre. 2009. Dependency Parsing. Morgan & Claypool, US.

Chris Dyer, Miguel Ballesteros, Wang Ling, Austin Matthews, and Noah A. Smith. 2015. Transition-based dependency parsing with stack long short-term memory. CoRR abs/1505.08075.

Piotr Bojanowski, Edouard Grave, Armand Joulin, and Tomas Mikolov. 2017. Enriching word vectors with subword information. Transactions of the Association for Computational Linguistics, 5:135-146.


Thank you for your attention


Questions?


Page 101: Transition Based Dependency Parsing with Deep LearningTransition Based Dependency Parsing with Deep Learning Omer K rnap Ko˘c University okirnap@ku.edu.tr September 27, 2018 Omer

Ablation Analysis

Overall results of ablation analysis

Lang        MLP     Only-A  Only-β  Only-σ  w/o t-RNN  all
hu szeged   66.21   66.87   66.94   67.03   66.12      68.18
sv lines    71.12   72.05   72.17   74.04   72.17      75.46
tr imst     57.12   56.87   57.02   57.12   58.12      58.75
ar padt     67.83   66.67   66.89   66.92   68.04      68.14
cs cac      83.89   82.23   83.13   83.17   82.89      83.57
en ewt      75.54   75.43   75.56   75.67   74.87      75.77

Tree-stack LSTM beats other model variations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of Ablation Experiments

t-RNN's performance contribution increases when the training size decreases

σ-LSTM provides more useful information independent of dataset size

Interconnecting the model's components with t-RNN makes tree-stack LSTM more powerful for low-resource languages (ranked 10th overall and 2nd among transition-based parsers)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What does Morphological Feature Embedding provide?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental Settings
We divide the CoNLL 2018 UD dataset (v2.2) into 4 parts based on the number of training tokens for each language, to better understand our contributions:

Languages having less than 20k tokens

Languages having more than 20k, less than 50k tokens

Languages having more than 50k, less than 100k tokens

Languages having 100k tokens or more
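The partition above is a simple thresholding on training-set size; a minimal sketch (the bucket labels are my own, boundaries as stated on the slide):

```python
def size_bucket(n_tokens):
    """Assign a treebank to one of the four training-size buckets
    used in the experiments (boundaries from the slide)."""
    if n_tokens < 20_000:
        return "<20k"
    elif n_tokens < 50_000:
        return "20k-50k"
    elif n_tokens < 100_000:
        return "50k-100k"
    else:
        return ">=100k"

# e.g. hu_szeged with 20,166 training tokens falls in the 20k-50k bucket
```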

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having less than 20k training tokens

Lang code      Morph-Feats  no Morph-Feats  # of tokens
no nynorsklia  51.13        53.33           3,583
ru taiga       58.32        60.55           10,479
sme giella     52.78        53.39           16,385
la perseus     49.93        51.6            18,184
ug udt         52.78        53.39           19,262
sl sst         46.72        48.77           19,473
hu szeged      66.23        68.18           20,166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having between 50k and 100k tokens

Lang code      Morph-Feats  no Morph-Feats  # of tokens
sv lines       72.18        74.81           48,325
fr sequoia     84.36        82.17           50,543
en gum         76.44        75.34           53,686
ko gsd         73.74        72.54           56,687
eu bdt         74.55        73.32           72,974
nl lassysmall  76.7         75.8            75,134
gl ctg         79.02        79.018          79,327
lv lvtb        72.33        72.24           80,666
id gsd         75.76        73.97           97,531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having more than 100k training tokens

Lang code   Morph-Feats  no Morph-Feats  # of tokens
fa seraji   81.18        81.12           121,064
bg btb      84.53        84.55           124,336
en ewt      75.77        75.682          204,585
ar padt     68.02        68.14           223,881
de gsd      71.59        71.32           263,804
ca ancora   85.89        85.874          417,587
es ancora   84.99        84.78           444,617
cs cac      83.57        83.63           472,608
cs pdt      81.43        82.12           1,173,282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle: transitions follow gold moves
Dynamic oracle: transitions follow predicted moves

In both cases, the log-probability of gold moves is maximized
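The two regimes differ only in which move is taken after the loss is computed. A toy sketch of the difference, with a stand-in state and scorer (all names here are hypothetical illustrations, not the real parser, which uses LSTM-based scores over arc-standard transitions):

```python
import random

MOVES = ["shift", "left-arc", "right-arc"]

class ToyState:
    """Stand-in for a parser configuration: just tracks how many
    transitions have been taken out of a fixed gold sequence."""
    def __init__(self, golds, step=0):
        self.golds, self.step = list(golds), step
    def is_final(self):
        return self.step == len(self.golds)
    def apply(self, move):
        return ToyState(self.golds, self.step + 1)

def score_moves(state):
    # Stand-in for the network: fixed log-probabilities per move.
    return {"shift": -0.5, "left-arc": -1.5, "right-arc": -2.0}

def train_sentence(golds, dynamic=False, explore=0.1, seed=0):
    """Accumulate -log p(gold move) along the visited path.
    Static oracle: always follow the gold move.
    Dynamic oracle: with probability `explore`, follow the model's
    own best move instead (error exploration), while supervision
    still comes from the best move at the reached state."""
    rng = random.Random(seed)
    state, loss = ToyState(golds), 0.0
    while not state.is_final():
        gold = state.golds[state.step]   # oracle's best move here
        scores = score_moves(state)
        loss += -scores[gold]            # maximize log p of gold move
        if dynamic and rng.random() < explore:
            move = max(MOVES, key=scores.get)  # model's prediction
        else:
            move = gold
        state = state.apply(move)
    return loss
```

In both branches the same negative log-likelihood of the gold move is accumulated; only the transition actually executed differs.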

[Figure: t-RNN — LSTM states for the head word, the dependent word, and the dependency relation are concatenated and fed to an MLP that outputs the action A.]
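The concat-then-MLP composition shown in the figure can be sketched in a few lines of numpy (dimensions, weights, and the number of actions are made up for illustration; the real model's encoders are trained LSTMs):

```python
import numpy as np

rng = np.random.default_rng(0)
H, A = 8, 3  # hidden size per encoder and number of actions (assumed)

# Stand-ins for the LSTM summary vectors of the head word, the
# dependent word, and the dependency-relation embedding.
head_vec, dep_vec, rel_vec = rng.normal(size=(3, H))

# Concat -> MLP -> softmax over parser actions, as in the figure.
x = np.concatenate([head_vec, dep_vec, rel_vec])      # shape (3H,)
W1, b1 = rng.normal(size=(16, 3 * H)), np.zeros(16)
W2, b2 = rng.normal(size=(A, 16)), np.zeros(A)
hidden = np.tanh(W1 @ x + b1)
logits = W2 @ hidden + b2
action_probs = np.exp(logits - logits.max())
action_probs /= action_probs.sum()                    # distribution over moves
```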

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning:
1. Using very limited data to train an LM for word and context vectors, and using them to train a parser from scratch
2. Using Facebook's word vectors to train a parser [Bojanowski et al. 2017]
3. Using my own word and context vectors trained on a different language from the same language family
4. Applying transfer learning with a pre-trained parser

Language      (1)           (2)    (3)    (4)
af afribooms  not provided  75.46  77.43  78.12
kk ktb        20.19         22.31  21.96  23.86
bxr bdt       7.64          9.76   9.93   8.98
kmr mg        20.12         22.57  22.78  23.39

Table LAS values for strategies (1), (2), (3) and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the most beneficial

Training an LM from scratch does not yield useful word and context vectors

Our word and context vectors are still more useful than Facebook's [Bojanowski et al. 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

A transition-based parser can only build projective trees 6

6 Figure from http://stp.lingfil.uu.se/~sara/kurser/5LN455-2014/lectures/5LN455-F8.pdf
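Projectivity can be checked directly from the head indices: a tree is non-projective exactly when two dependency arcs cross. A small sketch (CoNLL-style heads, 0 = root):

```python
def is_projective(heads):
    """heads[i] is the head (0 = root) of token i+1, CoNLL-style.
    Returns False iff some pair of arcs crosses."""
    arcs = [(min(h, d), max(h, d)) for d, h in enumerate(heads, start=1)]
    for l1, r1 in arcs:
        for l2, r2 in arcs:
            # Arcs cross when exactly one endpoint of the second
            # lies strictly inside the span of the first.
            if l1 < l2 < r1 < r2:
                return False
    return True
```

Applying this check over a treebank gives the per-language projectivity ratios compared in the next slide.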

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123

Projective vs Non-projective

We compared our model with the best model for different projectivity ratios

Language     Projectivity  Best (LAS)  Ours (LAS)
grc perseus  90.7          79.39       55.03 (20)
eu bdt       95.13         84.22       74.13 (17)
hu szeged    97.8          82.66       68.18 (14)
da ddt       98.26         86.28       76.40 (17)
en gum       99.6          85.05       76.44 (15)
gl treegal   100           74.25       70.45 (10)
gl ctg       100           82.12       79.45 (14)

Table Our model's performance gap decreases as the projectivity ratio increases

7

7 From the official results page and our projectivity table
Omer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusion:
We introduced "Context, Word and Morph-feat" embeddings and showed their contribution to transition-based dependency parsing

Our Tree-stack LSTM outperformed the MLP while removing hand-crafted feature engineering

Tree-stack LSTM performed better with low resource languages

When the training dataset size increases, tree-stack LSTM loses its advantage

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization, morphological tagging and dependency parsing performed better. Some also jointly train a language model together with pre-trained embeddings.

Attention Mechanism

Applying attention between σ-LSTM, β-LSTM, or Action-LSTM states may bring performance improvements

Morphological Features

Finding different ways to represent morphological features

Dynamic Oracle vs Beam Training

Although I tried both of them, I could not obtain a performance improvement. There may be convergence problems with our loss function, and other losses (e.g., CRF) may solve this problem.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap, Erenay Dayanık and Deniz Yuret. 2018. Tree-stack LSTM in Transition Based Dependency Parsing. In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap, Berkay Furkan Önder and Deniz Yuret. 2017. Parsing with Context Embeddings. In Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann, Carlos Gómez-Rodríguez and Giorgio Satta. 2011. Dynamic programming algorithms for transition-based dependency parsers. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies - Volume 1. Association for Computational Linguistics, pages 673-682.

S. Kübler, R. McDonald and J. Nivre. 2009. Dependency Parsing. Morgan & Claypool, US.

Chris Dyer, Miguel Ballesteros, Wang Ling, Austin Matthews and Noah A. Smith. 2015. Transition-based dependency parsing with stack long short-term memory. CoRR abs/1505.08075.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123

  • Introduction
    • Overview of Dependency Parsing
    • Transition Based Dependency Parsing
      • Related Work
        • Linear Models and their Drawbacks
        • Neural Network Models
          • Model
            • Language Model
            • MLP Parser
            • Tree-stack LSTM Parser
              • Results
                • MLP vs Tree-stack LSTM
                • Morphological Feature Embeddings
                • Static vs Dynamic Oracle Training
                • Transfer Learning
                  • Conclusion
                  • Future Work amp Discussions
Page 102: Transition Based Dependency Parsing with Deep LearningTransition Based Dependency Parsing with Deep Learning Omer K rnap Ko˘c University okirnap@ku.edu.tr September 27, 2018 Omer

Ablation Analysis

Conclusions of Ablation Experiments

t-RNNrsquos performance contribution increases when the training sizedecreases

σ-LSTM provides more useful information independent from datasetsize

Interconnecting modelrsquos component with t-RNN makes tree-stackLSTM more powerful for low-resource languages (ranked 10th of alland 2nd among transition based parsers)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What does Morphological Feature Embedding provide

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental SettingsWe divide Conll18 UD dataset 22 into 4 parts based on number oftraining tokens for each language to better understand our contributions

Languages having less than 20k tokens

Languages having more than 20k less than 50k tokens

Languages having more than 50k less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having less than 20k training tokens

Lang code Morph-Feats no Morph-Feats of tokensno nynorsklia 5113 5333 3583

ru taiga 5832 6055 10479

sme giella 5278 5339 16385

la perseus 4993 516 18184

ug udt 5278 5339 19262

sl sst 4672 4877 19473

hu szeged 6623 6818 20166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having tokens in between 50k and100k

Lang code Morph-Feats no Morph-Feats of tokenssv lines 7218 7481 48325

fr sequoia 8436 8217 50543

en gum 7644 7534 53686

ko gsd 7374 7254 56687

eu bdt 7455 7332 72974

nl lassymal 767 758 75134

gl ctg 7902 79018 79327

lv lvtb 7233 7224 80666

id gsd 7576 7397 97531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having more than 100k trainingtokens

Lang code Morph-Feats no Morph-Feats of tokensfa seraji 8118 8112 121064

bg btb 8453 8455 124336

en ewt 7577 75682 204585

ar padt 6802 6814 223881

de gsd 7159 7132 263804

ca ancora 8589 85874 417587

es ancora 8499 8478 444617

cs cac 8357 8363 472608

cs pdt 8143 8212 1173282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle transitions using gold movesDynamic oracle transitions using predicted moves

In both cases logp of gold moves maximized

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning1 Using very limited data to train LM for word and context vectors and

use them to train a parser from scratch2 Using Facebookrsquos word vectors to train a parser [Bojanowski et al

2017]3 Using my own word and context vectors trained with different

language but from the same language family4 Applying transfer learning with a pre-trained parser

Language (1) (2) (3) (4)af afribooms not provided 7546 7743 7812kk ktb 2019 2231 2196 2386bxr bdt 764 976 993 898

kmr mg 2012 2257 2278 2339

Table LAS values for strategies (1) (2) (3) and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the mostbeneficial

From scratch LM training does not bring useful word and contextvectors

Our word and context vectors are still more useful than Facebookrsquos[Bojanowski et al 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition Based Parser can only build projective trees 6

6Figure fromhttpstplingfiluuse sarakurser5LN455-2014lectures5LN455-F8pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123

Projective vs Non-projective

We compared our model with the best model for different projectivityratios

Language Projectiviy Best (LAS) Our (LAS)

grc perseus 907 7939 5503 (20)

eu bdt 9513 8422 7413 (17)

hu szeged 978 8266 6818 (14)

da ddt 9826 8628 7640 (17)

en gum 996 8505 7644 (15)

gl treegal 100 7425 7045 (10)

gl ctg 100 8212 7945 (14)

Table Our models performance gap decreases as the projectivity ratio increases

7

7From official results page and our projectivity tableOmer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusionWe introduced ldquoContext Word and Morph-featrdquo embeddings and showedtheir contribution in transition based dependency parsing

Our Tree-stack LSTM outperformed MLP by removing hand-craftedfeature engineering

Tree-stack LSTM performed better with low resource languages

When the training dataset size increases tree-stack LSTM losses itsadvantage

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization morphological taggingand dependency parsing performed better Some are also jointly trained alanguage model together with pre-trained embeddings

Attention Mechanism

Applying attention in between σ-LSTM states or β-LSTM orAction-LSTM may bring performance improvement

Morphological Features

Finding different way to represent morphological features

Dynamic Oracle vs Beam Training

Although I tried both of them I could not obtain performanceimprovement There may be convergence problems with our loss functionand another losses (CRF) may solve this problem

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap Erenay Dayanık and Deniz Yuret 2018 Tree-stackLSTM in Transition Based Dependency Parsing In Proceedings ofthe CoNLL 2018 Shared Task Multilingual Parsing from Raw Text toUniversal Dependencies

Omer Kırnap Berkay Furkan Onder and Deniz Yuret 2017 Parsingwith Context Embeddings In Proceedings of the CoNLL 2017 SharedTask Multilingual Parsing from Raw Text to Universal Dependencies

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann Carlos Gomez-Rodriguez and Giorgio Satta 2011Dynamic programming algorithms for transition-based dependencyparsers In Proceedings of the 49th Annual Meeting of theAssociation for Computational Linguistics Human LanguageTechnologies-Volume 1 Association for Computational Linguisticspages 673682

S Kbler R McDonald and J Nivre 2009 Dependency parsingMorgan amp Claypool US

Chris Dyer Miguel Ballesteros Wang Ling Austin Matthews andNoah A Smith 2015 Transition based dependency parsing withstack long-short term memory CoRR abs150508075

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123

  • Introduction
    • Overview of Dependency Parsing
    • Transition Based Dependency Parsing
      • Related Work
        • Linear Models and their Drawbacks
        • Neural Network Models
          • Model
            • Language Model
            • MLP Parser
            • Tree-stack LSTM Parser
              • Results
                • MLP vs Tree-stack LSTM
                • Morphological Feature Embeddings
                • Static vs Dynamic Oracle Training
                • Transfer Learning
                  • Conclusion
                  • Future Work amp Discussions
Page 103: Transition Based Dependency Parsing with Deep LearningTransition Based Dependency Parsing with Deep Learning Omer K rnap Ko˘c University okirnap@ku.edu.tr September 27, 2018 Omer

What does Morphological Feature Embedding provide

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental SettingsWe divide Conll18 UD dataset 22 into 4 parts based on number oftraining tokens for each language to better understand our contributions

Languages having less than 20k tokens

Languages having more than 20k less than 50k tokens

Languages having more than 50k less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having less than 20k training tokens

Lang code Morph-Feats no Morph-Feats of tokensno nynorsklia 5113 5333 3583

ru taiga 5832 6055 10479

sme giella 5278 5339 16385

la perseus 4993 516 18184

ug udt 5278 5339 19262

sl sst 4672 4877 19473

hu szeged 6623 6818 20166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having tokens in between 50k and100k

Lang code Morph-Feats no Morph-Feats of tokenssv lines 7218 7481 48325

fr sequoia 8436 8217 50543

en gum 7644 7534 53686

ko gsd 7374 7254 56687

eu bdt 7455 7332 72974

nl lassymal 767 758 75134

gl ctg 7902 79018 79327

lv lvtb 7233 7224 80666

id gsd 7576 7397 97531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having more than 100k trainingtokens

Lang code Morph-Feats no Morph-Feats of tokensfa seraji 8118 8112 121064

bg btb 8453 8455 124336

en ewt 7577 75682 204585

ar padt 6802 6814 223881

de gsd 7159 7132 263804

ca ancora 8589 85874 417587

es ancora 8499 8478 444617

cs cac 8357 8363 472608

cs pdt 8143 8212 1173282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle transitions using gold movesDynamic oracle transitions using predicted moves

In both cases logp of gold moves maximized

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning1 Using very limited data to train LM for word and context vectors and

use them to train a parser from scratch2 Using Facebookrsquos word vectors to train a parser [Bojanowski et al

2017]3 Using my own word and context vectors trained with different

language but from the same language family4 Applying transfer learning with a pre-trained parser

Language (1) (2) (3) (4)af afribooms not provided 7546 7743 7812kk ktb 2019 2231 2196 2386bxr bdt 764 976 993 898

kmr mg 2012 2257 2278 2339

Table LAS values for strategies (1) (2) (3) and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the mostbeneficial

From scratch LM training does not bring useful word and contextvectors

Our word and context vectors are still more useful than Facebookrsquos[Bojanowski et al 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition Based Parser can only build projective trees 6

6Figure fromhttpstplingfiluuse sarakurser5LN455-2014lectures5LN455-F8pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123

Projective vs Non-projective

We compared our model with the best model for different projectivityratios

Language Projectiviy Best (LAS) Our (LAS)

grc perseus 907 7939 5503 (20)

eu bdt 9513 8422 7413 (17)

hu szeged 978 8266 6818 (14)

da ddt 9826 8628 7640 (17)

en gum 996 8505 7644 (15)

gl treegal 100 7425 7045 (10)

gl ctg 100 8212 7945 (14)

Table Our models performance gap decreases as the projectivity ratio increases

7

7From official results page and our projectivity tableOmer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusionWe introduced ldquoContext Word and Morph-featrdquo embeddings and showedtheir contribution in transition based dependency parsing

Our Tree-stack LSTM outperformed MLP by removing hand-craftedfeature engineering

Tree-stack LSTM performed better with low resource languages

When the training dataset size increases tree-stack LSTM losses itsadvantage

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization morphological taggingand dependency parsing performed better Some are also jointly trained alanguage model together with pre-trained embeddings

Attention Mechanism

Applying attention in between σ-LSTM states or β-LSTM orAction-LSTM may bring performance improvement

Morphological Features

Finding different way to represent morphological features

Dynamic Oracle vs Beam Training

Although I tried both of them I could not obtain performanceimprovement There may be convergence problems with our loss functionand another losses (CRF) may solve this problem

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap Erenay Dayanık and Deniz Yuret 2018 Tree-stackLSTM in Transition Based Dependency Parsing In Proceedings ofthe CoNLL 2018 Shared Task Multilingual Parsing from Raw Text toUniversal Dependencies

Omer Kırnap Berkay Furkan Onder and Deniz Yuret 2017 Parsingwith Context Embeddings In Proceedings of the CoNLL 2017 SharedTask Multilingual Parsing from Raw Text to Universal Dependencies

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann Carlos Gomez-Rodriguez and Giorgio Satta 2011Dynamic programming algorithms for transition-based dependencyparsers In Proceedings of the 49th Annual Meeting of theAssociation for Computational Linguistics Human LanguageTechnologies-Volume 1 Association for Computational Linguisticspages 673682

S Kbler R McDonald and J Nivre 2009 Dependency parsingMorgan amp Claypool US

Chris Dyer Miguel Ballesteros Wang Ling Austin Matthews andNoah A Smith 2015 Transition based dependency parsing withstack long-short term memory CoRR abs150508075

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123

  • Introduction
    • Overview of Dependency Parsing
    • Transition Based Dependency Parsing
      • Related Work
        • Linear Models and their Drawbacks
        • Neural Network Models
          • Model
            • Language Model
            • MLP Parser
            • Tree-stack LSTM Parser
              • Results
                • MLP vs Tree-stack LSTM
                • Morphological Feature Embeddings
                • Static vs Dynamic Oracle Training
                • Transfer Learning
                  • Conclusion
                  • Future Work amp Discussions
Page 104: Transition Based Dependency Parsing with Deep LearningTransition Based Dependency Parsing with Deep Learning Omer K rnap Ko˘c University okirnap@ku.edu.tr September 27, 2018 Omer

Contribution of Morph-feat Embeddings

Experimental SettingsWe divide Conll18 UD dataset 22 into 4 parts based on number oftraining tokens for each language to better understand our contributions

Languages having less than 20k tokens

Languages having more than 20k less than 50k tokens

Languages having more than 50k less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having less than 20k training tokens

Lang code Morph-Feats no Morph-Feats of tokensno nynorsklia 5113 5333 3583

ru taiga 5832 6055 10479

sme giella 5278 5339 16385

la perseus 4993 516 18184

ug udt 5278 5339 19262

sl sst 4672 4877 19473

hu szeged 6623 6818 20166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having tokens in between 50k and100k

Lang code Morph-Feats no Morph-Feats of tokenssv lines 7218 7481 48325

fr sequoia 8436 8217 50543

en gum 7644 7534 53686

ko gsd 7374 7254 56687

eu bdt 7455 7332 72974

nl lassymal 767 758 75134

gl ctg 7902 79018 79327

lv lvtb 7233 7224 80666

id gsd 7576 7397 97531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having more than 100k trainingtokens

Lang code Morph-Feats no Morph-Feats of tokensfa seraji 8118 8112 121064

bg btb 8453 8455 124336

en ewt 7577 75682 204585

ar padt 6802 6814 223881

de gsd 7159 7132 263804

ca ancora 8589 85874 417587

es ancora 8499 8478 444617

cs cac 8357 8363 472608

cs pdt 8143 8212 1173282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle transitions using gold movesDynamic oracle transitions using predicted moves

In both cases logp of gold moves maximized

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning1 Using very limited data to train LM for word and context vectors and

use them to train a parser from scratch2 Using Facebookrsquos word vectors to train a parser [Bojanowski et al

2017]3 Using my own word and context vectors trained with different

language but from the same language family4 Applying transfer learning with a pre-trained parser

Language (1) (2) (3) (4)af afribooms not provided 7546 7743 7812kk ktb 2019 2231 2196 2386bxr bdt 764 976 993 898

kmr mg 2012 2257 2278 2339

Table LAS values for strategies (1) (2) (3) and (4)

Contribution of Morph-feat embeddings

Morph-feat experiments for languages with fewer than 20k training tokens:

Lang code       Morph-Feats   no Morph-Feats   # of tokens
no nynorsklia   51.13         53.33            3,583
ru taiga        58.32         60.55            10,479
sme giella      52.78         53.39            16,385
la perseus      49.93         51.60            18,184
ug udt          52.78         53.39            19,262
sl sst          46.72         48.77            19,473
hu szeged       66.23         68.18            20,166

Morph-feat embeddings are not useful for languages with fewer than 20k training tokens.
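As a concrete sketch of how morph-feat embeddings can enter a token representation, each morphological feature value can be looked up in an embedding table, averaged, and concatenated to the word vector. The vocabularies, dimensions and the averaging choice below are illustrative assumptions, not the thesis configuration:

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical feature vocabulary: each morphological feature value gets
# its own row in an embedding table (names and sizes are illustrative).
MORPH_VOCAB = {"Case=Nom": 0, "Case=Acc": 1, "Number=Sing": 2, "Number=Plur": 3}
EMBED_DIM = 4

morph_table = rng.normal(size=(len(MORPH_VOCAB), EMBED_DIM))

def morph_feat_vector(feats):
    """Average the embeddings of a token's morphological feature values.

    Averaging (rather than summing) keeps the scale independent of how
    many features a token carries; tokens with no features get zeros.
    """
    rows = [morph_table[MORPH_VOCAB[f]] for f in feats if f in MORPH_VOCAB]
    if not rows:
        return np.zeros(EMBED_DIM)
    return np.mean(rows, axis=0)

word_vec = rng.normal(size=8)            # stand-in for a pre-trained word vector
feats = ["Case=Nom", "Number=Sing"]
token_repr = np.concatenate([word_vec, morph_feat_vector(feats)])
print(token_repr.shape)                  # (12,)
```

The concatenated vector is what a parser's scoring network would consume in place of the word vector alone.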

Contribution of Morph-feat embeddings

Morph-feat experiments for languages with between 50k and 100k training tokens:

Lang code      Morph-Feats   no Morph-Feats   # of tokens
sv lines       72.18         74.81            48,325
fr sequoia     84.36         82.17            50,543
en gum         76.44         75.34            53,686
ko gsd         73.74         72.54            56,687
eu bdt         74.55         73.32            72,974
nl lassysmall  76.70         75.80            75,134
gl ctg         79.02         79.018           79,327
lv lvtb        72.33         72.24            80,666
id gsd         75.76         73.97            97,531

Morph-feat embeddings are beneficial for languages with 50k-100k training tokens.

Contribution of Morph-feat embeddings

Morph-feat experiments for languages with more than 100k training tokens:

Lang code   Morph-Feats   no Morph-Feats   # of tokens
fa seraji   81.18         81.12            121,064
bg btb      84.53         84.55            124,336
en ewt      75.77         75.682           204,585
ar padt     68.02         68.14            223,881
de gsd      71.59         71.32            263,804
ca ancora   85.89         85.874           417,587
es ancora   84.99         84.78            444,617
cs cac      83.57         83.63            472,608
cs pdt      81.43         82.12            1,173,282

Morph-feat embeddings are neutral for languages with more than 100k training tokens.

Static vs Dynamic Oracle Training

Static oracle: transitions follow the gold moves.
Dynamic oracle: transitions follow the predicted moves.

In both cases, the log-probability of the gold moves is maximized.

Figure: t-RNN architecture. LSTMs over the head word, the dependent word and the dependency relation are concatenated and fed to an MLP.
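The two training regimes can be sketched as one loop that always takes its loss against the oracle-best move but differs in which move it actually follows. The parser interface here (ToyParser, score_moves, oracle_move) is a deterministic hypothetical stand-in, not the thesis code:

```python
import random

class ToyState:
    """Minimal parser state: just counts remaining tokens."""
    def __init__(self, n): self.remaining = n
    def is_final(self): return self.remaining == 0
    def apply(self, move): return ToyState(self.remaining - 1)

class ToyParser:
    """Deterministic stand-in exposing the interface the sketch assumes."""
    def initial_state(self, sentence): return ToyState(len(sentence))
    def oracle_move(self, state, gold_tree): return "shift"
    def score_moves(self, state):
        # Fixed log-probabilities per transition, for illustration only.
        return {"shift": -0.1, "left-arc": -2.3, "right-arc": -2.3}

def train_step(parser, sentence, gold_tree, oracle="static", explore=0.1):
    loss = 0.0
    state = parser.initial_state(sentence)
    while not state.is_final():
        gold_move = parser.oracle_move(state, gold_tree)
        scores = parser.score_moves(state)      # log-probabilities per move
        loss -= scores[gold_move]               # maximize log p(gold move)
        if oracle == "dynamic" and random.random() < explore:
            move = max(scores, key=scores.get)  # follow the parser's prediction
        else:
            move = gold_move                    # follow the gold transition
        state = state.apply(move)
    return loss

loss = train_step(ToyParser(), ["Economic", "news"], gold_tree=None)
print(round(loss, 2))  # 0.2
```

Note that the loss term is identical in both regimes; only the state sequence the model is exposed to changes, which is what lets a dynamic oracle teach recovery from its own mistakes.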

Static vs Dynamic Oracle Training

Figure: Results are very close for training sets with fewer than 20k tokens.

Static vs Dynamic Oracle Training

Figure: Results are very close for training sets between 20k and 50k tokens.

Static vs Dynamic Oracle Training

Figure: Results are very close for training sets with more than 50k tokens.

What about languages with fewer than 20k training tokens?

Transfer Learning

There are 4 possible types of transfer learning:
1. Using very limited data to train an LM for word and context vectors, and using them to train a parser from scratch
2. Using Facebook's word vectors to train a parser [Bojanowski et al., 2017]
3. Using my own word and context vectors, trained with a different language from the same language family
4. Applying transfer learning with a pre-trained parser

Language       (1)            (2)      (3)      (4)
af afribooms   not provided   75.46    77.43    78.12
kk ktb         20.19          22.31    21.96    23.86
bxr bdt        7.64           9.76     9.93     8.98
kmr mg         20.12          22.57    22.78    23.39

Table: LAS values for strategies (1), (2), (3) and (4)

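Strategy (4) can be sketched as a warm start: copy the pre-trained parser's hidden-layer weights and re-initialize only the language-specific embedding table before fine-tuning on the target treebank. The parameter names and shapes below are illustrative assumptions, not the thesis model:

```python
import numpy as np

rng = np.random.default_rng(1)

def warm_start(pretrained, target_vocab_size, embed_dim):
    """Initialize a new parser's parameters from a pre-trained one.

    Hidden-layer weights are copied; the word-embedding table is
    re-initialized because the target language has its own vocabulary.
    """
    params = {k: v.copy() for k, v in pretrained.items() if k != "embeddings"}
    params["embeddings"] = rng.normal(scale=0.01, size=(target_vocab_size, embed_dim))
    return params

source = {
    "embeddings": rng.normal(size=(5_000, 64)),  # source-language vocabulary
    "lstm_W": rng.normal(size=(256, 64)),
    "mlp_W": rng.normal(size=(3, 256)),          # one score per transition
}
target = warm_start(source, target_vocab_size=800, embed_dim=64)
print(target["embeddings"].shape, np.array_equal(target["lstm_W"], source["lstm_W"]))
# (800, 64) True
```

Fine-tuning would then continue training all of `target`'s parameters on the low-resource treebank.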

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the most beneficial.

From-scratch LM training does not bring useful word and context vectors.

Our word and context vectors are still more useful than Facebook's [Bojanowski et al., 2017].

Projectivity

A transition based parser can only build projective trees. 6

6. Figure from http://stp.lingfil.uu.se/~sara/kurser/5LN455-2014/lectures/5LN455-F8.pdf
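Projectivity can be checked by testing whether any two dependency arcs cross. This small sketch uses CoNLL-style 1-based head indices with 0 denoting the root:

```python
def is_projective(heads):
    """Check projectivity of a dependency tree.

    `heads[i]` is the head of token i+1, with 0 denoting the root
    (CoNLL-style 1-based indexing). A tree is projective iff no two
    dependency arcs cross when drawn above the sentence.
    """
    arcs = [(min(h, d), max(h, d)) for d, h in enumerate(heads, start=1)]
    for i, (a, b) in enumerate(arcs):
        for c, e in arcs[i + 1:]:
            # Arcs (a,b) and (c,e) cross iff exactly one endpoint of one
            # arc lies strictly inside the span of the other.
            if a < c < b < e or c < a < e < b:
                return False
    return True

# "Economic news had little effect on financial markets" (projective)
print(is_projective([2, 3, 0, 5, 3, 5, 8, 6]))  # True
```

Running the same check over a treebank gives the projectivity ratios compared in the next slide.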

Projective vs Non-projective

We compared our model with the best model for different projectivity ratios.

Language      Projectivity   Best (LAS)   Our (LAS)
grc perseus   90.70          79.39        55.03 (20)
eu bdt        95.13          84.22        74.13 (17)
hu szeged     97.80          82.66        68.18 (14)
da ddt        98.26          86.28        76.40 (17)
en gum        99.60          85.05        76.44 (15)
gl treegal    100            74.25        70.45 (10)
gl ctg        100            82.12        79.45 (14)

Table: Our model's performance gap decreases as the projectivity ratio increases. 7

7. From the official results page and our projectivity table.

Conclusions


Conclusion

In conclusion:

We introduced "Context", "Word" and "Morph-feat" embeddings and showed their contribution in transition based dependency parsing.

Our Tree-stack LSTM outperformed the MLP by removing hand-crafted feature engineering.

Tree-stack LSTM performed better on low-resource languages.

When the training dataset size increases, tree-stack LSTM loses its advantage.

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization, morphological tagging and dependency parsing performed better. Some also jointly train a language model together with pre-trained embeddings.

Attention Mechanism

Applying attention between the σ-LSTM, β-LSTM or Action-LSTM states may bring a performance improvement.
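One plausible form of this proposal is scaled dot-product attention over the stacked LSTM states. This is a sketch of the idea under that assumption, not an implemented component of the thesis:

```python
import numpy as np

def attention(query, keys, values):
    """Scaled dot-product attention over a sequence of LSTM states.

    `query` is one state vector; `keys`/`values` are the stacked states.
    """
    d = query.shape[-1]
    scores = keys @ query / np.sqrt(d)   # similarity of query to each state
    weights = np.exp(scores - scores.max())
    weights /= weights.sum()             # softmax over positions
    return weights @ values              # weighted sum of the states

rng = np.random.default_rng(2)
states = rng.normal(size=(5, 16))        # e.g. five sigma-LSTM hidden states
context = attention(states[-1], states, states)
print(context.shape)  # (16,)
```

The resulting context vector could be concatenated with the existing parser-state features before the MLP.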

Morphological Features

Finding different ways to represent morphological features.

Dynamic Oracle vs Beam Training

Although I tried both of them, I could not obtain a performance improvement. There may be convergence problems with our loss function, and other losses (e.g. CRF) may solve this problem.

Publications

Ömer Kırnap, Erenay Dayanık, and Deniz Yuret. 2018. Tree-stack LSTM in Transition Based Dependency Parsing. In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Ömer Kırnap, Berkay Furkan Önder, and Deniz Yuret. 2017. Parsing with Context Embeddings. In Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

References

Marco Kuhlmann, Carlos Gómez-Rodríguez, and Giorgio Satta. 2011. Dynamic programming algorithms for transition-based dependency parsers. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Volume 1. Association for Computational Linguistics, pages 673-682.

S. Kübler, R. McDonald, and J. Nivre. 2009. Dependency Parsing. Morgan & Claypool, US.

Chris Dyer, Miguel Ballesteros, Wang Ling, Austin Matthews, and Noah A. Smith. 2015. Transition-based dependency parsing with stack long short-term memory. CoRR abs/1505.08075.

Thank you for your attention


Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123

  • Introduction
    • Overview of Dependency Parsing
    • Transition Based Dependency Parsing
      • Related Work
        • Linear Models and their Drawbacks
        • Neural Network Models
          • Model
            • Language Model
            • MLP Parser
            • Tree-stack LSTM Parser
              • Results
                • MLP vs Tree-stack LSTM
                • Morphological Feature Embeddings
                • Static vs Dynamic Oracle Training
                • Transfer Learning
                  • Conclusion
                  • Future Work amp Discussions
Page 106: Transition Based Dependency Parsing with Deep LearningTransition Based Dependency Parsing with Deep Learning Omer K rnap Ko˘c University okirnap@ku.edu.tr September 27, 2018 Omer

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having tokens in between 50k and100k

Lang code Morph-Feats no Morph-Feats of tokenssv lines 7218 7481 48325

fr sequoia 8436 8217 50543

en gum 7644 7534 53686

ko gsd 7374 7254 56687

eu bdt 7455 7332 72974

nl lassymal 767 758 75134

gl ctg 7902 79018 79327

lv lvtb 7233 7224 80666

id gsd 7576 7397 97531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having more than 100k trainingtokens

Lang code Morph-Feats no Morph-Feats of tokensfa seraji 8118 8112 121064

bg btb 8453 8455 124336

en ewt 7577 75682 204585

ar padt 6802 6814 223881

de gsd 7159 7132 263804

ca ancora 8589 85874 417587

es ancora 8499 8478 444617

cs cac 8357 8363 472608

cs pdt 8143 8212 1173282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle transitions using gold movesDynamic oracle transitions using predicted moves

In both cases logp of gold moves maximized

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning1 Using very limited data to train LM for word and context vectors and

use them to train a parser from scratch2 Using Facebookrsquos word vectors to train a parser [Bojanowski et al

2017]3 Using my own word and context vectors trained with different

language but from the same language family4 Applying transfer learning with a pre-trained parser

Language (1) (2) (3) (4)af afribooms not provided 7546 7743 7812kk ktb 2019 2231 2196 2386bxr bdt 764 976 993 898

kmr mg 2012 2257 2278 2339

Table LAS values for strategies (1) (2) (3) and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the mostbeneficial

From scratch LM training does not bring useful word and contextvectors

Our word and context vectors are still more useful than Facebookrsquos[Bojanowski et al 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition Based Parser can only build projective trees 6

6Figure fromhttpstplingfiluuse sarakurser5LN455-2014lectures5LN455-F8pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123

Projective vs Non-projective

We compared our model with the best model for different projectivityratios

Language Projectiviy Best (LAS) Our (LAS)

grc perseus 907 7939 5503 (20)

eu bdt 9513 8422 7413 (17)

hu szeged 978 8266 6818 (14)

da ddt 9826 8628 7640 (17)

en gum 996 8505 7644 (15)

gl treegal 100 7425 7045 (10)

gl ctg 100 8212 7945 (14)

Table Our models performance gap decreases as the projectivity ratio increases

7

7From official results page and our projectivity tableOmer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusionWe introduced ldquoContext Word and Morph-featrdquo embeddings and showedtheir contribution in transition based dependency parsing

Our Tree-stack LSTM outperformed MLP by removing hand-craftedfeature engineering

Tree-stack LSTM performed better with low resource languages

When the training dataset size increases tree-stack LSTM losses itsadvantage

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization morphological taggingand dependency parsing performed better Some are also jointly trained alanguage model together with pre-trained embeddings

Attention Mechanism

Applying attention in between σ-LSTM states or β-LSTM orAction-LSTM may bring performance improvement

Morphological Features

Finding different way to represent morphological features

Dynamic Oracle vs Beam Training

Although I tried both of them I could not obtain performanceimprovement There may be convergence problems with our loss functionand another losses (CRF) may solve this problem

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap Erenay Dayanık and Deniz Yuret 2018 Tree-stackLSTM in Transition Based Dependency Parsing In Proceedings ofthe CoNLL 2018 Shared Task Multilingual Parsing from Raw Text toUniversal Dependencies

Omer Kırnap Berkay Furkan Onder and Deniz Yuret 2017 Parsingwith Context Embeddings In Proceedings of the CoNLL 2017 SharedTask Multilingual Parsing from Raw Text to Universal Dependencies

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann Carlos Gomez-Rodriguez and Giorgio Satta 2011Dynamic programming algorithms for transition-based dependencyparsers In Proceedings of the 49th Annual Meeting of theAssociation for Computational Linguistics Human LanguageTechnologies-Volume 1 Association for Computational Linguisticspages 673682

S Kbler R McDonald and J Nivre 2009 Dependency parsingMorgan amp Claypool US

Chris Dyer Miguel Ballesteros Wang Ling Austin Matthews andNoah A Smith 2015 Transition based dependency parsing withstack long-short term memory CoRR abs150508075

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123

  • Introduction
    • Overview of Dependency Parsing
    • Transition Based Dependency Parsing
      • Related Work
        • Linear Models and their Drawbacks
        • Neural Network Models
          • Model
            • Language Model
            • MLP Parser
            • Tree-stack LSTM Parser
              • Results
                • MLP vs Tree-stack LSTM
                • Morphological Feature Embeddings
                • Static vs Dynamic Oracle Training
                • Transfer Learning
                  • Conclusion
                  • Future Work amp Discussions
Page 107: Transition Based Dependency Parsing with Deep LearningTransition Based Dependency Parsing with Deep Learning Omer K rnap Ko˘c University okirnap@ku.edu.tr September 27, 2018 Omer

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having more than 100k trainingtokens

Lang code Morph-Feats no Morph-Feats of tokensfa seraji 8118 8112 121064

bg btb 8453 8455 124336

en ewt 7577 75682 204585

ar padt 6802 6814 223881

de gsd 7159 7132 263804

ca ancora 8589 85874 417587

es ancora 8499 8478 444617

cs cac 8357 8363 472608

cs pdt 8143 8212 1173282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle transitions using gold movesDynamic oracle transitions using predicted moves

In both cases logp of gold moves maximized

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning1 Using very limited data to train LM for word and context vectors and

use them to train a parser from scratch2 Using Facebookrsquos word vectors to train a parser [Bojanowski et al

2017]3 Using my own word and context vectors trained with different

language but from the same language family4 Applying transfer learning with a pre-trained parser

Language (1) (2) (3) (4)af afribooms not provided 7546 7743 7812kk ktb 2019 2231 2196 2386bxr bdt 764 976 993 898

kmr mg 2012 2257 2278 2339

Table LAS values for strategies (1) (2) (3) and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the mostbeneficial

From scratch LM training does not bring useful word and contextvectors

Our word and context vectors are still more useful than Facebookrsquos[Bojanowski et al 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition Based Parser can only build projective trees 6

6Figure fromhttpstplingfiluuse sarakurser5LN455-2014lectures5LN455-F8pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123

Projective vs Non-projective

We compared our model with the best model for different projectivityratios

Language Projectiviy Best (LAS) Our (LAS)

grc perseus 907 7939 5503 (20)

eu bdt 9513 8422 7413 (17)

hu szeged 978 8266 6818 (14)

da ddt 9826 8628 7640 (17)

en gum 996 8505 7644 (15)

gl treegal 100 7425 7045 (10)

gl ctg 100 8212 7945 (14)

Table Our models performance gap decreases as the projectivity ratio increases

7

7From official results page and our projectivity tableOmer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusionWe introduced ldquoContext Word and Morph-featrdquo embeddings and showedtheir contribution in transition based dependency parsing

Our Tree-stack LSTM outperformed MLP by removing hand-craftedfeature engineering

Tree-stack LSTM performed better with low resource languages

When the training dataset size increases tree-stack LSTM losses itsadvantage

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization morphological taggingand dependency parsing performed better Some are also jointly trained alanguage model together with pre-trained embeddings

Attention Mechanism

Applying attention in between σ-LSTM states or β-LSTM orAction-LSTM may bring performance improvement

Morphological Features

Finding different way to represent morphological features

Dynamic Oracle vs Beam Training

Although I tried both of them I could not obtain performanceimprovement There may be convergence problems with our loss functionand another losses (CRF) may solve this problem

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap Erenay Dayanık and Deniz Yuret 2018 Tree-stackLSTM in Transition Based Dependency Parsing In Proceedings ofthe CoNLL 2018 Shared Task Multilingual Parsing from Raw Text toUniversal Dependencies

Omer Kırnap Berkay Furkan Onder and Deniz Yuret 2017 Parsingwith Context Embeddings In Proceedings of the CoNLL 2017 SharedTask Multilingual Parsing from Raw Text to Universal Dependencies

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann Carlos Gomez-Rodriguez and Giorgio Satta 2011Dynamic programming algorithms for transition-based dependencyparsers In Proceedings of the 49th Annual Meeting of theAssociation for Computational Linguistics Human LanguageTechnologies-Volume 1 Association for Computational Linguisticspages 673682

S Kbler R McDonald and J Nivre 2009 Dependency parsingMorgan amp Claypool US

Chris Dyer Miguel Ballesteros Wang Ling Austin Matthews andNoah A Smith 2015 Transition based dependency parsing withstack long-short term memory CoRR abs150508075

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123

  • Introduction
    • Overview of Dependency Parsing
    • Transition Based Dependency Parsing
      • Related Work
        • Linear Models and their Drawbacks
        • Neural Network Models
          • Model
            • Language Model
            • MLP Parser
            • Tree-stack LSTM Parser
              • Results
                • MLP vs Tree-stack LSTM
                • Morphological Feature Embeddings
                • Static vs Dynamic Oracle Training
                • Transfer Learning
                  • Conclusion
                  • Future Work amp Discussions
Page 108: Transition Based Dependency Parsing with Deep LearningTransition Based Dependency Parsing with Deep Learning Omer K rnap Ko˘c University okirnap@ku.edu.tr September 27, 2018 Omer

Static vs Dynamic Oracle Training

Static oracle transitions using gold movesDynamic oracle transitions using predicted moves

In both cases logp of gold moves maximized

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning1 Using very limited data to train LM for word and context vectors and

use them to train a parser from scratch2 Using Facebookrsquos word vectors to train a parser [Bojanowski et al

2017]3 Using my own word and context vectors trained with different

language but from the same language family4 Applying transfer learning with a pre-trained parser

Language (1) (2) (3) (4)af afribooms not provided 7546 7743 7812kk ktb 2019 2231 2196 2386bxr bdt 764 976 993 898

kmr mg 2012 2257 2278 2339

Table LAS values for strategies (1) (2) (3) and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the mostbeneficial

From scratch LM training does not bring useful word and contextvectors

Our word and context vectors are still more useful than Facebookrsquos[Bojanowski et al 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition Based Parser can only build projective trees 6

6Figure fromhttpstplingfiluuse sarakurser5LN455-2014lectures5LN455-F8pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123

Projective vs Non-projective

We compared our model with the best model for different projectivityratios

Language Projectiviy Best (LAS) Our (LAS)

grc perseus 907 7939 5503 (20)

eu bdt 9513 8422 7413 (17)

hu szeged 978 8266 6818 (14)

da ddt 9826 8628 7640 (17)

en gum 996 8505 7644 (15)

gl treegal 100 7425 7045 (10)

gl ctg 100 8212 7945 (14)

Table Our models performance gap decreases as the projectivity ratio increases

7

7From official results page and our projectivity tableOmer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusionWe introduced ldquoContext Word and Morph-featrdquo embeddings and showedtheir contribution in transition based dependency parsing

Our Tree-stack LSTM outperformed MLP by removing hand-craftedfeature engineering

Tree-stack LSTM performed better with low resource languages

When the training dataset size increases tree-stack LSTM losses itsadvantage

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization morphological taggingand dependency parsing performed better Some are also jointly trained alanguage model together with pre-trained embeddings

Attention Mechanism

Applying attention in between σ-LSTM states or β-LSTM orAction-LSTM may bring performance improvement

Morphological Features

Finding different way to represent morphological features

Dynamic Oracle vs Beam Training

Although I tried both of them I could not obtain performanceimprovement There may be convergence problems with our loss functionand another losses (CRF) may solve this problem

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap Erenay Dayanık and Deniz Yuret 2018 Tree-stackLSTM in Transition Based Dependency Parsing In Proceedings ofthe CoNLL 2018 Shared Task Multilingual Parsing from Raw Text toUniversal Dependencies

Omer Kırnap Berkay Furkan Onder and Deniz Yuret 2017 Parsingwith Context Embeddings In Proceedings of the CoNLL 2017 SharedTask Multilingual Parsing from Raw Text to Universal Dependencies

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann Carlos Gomez-Rodriguez and Giorgio Satta 2011Dynamic programming algorithms for transition-based dependencyparsers In Proceedings of the 49th Annual Meeting of theAssociation for Computational Linguistics Human LanguageTechnologies-Volume 1 Association for Computational Linguisticspages 673682

S Kbler R McDonald and J Nivre 2009 Dependency parsingMorgan amp Claypool US

Chris Dyer Miguel Ballesteros Wang Ling Austin Matthews andNoah A Smith 2015 Transition based dependency parsing withstack long-short term memory CoRR abs150508075

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123

  • Introduction
    • Overview of Dependency Parsing
    • Transition Based Dependency Parsing
      • Related Work
        • Linear Models and their Drawbacks
        • Neural Network Models
          • Model
            • Language Model
            • MLP Parser
            • Tree-stack LSTM Parser
              • Results
                • MLP vs Tree-stack LSTM
                • Morphological Feature Embeddings
                • Static vs Dynamic Oracle Training
                • Transfer Learning
                  • Conclusion
                  • Future Work amp Discussions
Page 109: Transition Based Dependency Parsing with Deep LearningTransition Based Dependency Parsing with Deep Learning Omer K rnap Ko˘c University okirnap@ku.edu.tr September 27, 2018 Omer

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning1 Using very limited data to train LM for word and context vectors and

use them to train a parser from scratch2 Using Facebookrsquos word vectors to train a parser [Bojanowski et al

2017]3 Using my own word and context vectors trained with different

language but from the same language family4 Applying transfer learning with a pre-trained parser

Language (1) (2) (3) (4)af afribooms not provided 7546 7743 7812kk ktb 2019 2231 2196 2386bxr bdt 764 976 993 898

kmr mg 2012 2257 2278 2339

Table LAS values for strategies (1) (2) (3) and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the mostbeneficial

From scratch LM training does not bring useful word and contextvectors

Our word and context vectors are still more useful than Facebookrsquos[Bojanowski et al 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

A transition-based parser can only build projective trees. 6

6 Figure from https://stp.lingfil.uu.se/~sara/kurser/5LN455-2014/lectures/5LN455-F8.pdf

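A tree is projective iff no two dependency arcs cross when drawn above the sentence. A minimal check over gold head indices (an illustrative sketch, not code from the thesis):

```python
def is_projective(heads):
    """heads[i] is the head of token i+1 (0 = artificial root).
    The tree is projective iff no two dependency arcs cross."""
    spans = [tuple(sorted((h, d))) for d, h in enumerate(heads, start=1)]
    for lo1, hi1 in spans:
        for lo2, hi2 in spans:
            # two arcs cross iff exactly one endpoint of one arc
            # lies strictly inside the other arc's span
            if lo1 < lo2 < hi1 < hi2:
                return False
    return True
```

This is the check behind the projectivity ratios in the next table: sentences failing it cannot be produced exactly by the transition systems discussed here.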

Projective vs Non-projective

We compared our model with the best model for different projectivity ratios.

Language      Projectivity   Best (LAS)   Our (LAS)

grc_perseus   90.7           79.39        55.03 (20)

eu_bdt        95.13          84.22        74.13 (17)

hu_szeged     97.8           82.66        68.18 (14)

da_ddt        98.26          86.28        76.40 (17)

en_gum        99.6           85.05        76.44 (15)

gl_treegal    100            74.25        70.45 (10)

gl_ctg        100            82.12        79.45 (14)

Table: Our model's performance gap decreases as the projectivity ratio increases.

7 From the official results page and our projectivity table.
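The tables above report LAS (labeled attachment score). For reference, LAS and UAS over one sentence can be computed as follows (a generic sketch; the official CoNLL shared-task evaluation script additionally handles tokenization mismatches):

```python
def las_uas(gold, pred):
    """gold, pred: per-token (head, deprel) pairs for one sentence.
    UAS counts correct heads; LAS additionally requires the correct
    dependency label."""
    assert len(gold) == len(pred)
    n = len(gold)
    uas = sum(g[0] == p[0] for g, p in zip(gold, pred)) / n
    las = sum(g == p for g, p in zip(gold, pred)) / n
    return las, uas
```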

Conclusions


Conclusion

In conclusion:

We introduced "Context", "Word", and "Morph-feat" embeddings and showed their contribution to transition-based dependency parsing.

Our Tree-stack LSTM outperformed the MLP while removing hand-crafted feature engineering.

Tree-stack LSTM performed better on low-resource languages.

As the training dataset size increases, the Tree-stack LSTM loses its advantage.


Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization, morphological tagging, and dependency parsing performed better. Some also jointly train a language model together with pre-trained embeddings.

Attention Mechanism

Applying attention between the σ-LSTM, β-LSTM, or Action-LSTM states may bring a performance improvement.

Morphological Features

Finding different ways to represent morphological features.

Dynamic Oracle vs Beam Training

Although I tried both of them, I could not obtain a performance improvement. There may be convergence problems with our loss function, and other losses (e.g. CRF) may solve this problem.


Publications

Ömer Kırnap, Erenay Dayanık, and Deniz Yuret. 2018. Tree-stack LSTM in Transition Based Dependency Parsing. In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Ömer Kırnap, Berkay Furkan Önder, and Deniz Yuret. 2017. Parsing with Context Embeddings. In Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.


References

Marco Kuhlmann, Carlos Gómez-Rodríguez, and Giorgio Satta. 2011. Dynamic programming algorithms for transition-based dependency parsers. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Volume 1. Association for Computational Linguistics, pages 673-682.

S. Kübler, R. McDonald, and J. Nivre. 2009. Dependency Parsing. Morgan & Claypool, US.

Chris Dyer, Miguel Ballesteros, Wang Ling, Austin Matthews, and Noah A. Smith. 2015. Transition-based dependency parsing with stack long short-term memory. CoRR abs/1505.08075.


Thank you for your attention


Questions


  • Introduction
    • Overview of Dependency Parsing
    • Transition Based Dependency Parsing
      • Related Work
        • Linear Models and their Drawbacks
        • Neural Network Models
          • Model
            • Language Model
            • MLP Parser
            • Tree-stack LSTM Parser
              • Results
                • MLP vs Tree-stack LSTM
                • Morphological Feature Embeddings
                • Static vs Dynamic Oracle Training
                • Transfer Learning
                  • Conclusion
                  • Future Work amp Discussions
Page 110: Transition Based Dependency Parsing with Deep LearningTransition Based Dependency Parsing with Deep Learning Omer K rnap Ko˘c University okirnap@ku.edu.tr September 27, 2018 Omer

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning1 Using very limited data to train LM for word and context vectors and

use them to train a parser from scratch2 Using Facebookrsquos word vectors to train a parser [Bojanowski et al

2017]3 Using my own word and context vectors trained with different

language but from the same language family4 Applying transfer learning with a pre-trained parser

Language (1) (2) (3) (4)af afribooms not provided 7546 7743 7812kk ktb 2019 2231 2196 2386bxr bdt 764 976 993 898

kmr mg 2012 2257 2278 2339

Table LAS values for strategies (1) (2) (3) and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the mostbeneficial

From scratch LM training does not bring useful word and contextvectors

Our word and context vectors are still more useful than Facebookrsquos[Bojanowski et al 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition Based Parser can only build projective trees 6

6Figure fromhttpstplingfiluuse sarakurser5LN455-2014lectures5LN455-F8pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123

Projective vs Non-projective

We compared our model with the best model for different projectivityratios

Language Projectiviy Best (LAS) Our (LAS)

grc perseus 907 7939 5503 (20)

eu bdt 9513 8422 7413 (17)

hu szeged 978 8266 6818 (14)

da ddt 9826 8628 7640 (17)

en gum 996 8505 7644 (15)

gl treegal 100 7425 7045 (10)

gl ctg 100 8212 7945 (14)

Table Our models performance gap decreases as the projectivity ratio increases

7

7From official results page and our projectivity tableOmer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusionWe introduced ldquoContext Word and Morph-featrdquo embeddings and showedtheir contribution in transition based dependency parsing

Our Tree-stack LSTM outperformed MLP by removing hand-craftedfeature engineering

Tree-stack LSTM performed better with low resource languages

When the training dataset size increases tree-stack LSTM losses itsadvantage

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization morphological taggingand dependency parsing performed better Some are also jointly trained alanguage model together with pre-trained embeddings

Attention Mechanism

Applying attention in between σ-LSTM states or β-LSTM orAction-LSTM may bring performance improvement

Morphological Features

Finding different way to represent morphological features

Dynamic Oracle vs Beam Training

Although I tried both of them I could not obtain performanceimprovement There may be convergence problems with our loss functionand another losses (CRF) may solve this problem

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap Erenay Dayanık and Deniz Yuret 2018 Tree-stackLSTM in Transition Based Dependency Parsing In Proceedings ofthe CoNLL 2018 Shared Task Multilingual Parsing from Raw Text toUniversal Dependencies

Omer Kırnap Berkay Furkan Onder and Deniz Yuret 2017 Parsingwith Context Embeddings In Proceedings of the CoNLL 2017 SharedTask Multilingual Parsing from Raw Text to Universal Dependencies

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann Carlos Gomez-Rodriguez and Giorgio Satta 2011Dynamic programming algorithms for transition-based dependencyparsers In Proceedings of the 49th Annual Meeting of theAssociation for Computational Linguistics Human LanguageTechnologies-Volume 1 Association for Computational Linguisticspages 673682

S Kbler R McDonald and J Nivre 2009 Dependency parsingMorgan amp Claypool US

Chris Dyer Miguel Ballesteros Wang Ling Austin Matthews andNoah A Smith 2015 Transition based dependency parsing withstack long-short term memory CoRR abs150508075

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123

  • Introduction
    • Overview of Dependency Parsing
    • Transition Based Dependency Parsing
      • Related Work
        • Linear Models and their Drawbacks
        • Neural Network Models
          • Model
            • Language Model
            • MLP Parser
            • Tree-stack LSTM Parser
              • Results
                • MLP vs Tree-stack LSTM
                • Morphological Feature Embeddings
                • Static vs Dynamic Oracle Training
                • Transfer Learning
                  • Conclusion
                  • Future Work amp Discussions
Page 111: Transition Based Dependency Parsing with Deep LearningTransition Based Dependency Parsing with Deep Learning Omer K rnap Ko˘c University okirnap@ku.edu.tr September 27, 2018 Omer

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning1 Using very limited data to train LM for word and context vectors and

use them to train a parser from scratch2 Using Facebookrsquos word vectors to train a parser [Bojanowski et al

2017]3 Using my own word and context vectors trained with different

language but from the same language family4 Applying transfer learning with a pre-trained parser

Language (1) (2) (3) (4)af afribooms not provided 7546 7743 7812kk ktb 2019 2231 2196 2386bxr bdt 764 976 993 898

kmr mg 2012 2257 2278 2339

Table LAS values for strategies (1) (2) (3) and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the mostbeneficial

From scratch LM training does not bring useful word and contextvectors

Our word and context vectors are still more useful than Facebookrsquos[Bojanowski et al 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition Based Parser can only build projective trees 6

6Figure fromhttpstplingfiluuse sarakurser5LN455-2014lectures5LN455-F8pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123

Projective vs Non-projective

We compared our model with the best model for different projectivityratios

Language Projectiviy Best (LAS) Our (LAS)

grc perseus 907 7939 5503 (20)

eu bdt 9513 8422 7413 (17)

hu szeged 978 8266 6818 (14)

da ddt 9826 8628 7640 (17)

en gum 996 8505 7644 (15)

gl treegal 100 7425 7045 (10)

gl ctg 100 8212 7945 (14)

Table Our models performance gap decreases as the projectivity ratio increases

7

7From official results page and our projectivity tableOmer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusionWe introduced ldquoContext Word and Morph-featrdquo embeddings and showedtheir contribution in transition based dependency parsing

Our Tree-stack LSTM outperformed MLP by removing hand-craftedfeature engineering

Tree-stack LSTM performed better with low resource languages

When the training dataset size increases tree-stack LSTM losses itsadvantage

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization morphological taggingand dependency parsing performed better Some are also jointly trained alanguage model together with pre-trained embeddings

Attention Mechanism

Applying attention in between σ-LSTM states or β-LSTM orAction-LSTM may bring performance improvement

Morphological Features

Finding different way to represent morphological features

Dynamic Oracle vs Beam Training

Although I tried both of them I could not obtain performanceimprovement There may be convergence problems with our loss functionand another losses (CRF) may solve this problem

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap Erenay Dayanık and Deniz Yuret 2018 Tree-stackLSTM in Transition Based Dependency Parsing In Proceedings ofthe CoNLL 2018 Shared Task Multilingual Parsing from Raw Text toUniversal Dependencies

Omer Kırnap Berkay Furkan Onder and Deniz Yuret 2017 Parsingwith Context Embeddings In Proceedings of the CoNLL 2017 SharedTask Multilingual Parsing from Raw Text to Universal Dependencies

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann Carlos Gomez-Rodriguez and Giorgio Satta 2011Dynamic programming algorithms for transition-based dependencyparsers In Proceedings of the 49th Annual Meeting of theAssociation for Computational Linguistics Human LanguageTechnologies-Volume 1 Association for Computational Linguisticspages 673682

S Kbler R McDonald and J Nivre 2009 Dependency parsingMorgan amp Claypool US

Chris Dyer Miguel Ballesteros Wang Ling Austin Matthews andNoah A Smith 2015 Transition based dependency parsing withstack long-short term memory CoRR abs150508075

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123

  • Introduction
    • Overview of Dependency Parsing
    • Transition Based Dependency Parsing
      • Related Work
        • Linear Models and their Drawbacks
        • Neural Network Models
          • Model
            • Language Model
            • MLP Parser
            • Tree-stack LSTM Parser
              • Results
                • MLP vs Tree-stack LSTM
                • Morphological Feature Embeddings
                • Static vs Dynamic Oracle Training
                • Transfer Learning
                  • Conclusion
                  • Future Work amp Discussions
Page 112: Transition Based Dependency Parsing with Deep LearningTransition Based Dependency Parsing with Deep Learning Omer K rnap Ko˘c University okirnap@ku.edu.tr September 27, 2018 Omer

How about languages with less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning1 Using very limited data to train LM for word and context vectors and

use them to train a parser from scratch2 Using Facebookrsquos word vectors to train a parser [Bojanowski et al

2017]3 Using my own word and context vectors trained with different

language but from the same language family4 Applying transfer learning with a pre-trained parser

Language (1) (2) (3) (4)af afribooms not provided 7546 7743 7812kk ktb 2019 2231 2196 2386bxr bdt 764 976 993 898

kmr mg 2012 2257 2278 2339

Table LAS values for strategies (1) (2) (3) and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the mostbeneficial

From scratch LM training does not bring useful word and contextvectors

Our word and context vectors are still more useful than Facebookrsquos[Bojanowski et al 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition Based Parser can only build projective trees 6

6Figure fromhttpstplingfiluuse sarakurser5LN455-2014lectures5LN455-F8pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123

Projective vs Non-projective

We compared our model with the best model for different projectivityratios

Language Projectiviy Best (LAS) Our (LAS)

grc perseus 907 7939 5503 (20)

eu bdt 9513 8422 7413 (17)

hu szeged 978 8266 6818 (14)

da ddt 9826 8628 7640 (17)

en gum 996 8505 7644 (15)

gl treegal 100 7425 7045 (10)

gl ctg 100 8212 7945 (14)

Table Our models performance gap decreases as the projectivity ratio increases

7

7From official results page and our projectivity tableOmer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusionWe introduced ldquoContext Word and Morph-featrdquo embeddings and showedtheir contribution in transition based dependency parsing

Our Tree-stack LSTM outperformed MLP by removing hand-craftedfeature engineering

Tree-stack LSTM performed better with low resource languages

When the training dataset size increases tree-stack LSTM losses itsadvantage

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization morphological taggingand dependency parsing performed better Some are also jointly trained alanguage model together with pre-trained embeddings

Attention Mechanism

Applying attention in between σ-LSTM states or β-LSTM orAction-LSTM may bring performance improvement

Morphological Features

Finding different way to represent morphological features

Dynamic Oracle vs Beam Training

Although I tried both of them I could not obtain performanceimprovement There may be convergence problems with our loss functionand another losses (CRF) may solve this problem

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap Erenay Dayanık and Deniz Yuret 2018 Tree-stackLSTM in Transition Based Dependency Parsing In Proceedings ofthe CoNLL 2018 Shared Task Multilingual Parsing from Raw Text toUniversal Dependencies

Omer Kırnap Berkay Furkan Onder and Deniz Yuret 2017 Parsingwith Context Embeddings In Proceedings of the CoNLL 2017 SharedTask Multilingual Parsing from Raw Text to Universal Dependencies

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann Carlos Gomez-Rodriguez and Giorgio Satta 2011Dynamic programming algorithms for transition-based dependencyparsers In Proceedings of the 49th Annual Meeting of theAssociation for Computational Linguistics Human LanguageTechnologies-Volume 1 Association for Computational Linguisticspages 673682

S Kbler R McDonald and J Nivre 2009 Dependency parsingMorgan amp Claypool US

Chris Dyer Miguel Ballesteros Wang Ling Austin Matthews andNoah A Smith 2015 Transition based dependency parsing withstack long-short term memory CoRR abs150508075

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123

  • Introduction
    • Overview of Dependency Parsing
    • Transition Based Dependency Parsing
      • Related Work
        • Linear Models and their Drawbacks
        • Neural Network Models
          • Model
            • Language Model
            • MLP Parser
            • Tree-stack LSTM Parser
              • Results
                • MLP vs Tree-stack LSTM
                • Morphological Feature Embeddings
                • Static vs Dynamic Oracle Training
                • Transfer Learning
                  • Conclusion
                  • Future Work amp Discussions
Page 113: Transition Based Dependency Parsing with Deep LearningTransition Based Dependency Parsing with Deep Learning Omer K rnap Ko˘c University okirnap@ku.edu.tr September 27, 2018 Omer

Transfer Learning

There are 4 possible types of transfer learning1 Using very limited data to train LM for word and context vectors and

use them to train a parser from scratch2 Using Facebookrsquos word vectors to train a parser [Bojanowski et al

2017]3 Using my own word and context vectors trained with different

language but from the same language family4 Applying transfer learning with a pre-trained parser

Language (1) (2) (3) (4)af afribooms not provided 7546 7743 7812kk ktb 2019 2231 2196 2386bxr bdt 764 976 993 898

kmr mg 2012 2257 2278 2339

Table LAS values for strategies (1) (2) (3) and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the mostbeneficial

From scratch LM training does not bring useful word and contextvectors

Our word and context vectors are still more useful than Facebookrsquos[Bojanowski et al 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition Based Parser can only build projective trees 6

6Figure fromhttpstplingfiluuse sarakurser5LN455-2014lectures5LN455-F8pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123

Projective vs Non-projective

We compared our model with the best model for different projectivityratios

Language Projectiviy Best (LAS) Our (LAS)

grc perseus 907 7939 5503 (20)

eu bdt 9513 8422 7413 (17)

hu szeged 978 8266 6818 (14)

da ddt 9826 8628 7640 (17)

en gum 996 8505 7644 (15)

gl treegal 100 7425 7045 (10)

gl ctg 100 8212 7945 (14)

Table Our models performance gap decreases as the projectivity ratio increases

7

7From official results page and our projectivity tableOmer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusionWe introduced ldquoContext Word and Morph-featrdquo embeddings and showedtheir contribution in transition based dependency parsing

Our Tree-stack LSTM outperformed MLP by removing hand-craftedfeature engineering

Tree-stack LSTM performed better with low resource languages

When the training dataset size increases tree-stack LSTM losses itsadvantage

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization morphological taggingand dependency parsing performed better Some are also jointly trained alanguage model together with pre-trained embeddings

Attention Mechanism

Applying attention in between σ-LSTM states or β-LSTM orAction-LSTM may bring performance improvement

Morphological Features

Finding different way to represent morphological features

Dynamic Oracle vs Beam Training

Although I tried both of them I could not obtain performanceimprovement There may be convergence problems with our loss functionand another losses (CRF) may solve this problem

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap Erenay Dayanık and Deniz Yuret 2018 Tree-stackLSTM in Transition Based Dependency Parsing In Proceedings ofthe CoNLL 2018 Shared Task Multilingual Parsing from Raw Text toUniversal Dependencies

Omer Kırnap Berkay Furkan Onder and Deniz Yuret 2017 Parsingwith Context Embeddings In Proceedings of the CoNLL 2017 SharedTask Multilingual Parsing from Raw Text to Universal Dependencies

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann Carlos Gomez-Rodriguez and Giorgio Satta 2011Dynamic programming algorithms for transition-based dependencyparsers In Proceedings of the 49th Annual Meeting of theAssociation for Computational Linguistics Human LanguageTechnologies-Volume 1 Association for Computational Linguisticspages 673682

S Kbler R McDonald and J Nivre 2009 Dependency parsingMorgan amp Claypool US

Chris Dyer Miguel Ballesteros Wang Ling Austin Matthews andNoah A Smith 2015 Transition based dependency parsing withstack long-short term memory CoRR abs150508075

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123

  • Introduction
    • Overview of Dependency Parsing
    • Transition Based Dependency Parsing
      • Related Work
        • Linear Models and their Drawbacks
        • Neural Network Models
          • Model
            • Language Model
            • MLP Parser
            • Tree-stack LSTM Parser
              • Results
                • MLP vs Tree-stack LSTM
                • Morphological Feature Embeddings
                • Static vs Dynamic Oracle Training
                • Transfer Learning
                  • Conclusion
                  • Future Work amp Discussions
Page 114: Transition Based Dependency Parsing with Deep LearningTransition Based Dependency Parsing with Deep Learning Omer K rnap Ko˘c University okirnap@ku.edu.tr September 27, 2018 Omer

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the mostbeneficial

From scratch LM training does not bring useful word and contextvectors

Our word and context vectors are still more useful than Facebookrsquos[Bojanowski et al 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition Based Parser can only build projective trees 6

6Figure fromhttpstplingfiluuse sarakurser5LN455-2014lectures5LN455-F8pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123

Projective vs Non-projective

We compared our model with the best model for different projectivityratios

Language Projectiviy Best (LAS) Our (LAS)

grc perseus 907 7939 5503 (20)

eu bdt 9513 8422 7413 (17)

hu szeged 978 8266 6818 (14)

da ddt 9826 8628 7640 (17)

en gum 996 8505 7644 (15)

gl treegal 100 7425 7045 (10)

gl ctg 100 8212 7945 (14)

Table Our models performance gap decreases as the projectivity ratio increases

7

7From official results page and our projectivity tableOmer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusionWe introduced ldquoContext Word and Morph-featrdquo embeddings and showedtheir contribution in transition based dependency parsing

Our Tree-stack LSTM outperformed MLP by removing hand-craftedfeature engineering

Tree-stack LSTM performed better with low resource languages

When the training dataset size increases tree-stack LSTM losses itsadvantage

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization morphological taggingand dependency parsing performed better Some are also jointly trained alanguage model together with pre-trained embeddings

Attention Mechanism

Applying attention in between σ-LSTM states or β-LSTM orAction-LSTM may bring performance improvement

Morphological Features

Finding different way to represent morphological features

Dynamic Oracle vs Beam Training

Although I tried both of them I could not obtain performanceimprovement There may be convergence problems with our loss functionand another losses (CRF) may solve this problem

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap Erenay Dayanık and Deniz Yuret 2018 Tree-stackLSTM in Transition Based Dependency Parsing In Proceedings ofthe CoNLL 2018 Shared Task Multilingual Parsing from Raw Text toUniversal Dependencies

Omer Kırnap Berkay Furkan Onder and Deniz Yuret 2017 Parsingwith Context Embeddings In Proceedings of the CoNLL 2017 SharedTask Multilingual Parsing from Raw Text to Universal Dependencies

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann Carlos Gomez-Rodriguez and Giorgio Satta 2011Dynamic programming algorithms for transition-based dependencyparsers In Proceedings of the 49th Annual Meeting of theAssociation for Computational Linguistics Human LanguageTechnologies-Volume 1 Association for Computational Linguisticspages 673682

S Kbler R McDonald and J Nivre 2009 Dependency parsingMorgan amp Claypool US

Chris Dyer Miguel Ballesteros Wang Ling Austin Matthews andNoah A Smith 2015 Transition based dependency parsing withstack long-short term memory CoRR abs150508075

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123

  • Introduction
    • Overview of Dependency Parsing
    • Transition Based Dependency Parsing
      • Related Work
        • Linear Models and their Drawbacks
        • Neural Network Models
          • Model
            • Language Model
            • MLP Parser
            • Tree-stack LSTM Parser
              • Results
                • MLP vs Tree-stack LSTM
                • Morphological Feature Embeddings
                • Static vs Dynamic Oracle Training
                • Transfer Learning
                  • Conclusion
                  • Future Work amp Discussions
Page 115: Transition Based Dependency Parsing with Deep LearningTransition Based Dependency Parsing with Deep Learning Omer K rnap Ko˘c University okirnap@ku.edu.tr September 27, 2018 Omer

Projectivity

Transition Based Parser can only build projective trees 6

6Figure fromhttpstplingfiluuse sarakurser5LN455-2014lectures5LN455-F8pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123

Projective vs Non-projective

We compared our model with the best model for different projectivityratios

Language Projectiviy Best (LAS) Our (LAS)

grc perseus 907 7939 5503 (20)

eu bdt 9513 8422 7413 (17)

hu szeged 978 8266 6818 (14)

da ddt 9826 8628 7640 (17)

en gum 996 8505 7644 (15)

gl treegal 100 7425 7045 (10)

gl ctg 100 8212 7945 (14)

Table Our models performance gap decreases as the projectivity ratio increases

7

7From official results page and our projectivity tableOmer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusionWe introduced ldquoContext Word and Morph-featrdquo embeddings and showedtheir contribution in transition based dependency parsing

Our Tree-stack LSTM outperformed MLP by removing hand-craftedfeature engineering

Tree-stack LSTM performed better with low resource languages

When the training dataset size increases tree-stack LSTM losses itsadvantage

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization morphological taggingand dependency parsing performed better Some are also jointly trained alanguage model together with pre-trained embeddings

Attention Mechanism

Applying attention in between σ-LSTM states or β-LSTM orAction-LSTM may bring performance improvement

Morphological Features

Finding different way to represent morphological features

Dynamic Oracle vs Beam Training

Although I tried both of them I could not obtain performanceimprovement There may be convergence problems with our loss functionand another losses (CRF) may solve this problem

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap, Erenay Dayanık, and Deniz Yuret. 2018. Tree-stack LSTM in Transition Based Dependency Parsing. In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap, Berkay Furkan Önder, and Deniz Yuret. 2017. Parsing with Context Embeddings. In Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.


References

Marco Kuhlmann, Carlos Gómez-Rodríguez, and Giorgio Satta. 2011. Dynamic programming algorithms for transition-based dependency parsers. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Volume 1, pages 673–682. Association for Computational Linguistics.

S. Kübler, R. McDonald, and J. Nivre. 2009. Dependency Parsing. Morgan & Claypool, US.

Chris Dyer, Miguel Ballesteros, Wang Ling, Austin Matthews, and Noah A. Smith. 2015. Transition-based dependency parsing with stack long short-term memory. CoRR, abs/1505.08075.


Thank you for your attention


Questions

