Minimiza’on and Probability Distribu’on of Dependency ... · (2.532±3.056), WSJ2...

33
Minimiza’on and Probability Distribu’on of Dependency Distance in the Process of Second Language Acquisi’on Jinghui Ouyang, Jingyang Jiang Department of Linguis6cs Zhejiang University

Transcript of Minimiza’on and Probability Distribu’on of Dependency ... · (2.532±3.056), WSJ2...

Page 1: Minimiza’on and Probability Distribu’on of Dependency ... · (2.532±3.056), WSJ2 (2.516±2.952), WSJ3 (2.625±3.296)and WSJ4 (2.545±3.097) The MDD of high-level Chinese EFL

Minimiza'onandProbabilityDistribu'onofDependencyDistanceintheProcessof

SecondLanguageAcquisi'on

JinghuiOuyang,JingyangJiangDepartmentofLinguis6cs

ZhejiangUniversity

Page 2: Minimiza’on and Probability Distribu’on of Dependency ... · (2.532±3.056), WSJ2 (2.516±2.952), WSJ3 (2.625±3.296)and WSJ4 (2.545±3.097) The MDD of high-level Chinese EFL

1.  TheBackgroundofthecurrentpaper

2.  Minimiza6onandProbabilityDistribu6onofDependencyDistanceintheProcessofSecondLanguageAcquisi6on

Page 3: Minimiza’on and Probability Distribu’on of Dependency ... · (2.532±3.056), WSJ2 (2.516±2.952), WSJ3 (2.625±3.296)and WSJ4 (2.545±3.097) The MDD of high-level Chinese EFL

•  1.PartoftheKeyProjectofNa6onalSocialScienceFounda6on

“ResearchontheSyntac6cDevelopmentofChineseEFLLearnersBasedonDependencyTreebank”(2017-2022)

•  2.Wehave collected about 1200 composi6ons, 150thousand words, ranging from grade 4 in primaryschools(11yearsold)tocolleges(11grades).

The Background of the current paper

Page 4: Minimiza’on and Probability Distribu’on of Dependency ... · (2.532±3.056), WSJ2 (2.516±2.952), WSJ3 (2.625±3.296)and WSJ4 (2.545±3.097) The MDD of high-level Chinese EFL

ImportantConcepts

Page 5: Minimiza’on and Probability Distribu’on of Dependency ... · (2.532±3.056), WSJ2 (2.516±2.952), WSJ3 (2.625±3.296)and WSJ4 (2.545±3.097) The MDD of high-level Chinese EFL

Dependency Relation

In the syntactic analysis framework of dependency syntax, sentence structure is analyzed using the dependency relations between words in a sentence (Tesnière, 1959; Hudson, 2007, 2010; Nivre, 2006; Liu, 2009). A dependency relation has three core properties: binary, asymmetry, and labeledness.

Page 6: Minimiza’on and Probability Distribu’on of Dependency ... · (2.532±3.056), WSJ2 (2.516±2.952), WSJ3 (2.625±3.296)and WSJ4 (2.545±3.097) The MDD of high-level Chinese EFL

Dependency Distance

Dependencydistancereferstothe lineardistancebetweentwolinguis6cunits having a syntac6c rela6onship within a sentence (Heringer et al.,1980;Hudson,1995).

Page 7: Minimiza’on and Probability Distribu’on of Dependency ... · (2.532±3.056), WSJ2 (2.516±2.952), WSJ3 (2.625±3.296)and WSJ4 (2.545±3.097) The MDD of high-level Chinese EFL

Dependency Distance

Mean Dependency Distance (MDD) Themeandependencydistance(MDD)ofasentence:

Themeandependencydistance(MDD)ofatreebank:

Dependencydistancereferstothe lineardistancebetweentwolinguis6cunits having a syntac6c rela6onship within a sentence (Heringer et al.,1980;Hudson,1995).

Page 8: Minimiza’on and Probability Distribu’on of Dependency ... · (2.532±3.056), WSJ2 (2.516±2.952), WSJ3 (2.625±3.296)and WSJ4 (2.545±3.097) The MDD of high-level Chinese EFL

Dependency Distance Minimization (DDM)

•  The linear distance between two words with syntac6crela6onshipisrestrainedbyhumanworkingmemory.

•  The results of corpus-based research and psychologicalexperimentshaveindicatedthathumanlanguageshaveatendency towards dependency distance minimiza6on(DDM).

•  Dependencydistanceminimiza6onisfoundasauniversalquan6ta6vepropertyofmore than30human languages(Liu,2008,Futrelletal.2015).

Page 9: Minimiza’on and Probability Distribu’on of Dependency ... · (2.532±3.056), WSJ2 (2.516±2.952), WSJ3 (2.625±3.296)and WSJ4 (2.545±3.097) The MDD of high-level Chinese EFL

Introduc'on

Page 10: Minimiza’on and Probability Distribu’on of Dependency ... · (2.532±3.056), WSJ2 (2.516±2.952), WSJ3 (2.625±3.296)and WSJ4 (2.545±3.097) The MDD of high-level Chinese EFL

•  The materials of previous studies on DDM are allfromna6vespeakers’firstlanguage,butthereisnosuch study on second language learners’ languagesystem.

•  Second language learners’ language system,defined as “interlanguage”, is a structurallyintermediatestatusbetweenthena6veandtargetlanguages(Selinker1972).

•  Along with the improvement of their secondlanguage proficiency, learners’ language systemgradually develops towards na6ve speakers’language.

•  Does second language learners’ language systemalsodevelopundertheuniversalpressureofDDM?

Page 11: Minimiza’on and Probability Distribu’on of Dependency ... · (2.532±3.056), WSJ2 (2.516±2.952), WSJ3 (2.625±3.296)and WSJ4 (2.545±3.097) The MDD of high-level Chinese EFL

Tofigureouthowsecondlanguagelearnersobeytheprincipleofdependencydistanceminimiza6onduringtheprocessofsecondlanguageacquisi6on,weinves6gatedthedevelopmentofMDDsin Chinese EFL (English as a Foreign Language) learners’ Englishwri6ngs at different learning phases to answer the first twospecificresearchques6ons:

•  Research Ques'on 1. How does the mean dependencydistance(MDD)intheEnglishcomposi6onswrihenbyChineseEFLlearnersdevelopacrossninegrades?

•  ResearchQues'on2.DoesChineseEFLlearnersdeveloptheirEnglishproficiencyunderthepressureofdependencydistanceminimiza6on?

Page 12: Minimiza’on and Probability Distribu’on of Dependency ... · (2.532±3.056), WSJ2 (2.516±2.952), WSJ3 (2.625±3.296)and WSJ4 (2.545±3.097) The MDD of high-level Chinese EFL

•  Dependency distance can reflect the comprehensiondifficulty of syntac6c structure (Liu 2008). Dependencydistance minimiza6on is considered as resul6ng fromhumancogni6vemechanism(Liu2008,Luetal.2016)andthe effect of ‘the principle of least effort’ on syntac6cstructure (Zipf 1949). The distribu6on of dependencydistancespresentscertainregularity.

•  Ourpreviousstudy(Ouyang&Jiang2017)foundthattheprobability distribu6on of the dependency distance ofsecond language learners’ interlanguage can well fit theZipf-Alekseevdistribu6onand theparametersa andb intheZipf-Alekseevdistribu6onwellreflectsecondlanguagelearners’languageproficiencyatdifferentlearningstages.

Page 13: Minimiza’on and Probability Distribu’on of Dependency ... · (2.532±3.056), WSJ2 (2.516±2.952), WSJ3 (2.625±3.296)and WSJ4 (2.545±3.097) The MDD of high-level Chinese EFL

•  Toconfirmthatthepreviousfindings(Ouyang&Jiang2017)arenotsta6s6calartefactsandtofurtherdemonstratetheDDM of interlanguage in the process of second languageacquisi6on (SLA) from the probability of dependencydistances,weconstructedtworandomlanguages(RL1,RL2)using the composi6ons at different learning stages andfihedtheirprobabilityofdependencydistancestodifferentexponen6aldistribu6onandpowerlawdistribu6onmodels,includingtheZipf-Alekseevdistribu6on.

•  Research Ques'on 3. Does the probability distribu6on ofdependency distance of random languages of secondlanguage learners’ wri6ngs well fit the Zipf-Alekseevdistribu6on?Iftheanswerisyes,cantheparametersintheZipf-Alekseev distribu6on well reflect second languagelearners’languageproficiencyatdifferentlearningstages?

Page 14: Minimiza’on and Probability Distribu’on of Dependency ... · (2.532±3.056), WSJ2 (2.516±2.952), WSJ3 (2.625±3.296)and WSJ4 (2.545±3.097) The MDD of high-level Chinese EFL

Methodology

Page 15: Minimiza’on and Probability Distribu’on of Dependency ... · (2.532±3.056), WSJ2 (2.516±2.952), WSJ3 (2.625±3.296)and WSJ4 (2.545±3.097) The MDD of high-level Chinese EFL

Participants

Group Number YearsofEnglishLearning

J1(firstgradeofjuniorhighschool) 75 3-4

J2(secondgradeofjuniorhighschool) 61 4-5

J3(thirdgradeofjuniorhighschool) 69 5-6

S1(firstgradeofseniorhighschool) 78 6-7

S2(secondgradeofseniorhighschool) 74 7-8

S3(thirdgradeofseniorhighschool) 79 8-9

U1(firstgradeofuniversity) 40 9-10

U2(secondgradeofuniversity) 28 10-11

P1(FirstgradepostgraduateofEnglishmajor) 26 13-14

· First graders of junior school—First gradepostgraduatesofEnglishmajor· 367 Chinese students from two high schools and one university inZhejiangProvince

Page 16: Minimiza’on and Probability Distribu’on of Dependency ... · (2.532±3.056), WSJ2 (2.516±2.952), WSJ3 (2.625±3.296)and WSJ4 (2.545±3.097) The MDD of high-level Chinese EFL

Materials

Group Topic Genre Sampledcomposi'ons Wordcount

J1 My Weekend Narrative 60 6375

J2 My Weekend Narrative 60 6375

J3 My Weekend Narrative 44 6417

S1 A(n) Embarrassing/ Surprising/Unforgettable Thing Narrative 44 6307

S2 A(n) Embarrassing/ /Surprising/Unforgettable Thing Narrative 39 6312

S3 A(n) Embarrassing/ Surprising/ Unforgettable Thing Narrative 41 6358

U1 An Interesting/Annoying/ Embarrassing Story Narrative 25 6539

U2 An Interesting/Annoying/ Embarrassing Story Narrative 28 6431

P1 An Interesting/Annoying/ Embarrassing Story Narrative 26 7469

Total Narrative 341 58583

Self-built dependency treebank: 341 English composi6ons wrihenpar6cipantswithintheprescribed6melimitintheclassContras6ve dependency treebank: sub-corpora with about 6500wordsofeachcorpusfromtheWallStreetJournal(WSJ)Corpus

Page 17: Minimiza’on and Probability Distribu’on of Dependency ... · (2.532±3.056), WSJ2 (2.516±2.952), WSJ3 (2.625±3.296)and WSJ4 (2.545±3.097) The MDD of high-level Chinese EFL

Contents

Procedure

u Data Collection

u Automatic POS and Dependency Relation Annotation

u Establishment of Syntactic Relation and Error Tagging System

u Manual Tagging and Modification

Page 18: Minimiza’on and Probability Distribu’on of Dependency ... · (2.532±3.056), WSJ2 (2.516±2.952), WSJ3 (2.625±3.296)and WSJ4 (2.545±3.097) The MDD of high-level Chinese EFL

Contents Manual Syntactic Annotation and Modification WordOrder Word POS WordOrder

ofGovernor Governor POSofGovernor

DependencyRela'on

DependencyDistance

1 For IN-case-E 5 think VBP prep 4

2 students NN 1 For IN pobj -1

3 , , 5 think VBP punct 2

4 I PRP 5 think VBP nsubj 1

5 think VBP 5 think VBP root 0

6 we PRP 7 are VBP nsubj 1

7 are VBP 5 think VBP ccomp -2

8 stressed JJ 7 are VBP xcomp -1

9 out RP 8 stressed JJ compound:prt -1

10 . . 5 think VBP punct -5

Modifica6on:1.  Automa6csyntac6cannota6oninconsistentwithoursyntac6c

rela6onsystem2.  Wrongautoma6ctagging3.  Lexicalandgramma6calerrors

Page 19: Minimiza’on and Probability Distribu’on of Dependency ... · (2.532±3.056), WSJ2 (2.516±2.952), WSJ3 (2.625±3.296)and WSJ4 (2.545±3.097) The MDD of high-level Chinese EFL

ConstructTwoRandomTreebanks

Inthefirstrandomtreebank(RL1),withineachsentencewe select oneword as the root, andthen for every otherwordwe randomly selectanother word in the same sentence as itsgovernor,disregardingsyntaxandmeaning.

In the second random treebank (RL2), whilegovernorsareassignedrandomly,wemakesurethattheresultantdependencytree(graph) isaprojec6veandconnected tree, i.e.,nocrossingarcsareallowedinthegraph.

Page 20: Minimiza’on and Probability Distribu’on of Dependency ... · (2.532±3.056), WSJ2 (2.516±2.952), WSJ3 (2.625±3.296)and WSJ4 (2.545±3.097) The MDD of high-level Chinese EFL

Contents The Zipf-Alekseev Distribution

Page 21: Minimiza’on and Probability Distribu’on of Dependency ... · (2.532±3.056), WSJ2 (2.516±2.952), WSJ3 (2.625±3.296)and WSJ4 (2.545±3.097) The MDD of high-level Chinese EFL

Results&Discussions

Page 22: Minimiza’on and Probability Distribu’on of Dependency ... · (2.532±3.056), WSJ2 (2.516±2.952), WSJ3 (2.625±3.296)and WSJ4 (2.545±3.097) The MDD of high-level Chinese EFL

MDDatDifferentGrades

Junior high school: The MDD of Chinese EFL learners’ English writings increasessigni/icantly (p=0.000) from J1 (1.841) to J2 (2.061), butstays stable (p=0.936>0.05)fromJ2(2.061)toJ3(2.064).

Seniorhighschool: TheMDDofChineseEFLlearner’Englishwritings/irstincreasessigni/icantly (p=0.003) at S1 (2.188), then continues increasing insigni/icantly(p=0.445>0.05)atS2,butexperiencesasigni/icant(p=0.022)decreaseatS3(2.125).

University:theMDDoftheirwritingsincreasessigni/icantly(p=0.000)atPirst,butthenkeepssteady(p=0.782>0.005).

Page 23: Minimiza’on and Probability Distribu’on of Dependency ... · (2.532±3.056), WSJ2 (2.516±2.952), WSJ3 (2.625±3.296)and WSJ4 (2.545±3.097) The MDD of high-level Chinese EFL

MDDofChinesepostgraduatesofEnglishmajor&contras'vesub-corpora

TheresultsofindependentT-testshowthatthereexistsigni/icantdifferences(t(12490)=-1.426,p=0.002<0.05; t(12471)=-1.089, p=0.017<0.05; t(12223)=-3.047, p=0.000<0.01; t(12302)=-1.628,p=0.000<0.01) between the dependency distances in English writings by ChinesepostgraduatesofEnglishmajor(2.461±2.614)andthoseinfourcontrastivesub-corpora:WSJ1(2.532±3.056),WSJ2(2.516±2.952),WSJ3(2.625±3.296)andWSJ4(2.545±3.097)

TheMDDofhigh-levelChineseEFL learners(postgraduateofEnglishmajor)hasn’treachedthelevelofEnglishnativespeakers.

Page 24: Minimiza’on and Probability Distribu’on of Dependency ... · (2.532±3.056), WSJ2 (2.516±2.952), WSJ3 (2.625±3.296)and WSJ4 (2.545±3.097) The MDD of high-level Chinese EFL

MDDofinterlanguageandrandomlanguages

ThetworandomlanguageshavemuchgreaterMDDsthannaturallanguage(NL)ofChineselearners.Ofthetworandomlanguages,RL2hasasmallerMDDthanRL1.

Page 25: Minimiza’on and Probability Distribu’on of Dependency ... · (2.532±3.056), WSJ2 (2.516±2.952), WSJ3 (2.625±3.296)and WSJ4 (2.545±3.097) The MDD of high-level Chinese EFL

Thedistribu'onofdependencydistancesofNL(Ouyang&Jiang,2017)

Theprobabilitydistribu6onofdependencydistance of second language learners’interlanguage well fits the Zipf-Alekseevdistribu6onandtheparametersaandbinthe Zipf-Alekseev distribu6on well reflectsecond language learners’ languageproficiencyatdifferentlearningstages.

Page 26: Minimiza’on and Probability Distribu’on of Dependency ... · (2.532±3.056), WSJ2 (2.516±2.952), WSJ3 (2.625±3.296)and WSJ4 (2.545±3.097) The MDD of high-level Chinese EFL

Thedistribu'onofdependencydistancesofRL1

Thirteendistribu6oncurvesareall concave down. But thefiqng resultsofRL1 show thatthe dependency distances ofthirteen groups of RL1 cannotwell fit one same probabilitydistribu6on.

Page 27: Minimiza’on and Probability Distribu’on of Dependency ... · (2.532±3.056), WSJ2 (2.516±2.952), WSJ3 (2.625±3.296)and WSJ4 (2.545±3.097) The MDD of high-level Chinese EFL

Thedistribu'onofdependencydistancesofRL2

The thirteen distribu6on curves ofRL2areallconcavedown.The fiqng results show that thedependency distances of thirteengroups of RL2 can fit the followingprobability distribu6ons: Righttruncated modified Zipf-Alekseev(a, b; n=x-max, α fixed), Nega6vebinomial (k, p), Right truncatednega6ve binomial (k, p; R=x-max),Mixednega6vebinomial (k,p1,p2,α), InversePolya(a,k,p),Extendedposi6ve nega6ve binomial (k, p; αfixed),Mixedgeometric (q1,q2,α),and Mixed geometric-logarithmic(q,β,α).

Page 28: Minimiza’on and Probability Distribu’on of Dependency ... · (2.532±3.056), WSJ2 (2.516±2.952), WSJ3 (2.625±3.296)and WSJ4 (2.545±3.097) The MDD of high-level Chinese EFL

Althoughthedistribu6onofdependencydistancesofRL2canwellfittheRighttruncated modified Zipf-Alekseev distribu6on, the parameters have nocorrela6onwiththegrades.

Nocorrela'onbetweentheparameteraandthegrades(R2=0.350,p>0.05),nocorrela6onbetweentheparameterbandthegrades(R2=0.119,p>0.05)andnocorrela6onbetweentheparameterαandthegrades(R2=0.074,p>0.05).

The varia'ons of parameters (a, b, α) of the Right truncatedmodifiedZipf-Alekseevfi^ngthedependencydistancesofRL2

Page 29: Minimiza’on and Probability Distribu’on of Dependency ... · (2.532±3.056), WSJ2 (2.516±2.952), WSJ3 (2.625±3.296)and WSJ4 (2.545±3.097) The MDD of high-level Chinese EFL

Conclusions&implica'ons

Page 30: Minimiza’on and Probability Distribu’on of Dependency ... · (2.532±3.056), WSJ2 (2.516±2.952), WSJ3 (2.625±3.296)and WSJ4 (2.545±3.097) The MDD of high-level Chinese EFL

•  The MDD of Chinese EFL learners’ English wri6ngs increasessignificantlyacrossninegrades.

•  The MDD of high-level Chinese EFL learners (postgraduate ofEnglishmajor)doesn’treachthelevelofEnglishna6vespeakers.

•  TheMDDofChineseEFLlearners’Englishwri6ngsremainstableattheuniversitylevel.(fossiliza6on&limitofworkingmemoryload)

•  TheMDDsof learners’ interlanguage at different learningphasesaresignificantlylowerthantheircorrespondingrandomlanguages(RL1 and RL2). This indicates that Chinese EFL learners developtheirEnglishproficiencyunderthepressureofDDM.

•  RL2hasalowerMDDthanRL1,andnatural languagehasalowerMDDthanRL2,whichsuggeststhatsyntaxalsoplaysakeyroleinminimizing the MDD of second language learners’ interlanguagesystem.

Page 31: Minimiza’on and Probability Distribu’on of Dependency ... · (2.532±3.056), WSJ2 (2.516±2.952), WSJ3 (2.625±3.296)and WSJ4 (2.545±3.097) The MDD of high-level Chinese EFL

•  The distribu6on of dependency distances of RL1 of secondlanguage learners’ wri6ngs cannot fit any exponen6aldistribu6on or power law distribu6on models. However, thedistribu6on of dependency distances of RL2 and naturallanguagecanwellfittheZipf-Alekseevdistribu6on.Projec6vityisthebackgroundmechanismthatcausesthisphenomenon.

•  The parameters in the Zipf-Alekseev distribu6on of RL2 haveno correla6on with second language learners’ languageproficiency. This can be explained by syntax. Compared withnatural language, though RL2 is projec6ve as naturallanguages,itdoesn’tobeysyntac6crules.

Page 32: Minimiza’on and Probability Distribu’on of Dependency ... · (2.532±3.056), WSJ2 (2.516±2.952), WSJ3 (2.625±3.296)and WSJ4 (2.545±3.097) The MDD of high-level Chinese EFL

•  The current study corroborates that DDM is a languageuniversalnotonlypresentintheuseoffirstlanguage,butalsoin the use of second language. This helps clarify therela6onship between human cogni6on and second language.There is also a threshold that theMDDs of second languagedon’texceedanditiswithinworkingmemorycapacity.Studieson the dependency distances in rela6on to the cogni6vedemandsonhumancogni6vesystemwillremaintheresearchfocus for linguists or scholars in the field of cogni6on andpsycholinguis6cs.

Page 33: Minimiza’on and Probability Distribu’on of Dependency ... · (2.532±3.056), WSJ2 (2.516±2.952), WSJ3 (2.625±3.296)and WSJ4 (2.545±3.097) The MDD of high-level Chinese EFL

ThankYou!