How Does Syntactic Structure Manifest Itself Through Text Corpora: Ossetic Nominalization
description
Transcript of How Does Syntactic Structure Manifest Itself Through Text Corpora: Ossetic Nominalization
![Page 1: How Does Syntactic Structure Manifest Itself Through Text Corpora: Ossetic Nominalization](https://reader034.fdocuments.net/reader034/viewer/2022051517/56815a8b550346895dc80110/html5/thumbnails/1.jpg)
HOW DOES SYNTACTIC STRUCTURE MANIFEST ITSELF THROUGH TEXT CORPORA: OSSETIC NOMINALIZATION
Pavel Graschenkov,Institute of Oriental Culture (Moscow), [email protected] Malyutina, MSU, [email protected] Ionov, MSU, [email protected]
![Page 2: How Does Syntactic Structure Manifest Itself Through Text Corpora: Ossetic Nominalization](https://reader034.fdocuments.net/reader034/viewer/2022051517/56815a8b550346895dc80110/html5/thumbnails/2.jpg)
OVERVIEW
Methodology Ossetic basics Creating corpora Extracting data Evaluating data Interpreting data Conclusion
![Page 3: How Does Syntactic Structure Manifest Itself Through Text Corpora: Ossetic Nominalization](https://reader034.fdocuments.net/reader034/viewer/2022051517/56815a8b550346895dc80110/html5/thumbnails/3.jpg)
WHAT IS THIS ALL ABOUT?
• Syntactic researches are often made by questioning native speakers
• Sometimes speakers don’t express clear preference for a specific surface structure
• Corpora-oriented studies could help• Some languages have problems with corpora
studies
![Page 4: How Does Syntactic Structure Manifest Itself Through Text Corpora: Ossetic Nominalization](https://reader034.fdocuments.net/reader034/viewer/2022051517/56815a8b550346895dc80110/html5/thumbnails/4.jpg)
OSSETIC AND CORPORA STUDIES
Problems: No tagged corpora No e-dictionaries or tag sets
But: Rich morphology
⇒ we can rely on affixation Well-developed literature tradition
⇒ large text array
![Page 5: How Does Syntactic Structure Manifest Itself Through Text Corpora: Ossetic Nominalization](https://reader034.fdocuments.net/reader034/viewer/2022051517/56815a8b550346895dc80110/html5/thumbnails/5.jpg)
OSSETIC AND CORPORA STUDIES
Research strategy: Searching untagged corpora Subsequent supervised filtration Manual tagging the results
![Page 6: How Does Syntactic Structure Manifest Itself Through Text Corpora: Ossetic Nominalization](https://reader034.fdocuments.net/reader034/viewer/2022051517/56815a8b550346895dc80110/html5/thumbnails/6.jpg)
BASICS INFORMATION ABOUT OSSETIC
Iranian language Mostly synthetic 9 grammatical cases
Marked by suffixes No accusative case
Morphosyntactic alignment: nominative-accusative
![Page 7: How Does Syntactic Structure Manifest Itself Through Text Corpora: Ossetic Nominalization](https://reader034.fdocuments.net/reader034/viewer/2022051517/56815a8b550346895dc80110/html5/thumbnails/7.jpg)
BASICS INFORMATION ABOUT OSSETIC
Unmarked case: nominative Direct Object case: nominative or genitive
Nominalizations are formed by –yn– suffix
![Page 8: How Does Syntactic Structure Manifest Itself Through Text Corpora: Ossetic Nominalization](https://reader034.fdocuments.net/reader034/viewer/2022051517/56815a8b550346895dc80110/html5/thumbnails/8.jpg)
OSSETIC NOMINALIZATION: PROBLEMS
Theoretical problems:1. How much VP structure is involved in it2. How DP structure influences nominalization
In Ossetic: Both problems are topical, because –yn– forms
are homonymous between infinitives and nominalizations
![Page 9: How Does Syntactic Structure Manifest Itself Through Text Corpora: Ossetic Nominalization](https://reader034.fdocuments.net/reader034/viewer/2022051517/56815a8b550346895dc80110/html5/thumbnails/9.jpg)
OSSETIC NOMINALIZATION: PROBLEMS
1. Nominal:…iron ævzag ahwyr kænyn-yOssetic language study-ING-GENraydayæn etap…beginning stagethe first stage of studying Ossetic
2. Infinitival:…raidydta ahwyr kænynhe-started study-INGmatematikon naukæ-tæ…mathematical science-PLhe began studying mathematical sciences
![Page 10: How Does Syntactic Structure Manifest Itself Through Text Corpora: Ossetic Nominalization](https://reader034.fdocuments.net/reader034/viewer/2022051517/56815a8b550346895dc80110/html5/thumbnails/10.jpg)
OSSETIC NOMINALIZATION: ARGUMENTS
According to native speakers’ judgements: Both external and internal arguments participate
in nominalizations Flexible word order in simple predication Strict left branching in noun phrases
So direct questioning doesn’t clarify: Arguments that are in argument list Directionality of branching
![Page 11: How Does Syntactic Structure Manifest Itself Through Text Corpora: Ossetic Nominalization](https://reader034.fdocuments.net/reader034/viewer/2022051517/56815a8b550346895dc80110/html5/thumbnails/11.jpg)
OSSETIC NOMINALIZATION: ORDERING
fyd-y sævæg daw-yn-father-GEN scythe sharp-ING
fyd-y daw-yn- sævægfather-GEN sharp-ING scythe
daw-yn- fyd-y sævægsharp-ING father-GEN scythe
daw-yn- sævæg fyd-ysharp-ING scythe father-GEN
All these orderings were attested by native speakers
![Page 12: How Does Syntactic Structure Manifest Itself Through Text Corpora: Ossetic Nominalization](https://reader034.fdocuments.net/reader034/viewer/2022051517/56815a8b550346895dc80110/html5/thumbnails/12.jpg)
OSSETIC NOMINALIZATION: BASIS
Artemis Alexiadou, 2004: Nominalizations are always merged under the
same structure Syntactic material is the same, differences
are in phi-features Differentiation of the phi-features is induced
by external context Every feature set forces specific internal
configuration
![Page 13: How Does Syntactic Structure Manifest Itself Through Text Corpora: Ossetic Nominalization](https://reader034.fdocuments.net/reader034/viewer/2022051517/56815a8b550346895dc80110/html5/thumbnails/13.jpg)
OSSETIC NOMINALIZATION: V VS. N
Two most prominent patterns are nominal and verbal one
Nominal: Merged under postpositions and in noun phrases Acquire all properties of noun phrases
Able to assign Gen to their subject Shouldn’t exhibit word order permutation
Verbal: Merged under modals and phrase verbs
Do not have own subjects Exhibit word order dependency on the information
structure
![Page 14: How Does Syntactic Structure Manifest Itself Through Text Corpora: Ossetic Nominalization](https://reader034.fdocuments.net/reader034/viewer/2022051517/56815a8b550346895dc80110/html5/thumbnails/14.jpg)
OSSETIC NOMINALIZATION: HYPOTHESIS
We expect to observe the following distributional properties: No difference in number or marking of
arguments Differentiation in surface string ordering:
Nominal contexts: strict left branching Verbal contexts: flexible ordering
These two statements were chosen for testing by corpora method
![Page 15: How Does Syntactic Structure Manifest Itself Through Text Corpora: Ossetic Nominalization](https://reader034.fdocuments.net/reader034/viewer/2022051517/56815a8b550346895dc80110/html5/thumbnails/15.jpg)
EXTRACTING DATA: CORPUS & TOOLS
Corpus: Consisted of 1.3 million words Modern fiction and press
Extraction: Indexing text array Querying:
Word1 + distance span + word2 Word1 and word2 are regular expressions
![Page 16: How Does Syntactic Structure Manifest Itself Through Text Corpora: Ossetic Nominalization](https://reader034.fdocuments.net/reader034/viewer/2022051517/56815a8b550346895dc80110/html5/thumbnails/16.jpg)
EXTRACTING DATA: THE PROCESS
Initially extracted ~20 000 sentences Examples with 8 most frequent verbs were
chosen Verb Translation
‘arazyn’ make
‘zuryn ’ say
‘sæwyn ’ go
‘hwydy kænyn ’ think
‘maryn ’ kill
‘særyn ’ live
‘ahwyr kænyn ’ study
‘pajda kænyn’ use
![Page 17: How Does Syntactic Structure Manifest Itself Through Text Corpora: Ossetic Nominalization](https://reader034.fdocuments.net/reader034/viewer/2022051517/56815a8b550346895dc80110/html5/thumbnails/17.jpg)
EXTRACTING DATA: THE PROCESS
Distinguishing V from N: Genitive forms as the examples of nominal
contexts Constructions with ‘start / begin’, ‘want’ and
‘need’ as the examples of verbal contexts ~700 contexts were left after filtering They were manually translated and tagged:
Context
Presence of subject
Presence of direct object
Directionality of branching
… … … …
![Page 18: How Does Syntactic Structure Manifest Itself Through Text Corpora: Ossetic Nominalization](https://reader034.fdocuments.net/reader034/viewer/2022051517/56815a8b550346895dc80110/html5/thumbnails/18.jpg)
EVALUATING RESULTS
Total: 668 instances 355 nominal contexts 313 verbal contexts
![Page 19: How Does Syntactic Structure Manifest Itself Through Text Corpora: Ossetic Nominalization](https://reader034.fdocuments.net/reader034/viewer/2022051517/56815a8b550346895dc80110/html5/thumbnails/19.jpg)
EVALUATING RESULTS: SUBJECTS
Only 7 examples All in nominal contexts ⇒ They are
pragmatically introduced participants, not true arguments
![Page 20: How Does Syntactic Structure Manifest Itself Through Text Corpora: Ossetic Nominalization](https://reader034.fdocuments.net/reader034/viewer/2022051517/56815a8b550346895dc80110/html5/thumbnails/20.jpg)
EVALUATING RESULTS: DIRECT OBJECTS
P resenc e of objec t
79,1
3%
73,1
4%
20,8
7%
26,8
6%
Nominal V erbal
With objec t Without objec t
Total: 291 context Nominal: 163 = 79% Verbal: 128 = 73% Paired t-test (amount of
subjects of each verb in nominal vs. verbal contexts):
t(5) = 0.34, p > 0.1⇒ no significant difference
![Page 21: How Does Syntactic Structure Manifest Itself Through Text Corpora: Ossetic Nominalization](https://reader034.fdocuments.net/reader034/viewer/2022051517/56815a8b550346895dc80110/html5/thumbnails/21.jpg)
No nominalizations with both subject and direct object have been attested
EVALUATING RESULTS: SUBJECT WITH DO
![Page 22: How Does Syntactic Structure Manifest Itself Through Text Corpora: Ossetic Nominalization](https://reader034.fdocuments.net/reader034/viewer/2022051517/56815a8b550346895dc80110/html5/thumbnails/22.jpg)
EVALUATING RESULTS: BRANCHING
Left branching was met: In 98% of nominal contexts In 65% of verbal contexts
Yates-corrected chi-square test (nominal and verbal context in the amount of examples with left vs. right branching):
p<.001⇒ significant difference
![Page 23: How Does Syntactic Structure Manifest Itself Through Text Corpora: Ossetic Nominalization](https://reader034.fdocuments.net/reader034/viewer/2022051517/56815a8b550346895dc80110/html5/thumbnails/23.jpg)
INTERPRETING DATA: ARGUMENT STRUCTURE
Two observations can be done:1. Both types of nominalizations lack subject on
argument list2. Direct objects are equally frequent in both
types of nominalization⇒ Argument structures are the same
![Page 24: How Does Syntactic Structure Manifest Itself Through Text Corpora: Ossetic Nominalization](https://reader034.fdocuments.net/reader034/viewer/2022051517/56815a8b550346895dc80110/html5/thumbnails/24.jpg)
INTERPRETING DATA: WORD ORDER
Nominal contexts are strictly left branching > 1 / 3 of infinitival contexts are right
branching
Explanation: Nominalizations in nominal contexts do not allow
pragmatically driven scrambling (like in regular DPs)
Infinitival nominalizations are not restricted in this option
Branching directionality depends on phi-features supplied by external context, internal structure is the same
![Page 25: How Does Syntactic Structure Manifest Itself Through Text Corpora: Ossetic Nominalization](https://reader034.fdocuments.net/reader034/viewer/2022051517/56815a8b550346895dc80110/html5/thumbnails/25.jpg)
CONCLUSION
Ossetic nominalizations do not project external arguments
Their argument structure can include only direct object
The internal structure of nominalization is a function of the context where it was merged
![Page 26: How Does Syntactic Structure Manifest Itself Through Text Corpora: Ossetic Nominalization](https://reader034.fdocuments.net/reader034/viewer/2022051517/56815a8b550346895dc80110/html5/thumbnails/26.jpg)
ACKNOWLEDGEMENTS
We are very grateful to all our colleagues and especially to Anastasia Garejshina and Lidia Kirpo for their help on collecting text corpora and to the chiefs of the expedition, Sergei Tatevosov and Ekaterina Lyutikova, for their assistance both in and outside linguistics.
![Page 27: How Does Syntactic Structure Manifest Itself Through Text Corpora: Ossetic Nominalization](https://reader034.fdocuments.net/reader034/viewer/2022051517/56815a8b550346895dc80110/html5/thumbnails/27.jpg)
THANK YOU!