Midterm Review
CS4705
Natural Language Processing
• Statistical v. Symbolic Processing
  – 80/20 Rule
• Regular Expressions
• Finite State Automata
  – Determinism v. non-determinism
  – (Weighted) Finite State Transducers
• Morphology
  – Word Classes
  – Inflectional v. Derivational
  – Affixation, infixation, concatenation
  – Morphotactics
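
To make the regular expression and deterministic FSA items above concrete, here is a minimal sketch. It assumes a toy "sheep language" (the strings b, then two or more a's, then !), which is an illustrative choice and does not appear on the slides. The names TRANSITIONS, ACCEPTING, dfa_accepts, and SHEEP_RE are hypothetical.

```python
import re

# Transition table for a deterministic FSA over the toy pattern baa+!
# (assumed example; states are numbered 0..4, with 0 as the start state).
TRANSITIONS = {
    (0, "b"): 1,
    (1, "a"): 2,
    (2, "a"): 3,
    (3, "a"): 3,  # loop: any number of additional a's
    (3, "!"): 4,
}
ACCEPTING = {4}


def dfa_accepts(text: str) -> bool:
    """Run the DFA over text; reject as soon as no transition is defined."""
    state = 0
    for ch in text:
        state = TRANSITIONS.get((state, ch))
        if state is None:
            return False
    return state in ACCEPTING


# The same language written as a regular expression.
SHEEP_RE = re.compile(r"baa+!")
```

Because the machine is deterministic, each (state, symbol) pair has at most one successor, so recognition needs no backtracking. A non-deterministic machine would map each pair to a set of states instead.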
• Morphological parsing
  – Koskenniemi’s two-level morphology
  – Porter stemmer
• Minimum Edit Distance (Levenshtein)
• N-grams
  – Markov assumption
  – Chain Rule
  – Language Modeling
    • Simple, Adaptive, Class-based (syntax-based), bursty
  – Smoothing
    • Add-one, Witten-Bell, Good-Turing
  – Back-off
  – Perplexity, Entropy
• Maximum Likelihood Estimation
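
As a refresher on the Minimum Edit Distance item above, here is a minimal sketch of the usual dynamic-programming computation. It assumes unit cost for insertion and deletion. The substitution cost is left as a parameter because some presentations charge 2 for a substitution. The names min_edit_distance and sub_cost are illustrative and do not come from the slides.

```python
def min_edit_distance(source: str, target: str, sub_cost: int = 1) -> int:
    """Edit distance by dynamic programming (insert/delete cost assumed to be 1)."""
    n, m = len(source), len(target)
    # dist[i][j] = cheapest way to turn source[:i] into target[:j]
    dist = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        dist[i][0] = i  # delete every character of source[:i]
    for j in range(1, m + 1):
        dist[0][j] = j  # insert every character of target[:j]
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            same = source[i - 1] == target[j - 1]
            dist[i][j] = min(
                dist[i - 1][j] + 1,                             # deletion
                dist[i][j - 1] + 1,                             # insertion
                dist[i - 1][j - 1] + (0 if same else sub_cost),  # match or substitution
            )
    return dist[n][m]
```

With sub_cost=1 this is the Levenshtein distance. Setting sub_cost=2 treats a substitution as a deletion plus an insertion.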
• Syntax
  – Chomsky’s view: Syntax is cognitive reality
  – Parse Trees
    • Dependency Structure
  – Part-of-Speech Tagging
    • Hand Written Rules v. Statistical v. Hybrid
    • Brill Tagging
  – Types of Ambiguity
• Context Free Grammars
  – Top-down v. Bottom-up Derivations
    • Left Corners
  – Grammar Equivalence
  – Normal Forms (CNF)
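
One reason Chomsky Normal Form matters is that bottom-up chart recognizers such as CYK assume every rule is either A -> B C or A -> word. The sketch below is a minimal, non-probabilistic CYK recognizer over a toy grammar. The grammar, the BINARY and LEXICAL tables, and the function name cyk_recognize are all hypothetical and chosen only for illustration.

```python
from itertools import product

# Toy grammar in CNF (assumed for illustration only).
BINARY = {  # rules of the form A -> B C, indexed by (B, C)
    ("NP", "VP"): {"S"},
    ("Det", "N"): {"NP"},
    ("V", "NP"): {"VP"},
}
LEXICAL = {  # rules of the form A -> word, indexed by word
    "the": {"Det"},
    "dog": {"N"},
    "cat": {"N"},
    "chased": {"V"},
}


def cyk_recognize(words, start="S"):
    """Return True if the toy grammar derives the word sequence from start."""
    n = len(words)
    if n == 0:
        return False
    # table[i][j] holds the nonterminals that span words[i:j]
    table = [[set() for _ in range(n + 1)] for _ in range(n + 1)]
    for i, word in enumerate(words):
        table[i][i + 1] = set(LEXICAL.get(word, ()))
    for span in range(2, n + 1):          # widths of increasing size
        for i in range(n - span + 1):
            j = i + span
            for k in range(i + 1, j):     # every split point inside the span
                for left, right in product(table[i][k], table[k][j]):
                    table[i][j] |= BINARY.get((left, right), set())
    return start in table[0][n]
```

A probabilistic variant would store the best probability per nonterminal in each cell instead of a plain set.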
• Probabilistic Parsing
  – (p)CYK, Earley Parsing
  – Derivational Probability
  – Lexicalization
  – Classification
  – Supertagging
• Machine Learning
  – Dependent v. Independent variables
  – Training v. Development Test v. Test sets
  – Feature Vectors
  – Metrics
    • Accuracy
    • Precision, Recall, F-Measure
  – Gold Standards
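
The metrics listed above can be sketched as follows. This assumes the gold standard and the system output are both given as sets of items (for example, labeled spans), which is one common setup and not something the slides specify. The function name precision_recall_f and the beta parameter are illustrative.

```python
def precision_recall_f(gold, predicted, beta=1.0):
    """Precision, recall, and F-measure of predicted items against a gold set."""
    gold, predicted = set(gold), set(predicted)
    true_positives = len(gold & predicted)
    # Guard against empty sets so the ratios are always defined.
    precision = true_positives / len(predicted) if predicted else 0.0
    recall = true_positives / len(gold) if gold else 0.0
    if precision == 0.0 and recall == 0.0:
        return precision, recall, 0.0
    b2 = beta * beta
    f_measure = (1 + b2) * precision * recall / (b2 * precision + recall)
    return precision, recall, f_measure
```

With beta=1 the F-measure is the harmonic mean of precision and recall. Values of beta above 1 weight recall more heavily, and values below 1 favor precision.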
• Semantics
  – Meaning Representations
  – Semantic Roles, Subcategorization frames
  – FOPC
    • Pros
    • Cons
  – Temporal Representations
    • Reichenbach
  – Aspect
  – Beliefs, Desires, Intention Representation
  – Syntax-driven semantics