L-R Feature Structure Unification Syntactic Parser
Richard Caneba
RPI Cognitive Science Department
Human-Level Intelligence Laboratory
Intuitions
• An interpretive grammar views syntax as finding the most appropriate sequence of head and dependency relationships between phrases and words.
• Language understanding occurs (roughly) left to right.
• Syntactic trees have a flat structure that gives no syntactic preference to sequences of adjunctive modifiers of the same category (adjectives, adverbs, modifying prepositional phrases).
• We can infer a number of things immediately from the perception of a word, although by no means all things.
Intuitions cont’d
• Many patterns exist in natural language; some can be treated deterministically, while others must be treated defeasibly/probabilistically.
• Reliably deterministic:
  • [Det N] => NP[Det N]
  • [Adj N] => NP[Adj N]
• Defeasible:
  • [V NP NP…] =(<1.0)=> VP[V NP NP…]
  • [V NP NP…] =(<1.0)=> VP[V NP[NP…]…]
• Attempt search ONLY if there is a genuine ambiguity as to what the next step in a L-R parse should be (a sketch of this policy follows below):
  • Second object vs. relative-clause modifier in a ditransitive context
  • Prepositional phrase attachment
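To make the deterministic/defeasible split concrete, here is a minimal Python sketch of that dispatch policy. It is illustrative only, not the parser's code: the Rule class, the rule inventory, and the confidence values are all assumptions.

  from dataclasses import dataclass

  @dataclass
  class Rule:
      pattern: tuple       # categories expected at the right edge of the parse
      result: str          # category the matched span is rewritten to
      confidence: float    # 1.0 = reliably deterministic; <1.0 = defeasible

  RULES = [
      Rule(("Det", "N"), "NP", 1.0),       # [Det N] => NP[Det N]
      Rule(("Adj", "N"), "NP", 1.0),       # [Adj N] => NP[Adj N]
      Rule(("V", "NP", "NP"), "VP", 0.7),  # ditransitive reading (made-up weight)
      Rule(("V", "NP", "NP"), "VP", 0.3),  # NP-internal reading (made-up weight)
  ]

  def next_step(frontier):
      # Apply a lone deterministic rule outright; search only on genuine ambiguity.
      matches = [r for r in RULES if tuple(frontier[-len(r.pattern):]) == r.pattern]
      if len(matches) == 1 and matches[0].confidence == 1.0:
          return ("apply", matches[0])   # no search needed
      if matches:
          return ("search", sorted(matches, key=lambda r: -r.confidence))
      return ("shift", None)             # no rule fires; consume the next word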
Feature Structure Unification
• A traditional challenge with the HPSG theory of grammar is that, in order to preserve the recursiveness of its grammar rules, it is forced into a “right-branching” structure that posits an additional feature-structure node for each dependency-head relationship the theory posits.
• This is to some extent cognitively unrealistic:
  • It posits an unnecessary amount of structure for a syntactic parse.
  • Intuitively, there is no syntactic distinction to be made between sequences of adjuncts (it is hard to tell the difference between “the angry green dog” and “the green angry dog”).
Lexical Representation of Syntax
• Each word posits a sequence of head-dependency relationships that form a “phrasal chain.”
• These chains are based on the notion that we can immediately infer some head-dependency relationships from the syntactic category of the word.
• Roughly, each node in a chain is one of three types (not explicitly defined in the lexicon, but nonetheless present; see the sketch below):
  • Word level (WordUtteranceEvent)
  • Dependency level (PhraseUtteranceEvent)
  • Head level (PhraseUtteranceEvent)
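As a rough sketch of these three node types: the class names WordUtteranceEvent and PhraseUtteranceEvent are from the slides, while the fields and the Chain wrapper are assumptions for illustration.

  from dataclasses import dataclass, field

  @dataclass
  class WordUtteranceEvent:
      phon: str                 # the word form heard, e.g. "dog"
      is_a: str                 # lexical category, e.g. "CommonNoun"

  @dataclass
  class PhraseUtteranceEvent:
      cand_types: set = field(default_factory=set)     # candidate head types
      part_of: "PhraseUtteranceEvent | None" = None    # dominating node, if any

  @dataclass
  class Chain:
      word: WordUtteranceEvent               # word level
      dependency: PhraseUtteranceEvent       # dependency level
      head: "PhraseUtteranceEvent | None"    # head level (absent for bare adjuncts)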
Lexical Representation of Syntax • Let’s do a quick example to show the lexical syntactic
representation:
• “the angry dog”
• With part-of-speech tags, that is:
• [Det the][Adj angry][N dog].
• The representation in di-graph form:
Lexical Representation of Syntax
[Figure: Syntactic Entry for a Common Noun, in di-graph form. The WordUtteranceEvent (Phon “dog”, IsA CommonNoun) is PartOf an NP-level PhraseUtteranceEvent (IsA Noun, CandType Noun) that carries a Specifier link to a posited Determiner node; that NP is in turn PartOf a head-level PhraseUtteranceEvent whose candidate types (CandType) include Verb and Preposition.]
Lexical Representation of Syntax
[Figure: Syntactic Entry for an Adjective. The WordUtteranceEvent (Phon “angry”, IsA Adjective) is PartOf a single PhraseUtteranceEvent (IsA Noun, CandType Noun); no head level is posited.]
NOTE: we will need to posit a dependency layer to account for adverbs that modify the adjective, e.g. “really big”.
Lexical Representation of Syntax
[Figure: Syntactic Entry for a Determiner. The WordUtteranceEvent (Phon “the”, IsA Determiner) is PartOf an NP-level PhraseUtteranceEvent (IsA Noun, CandType Noun), which is PartOf a head-level PhraseUtteranceEvent whose candidate types include Verb and Preposition.]
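Putting the three entries together, a constructive sketch using the hypothetical Chain classes above; the candidate-type sets are read off the preceding figures.

  def det_entry(form):
      np = PhraseUtteranceEvent(cand_types={"Noun"})
      xp = PhraseUtteranceEvent(cand_types={"Verb", "Preposition"})
      np.part_of = xp
      return Chain(WordUtteranceEvent(form, "Determiner"), np, xp)

  def adj_entry(form):
      np = PhraseUtteranceEvent(cand_types={"Noun"})   # no head level posited (yet)
      return Chain(WordUtteranceEvent(form, "Adjective"), np, None)

  def noun_entry(form):
      np = PhraseUtteranceEvent(cand_types={"Noun"})
      xp = PhraseUtteranceEvent(cand_types={"Verb", "Preposition"})
      np.part_of = xp
      return Chain(WordUtteranceEvent(form, "CommonNoun"), np, xp)

  chains = [det_entry("the"), adj_entry("angry"), noun_entry("dog")]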
Grammar Rules
• In our example, we need at least two rules:
  • One that unifies the structures posited by the determiner with the structures posited by the common noun
  • One that unifies the structures posited by the adjective with either the determiner or the noun
• Let’s consider this from L-R (a sketch follows the list):
  • First, unify the Det-NP-XP structure chain with the Adj-NP structure chain
  • Next, unify that resulting structure chain with the N-NP-XP structure chain
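A hedged sketch of those two unifications, continuing the hypothetical classes above. unify_nodes() here merely intersects candidate types and marks the two nodes as the same; real feature-structure unification would merge full feature sets.

  def unify_nodes(a, b):
      merged = a.cand_types & b.cand_types
      if not merged:
          raise ValueError("unification failure")
      a.cand_types = b.cand_types = merged   # the two nodes now count as one ("Same")
      return a

  def unify_det_adj(det_chain, adj_chain):
      # Identify the Det's NP node with the Adj's NP node; expose the result
      # as a Det chain (see the later slides for why).
      unify_nodes(det_chain.dependency, adj_chain.dependency)
      return det_chain

  def unify_det_noun(det_chain, noun_chain):
      # Identify NP with NP and XP with XP; the noun takes the Det as specifier.
      unify_nodes(det_chain.dependency, noun_chain.dependency)
      unify_nodes(det_chain.head, noun_chain.head)
      return noun_chain

  np = unify_det_noun(unify_det_adj(chains[0], chains[1]), chains[2])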
Grammar Rules
• Determiner-Adjective Rule
[Figure: The determiner’s chain (Phon “the” → NP, IsA Noun → XP, CandType Verb/Preposition) shown alongside the adjective’s chain (Phon “angry” → NP, IsA Noun, CandType Noun).]
Grammar Rules
• Determiner-Adjective Rule
[Figure: The same two chains, now with a Same link identifying the determiner’s NP node with the adjective’s NP node.]
Grammar Rules
• Determiner-Adjective Rule
[Figure: The unified result: “the” and “angry” hang off a single NP node (IsA Noun), which remains PartOf the XP with candidate types Verb/Preposition.]
Grammar Rules
• We would like to allow anywhere from zero to arbitrarily many adjectives to stand between the determiner and the noun that selects the determiner as its specifier.
• We can achieve this by explicitly stating that whenever a Det chain and an Adj chain are unified, the result is exposed as a determiner on the right wall of the growing parse, as opposed to an adjective (see the sketch below).
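That choice is what makes iteration work: since each Det-Adj unification is exposed as a Det again, the same rule keeps firing. A one-function sketch on top of the hypothetical helpers above:

  def absorb_adjectives(det_chain, adj_chains):
      for adj in adj_chains:                          # "the angry green ... dog"
          det_chain = unify_det_adj(det_chain, adj)   # result is still "a Det"
      return det_chain                                # ready to meet the noun's chain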
Grammar Rules
• Determiner-Adjective Resulting Structure
[Figure: The merged Det-Adj structure: one NP node (IsA Noun) dominating both “the” and “angry”, PartOf an XP with candidate types Verb/Preposition.]
Grammar Rules
• Determiner-Adjective Resulting Structure + NP
[Figure: The merged Det-Adj structure shown alongside the common noun’s chain for “dog” (Phon “dog”, IsA CommonNoun → NP, CandType Noun → XP, CandType Verb/Preposition), ready to be unified with it.]
Grammar Rules
• Expose the resulting structure from the Det-Adj unification as just the Det structure:
[Figure, three frames: a Det-Adj-N schematic in which the Det-Adj chain (NP over XP) sits on the Border of the parse built so far and the noun’s chain (NP over XP) sits on the Frontier; Same links then identify NP with NP and XP with XP, and the noun’s Spr (specifier) requirement is satisfied by the Det.]
Grammar Rules
<!-- Pre-head Adjective Modifier w/ Det: Shift Border -->
<constraint shouldFalsify="false">
  Border(?ba, ?t0, ?w) ^
  Border(?bb, ?t0, ?w) ^
  Frontier(?fa, ?t1, ?w) ^
  Frontier(?fb, ?t1, ?w) ^
  Meets(?t0, ?t1, E, ?w) ^
  PartOf(?ba, ?bb, E, ?w) ^
  PartOf(?fa, ?fb, E, ?w) ^
  IsA(?ba, Determiner, E, ?w) ^
  IsA(?bb, Noun, E, ?w) ^
  IsA(?fa, Adjective, E, ?w) ^
  IsA(?fb, Noun, E, ?w)
  ==>
  Same(?bb, ?fb, E, ?w) ^
  Border(?ba, ?t1, ?w)
</constraint>

<!-- Subcategorization Rules: NP Specifier -->
<constraint shouldFalsify="false">
  Border(?ba, ?t0, ?w) ^
  Border(?bb, ?t0, ?w) ^
  Frontier(?fa, ?t1, ?w) ^
  Frontier(?fb, ?t1, ?w) ^
  Meets(?t0, ?t1, E, ?w) ^
  PartOf(?ba, ?bb, E, ?w) ^
  PartOf(?fa, ?fb, E, ?w) ^
  IsA(?ba, Determiner, E, ?w) ^
  IsA(?bb, Noun, E, ?w) ^
  Specifier(?fa, ?spr, E, ?w) ^
  IsA(?spr, Determiner, E, ?w) ^
  IsA(?fb, Noun, E, ?w) ^
  Heard(?wue, E, ?w) ^
  IsA(?wue, WordUtteranceEvent, ?t1, ?w)
  ==>
  Same(?ba, ?spr, E, ?w) ^
  Same(?bb, ?fb, E, ?w) ^
  Border(?wue, ?t1, ?w) ^
  _NPSPR(?ba, ?bb, ?fa, ?fb, E, ?w)
</constraint>
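Reading these rules: Border marks nodes on the right wall of the structure built so far (interval ?t0), Frontier marks nodes on the left edge of the newly heard chain (interval ?t1), and Meets requires the two intervals to be adjacent; the Same literals in the consequent perform the unification. A loose Python rendering of the first rule, reusing the hypothetical unify_nodes() from the earlier sketch (an illustration, not the lab's rule engine):

  def shift_border_det_adj(border, frontier):
      # border, frontier: ((node, cat), (node, cat)), inner node first,
      # where the first element is PartOf the second.
      (ba, ba_cat), (bb, bb_cat) = border
      (fa, fa_cat), (fb, fb_cat) = frontier
      if (ba_cat, bb_cat, fa_cat, fb_cat) == ("Determiner", "Noun", "Adjective", "Noun"):
          unify_nodes(bb, fb)   # Same(?bb, ?fb)
          return border         # Border(?ba, ?t1): the Det chain stays exposed
      return None               # antecedent unmet; rule does not fire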
Grammar Rules
[send] [john] [a] [message] [that] [says] [“hi”].
Grammar Rules
[V send] [N john] [Det a] [N message] [RelP that] [V says] [Q “hi”].
Grammar Rules
[V send] [N john] [Det a] [N message] [RelP that] [V says] [Q “hi”].
[Figure: the phrasal chain (NP, VP, and XP nodes) posited above each tagged word.]
Grammar Rules
[V send] [N john] [Det a] [N message] [RelP that] [V says] [Q “hi”].
[Figure: a later frame of the same parse; adjacent chains are being unified left to right.]
Grammar Rules
[V send] [N john] [Det a] [N message] [RelP that] [V says] [Q “hi”].
[Figure: the final unified structure, reduced to the remaining NP and VP nodes.]
Grammar Rules
• Benefits of this feature-structure unification parse:
  • It captures the intuition that when we hear a word and posit its feature structure, we can infer the existence not only of the word’s direct feature structure (usually generated by lexical rules), but also of additional structures, their head/dependency relationships, and some definition of the values in those structures.
  • Ambiguities (e.g. the head of an NP) are resolved from L-R through lazy definitions and unification of under-defined structures with well-defined structures in terms of particular features.
  • It posits no more structure in the parse tree than is necessary to reflect a parse, whereas theories like HPSG posit a large number of structures in a branching tree in order to preserve the recursivity of their grammar rules.
  • We have shown that with feature structure unification, at least in theory, we can preserve the recursivity of many of the rules without requiring a left- or right-branching structure.
  • All of the structure necessary to build a parse is known from the beginning.
Grammar Rules!
• The future:
  • Ungrammaticality: when objects aren’t where they are supposed to be, search for a likely head-dependency relationship
    • Missing arguments: “Car is big.”
    • Extra words (it is rare for full content words to be considered extra, but it occurs in natural language: “I saw the, um, car.”)
    • Dependents out of order: “Give the car me.”
    • Dangling dependents
  • This will require a good branch-and-bound system that only performs search when what is reasonably expected/predicted is violated.
  • Give a feature-structure unification account of garden-path sentences
    • Should be fairly natural given the L-R predictive nature of the parser
  • Attach a semantic representation that generates word sense based on head-dependency relationships.
    • Syntax should be closely tied to semantics, in that each helps compute the other to varying degrees.
  • Examine discourse from a syntactic perspective, and syntax from a discourse perspective, and use the two to disambiguate simultaneously.
Notes on Theory (boring)
• By having a lexical representation that is closely tied to the syntax, a number of advantages fall out:
  • Parsimony: by allowing a lot of information to be loosely defined or undefined at the lexical level, we do not need to posit additional lexical entries to cover all possible configurations of a phrase’s arguments, nor do we need an excessive number of lexical rules to generate these representations.
  • Generativity: a word’s sense is at least in part generated by its relationship to its dependents and head, and the semantic/syntactic types of those dependents/heads can in theory compute a word’s sense on the fly (inspired by GL theory from Pustejovsky).
  • Context embedding: by tying your theory of the lexicon closely to syntactic theory, you move toward embedding your lexical representation in a cognitive system that is closely tied to the way words are ACTUALLY used.
Lexical Mosaics
• Thus, we can see that the sense of a word comes from a number of different sources:
  • Memory
  • Syntactic context
  • Pragmatic/discourse factors
• The hope for future research is to tie these together in an organized way, yielding a theory of lexical representation that is tied closely to these factors in a computable and tractable manner.
• Early goals:
  • Compute word senses from syntactic context + memory (very difficult)
  • Use syntactic context to disambiguate lexical ambiguity
  • Use generative word sense to disambiguate syntactic ambiguity
  • Simultaneously attempt to give a computational account of lexical memory, syntactic parsing, and pragmatics/discourse.