Use of CFGs in NLP
Syntactic parsing
Constituencies vs Dependencies parsing
Constituency parsing: Infers the structure of the phrases in a sentence
Dependency parsing: infers the structure of the words’ dependencies
Transformation into normal form
Constituency parsing
Downstream tasks(tareas posteriores) based on parsing
Named entity recognition in complex domains
Relationship extraction: both for semantic and temporal relations
Coreference resolution: to identify candidate matching references
Opinion mining regarding aspects of products or similar
Machine translation, to analyze the source sentence
Quiestion answering, particularly in high-precision scenarios
Attachment ambiguity
Key parsing problem
Correct attachment of the various constituents in a sentence, such as prepositional phrases, adverbial phrases, infinitives, ….
How to find the correct attachment?
Potential attachments grow exponentially with number n of constituents
Limitations of standard PCFGs
-PCFGs assume that the plausibility of structures is independent of the words used, i.e., each rule has a fixed probability
-However, specific words may make certain rules particularly (un)likely
Dependency Grammar
Graph representation of dependency grammar
-Each node is a token
- An edge connects a head with a dependent node
- The nodes and edges form a fully connected, acyclic tree with a single head(if available, the main verb of the first main clause is the head)
Identification of Dependencies:
Selected features of dependencies
Parsing Methods
Dynamic programming
Graph algorithmus
Transition-based parsing