Concept

Improving PCFGs by splitting NP non-terminals

We can split NP non-terminals into two versions: one for subjects, and one for objects. Having two nodes (NPsubjectNP_{subject} and NPobjectNP_{object}) would allow us to correctly model their different distributional properties since we could have different probabilities for the rules NPsubjectPRPNP_{subject} \to PRP and NPobjectPRPNP_{object} \to PRP. The intuition of splits can be implemented by doing parent annotation, in which we annotate each node with its parent in the parse tree. A PCFG can also be improved by splitting pre-terminal POS nodes. Adding specific tags for parts of speech can be useful in improving PCFG modeling.

0

1

Updated 2022-05-22

Contributors are:

Who are from:

Tags

Deep Learning (in Machine learning)

Data Science

Learn After