ISI Natural Language Understanding
The ISI Natural Language Understanding project builds natural language understanding (NLU) capabilities into Virtual Human (VH) agents. With these capabilities, VH agents can both parse and generate English. An agent "parses" English by converting a string of English words (e.g., a sentence) into symbols that represent its meaning. Conversely, it "generates" English by converting such symbols back into a grammatically correct sequence of English words. In each Virtual Human agent, the parser converts the output of the speech recognizer into the input of the system's internal dialogue manager, and the generator converts the output of the dialogue manager into the input of the speech synthesizer. Here's a simplified diagram of the process:
speech recognizer -> parser -> dialogue manager -> generator -> speech synthesizer
The parser and generator incorporate both statistical (learned) and rule-based (manually written) processing. In our experiments, we explore ways to combine these methods so that the strengths of one offset the weaknesses of the other. The project team builds various finite-state and statistically trained parsers for understanding, and template-based, phrase-expansion, and statistical generators for sentence creation.
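To make the combination of rule-based and statistical processing concrete, here is a minimal Python sketch. It is an illustration under stated assumptions, not the project's actual implementation: the rules, training utterances, frame format, templates, and all function names are hypothetical. The parser tries hand-written patterns first and falls back to a simple word-frequency classifier learned from examples; the generator fills a template from a meaning frame of the kind a dialogue manager might produce.

```python
import re
from collections import Counter

# Hypothetical hand-written rules: each pattern maps to a meaning-frame builder.
RULES = [
    (re.compile(r"^where is (?:the )?(\w+)\??$"),
     lambda m: {"act": "ask-location", "object": m.group(1)}),
    (re.compile(r"^(?:hello|hi)\b.*$"),
     lambda m: {"act": "greeting"}),
]

# Hypothetical labeled utterances used to train the statistical fallback.
TRAINING = [
    ("where is the clinic", "ask-location"),
    ("can you tell me how to find the clinic", "ask-location"),
    ("hello there", "greeting"),
    ("good morning", "greeting"),
]

# Hypothetical generation templates, keyed by dialogue act.
TEMPLATES = {
    "inform-location": "The {object} is {place}.",
    "greeting": "Hello.",
}


def tokenize(text):
    """Lowercase the text and split it into word tokens."""
    return re.findall(r"[a-z']+", text.lower())


def train(examples):
    """Count how often each word occurs with each dialogue act."""
    counts = {}
    for text, act in examples:
        counts.setdefault(act, Counter()).update(tokenize(text))
    return counts


def statistical_parse(text, model):
    """Pick the act whose training words best overlap the input words."""
    tokens = tokenize(text)

    def score(act):
        total = sum(model[act].values())
        return sum(model[act][t] / total for t in tokens)

    return {"act": max(sorted(model), key=score)}


def parse(text, model):
    """Rule-based parse first; statistical fallback if no rule matches."""
    normalized = text.strip().lower()
    for pattern, build in RULES:
        match = pattern.match(normalized)
        if match:
            return build(match)
    return statistical_parse(normalized, model)


def generate(frame):
    """Template-based generation: fill the act's template from the frame."""
    return TEMPLATES[frame["act"]].format(**frame)


model = train(TRAINING)
question = parse("Where is the clinic?", model)       # handled by a rule
paraphrase = parse("How do I find the clinic", model)  # handled by the fallback
reply = generate({"act": "inform-location", "object": "clinic",
                  "place": "down the road"})
```

In this sketch the rules give precise frames (including the `object` slot) for phrasings they anticipate, while the learned fallback covers unanticipated phrasings with a coarser frame. That is one simple way to let a small amount of training data go further, in the spirit of the combination described above.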
This project differs from others in the following ways:
- It includes both understanding and generation (most projects focus on one direction only)
- It includes prosodic information for parsing long sentences (no other project we know of does this)
- It explicitly combines statistical and rule-based components (most projects take one or the other approach exclusively)
- It requires less training data than most projects of this kind, because of the rule-based methods employed
Team
- David DeVault
- Eduard Hovy
Contact
- Eduard Hovy, Information Sciences Institute (ISI)
