Kallirroi Georgila, Ning Wang, Jonathan Gratch: “Cross-Domain Speech Disfluency Detection”

September 25, 2010 | Tokyo, Japan

Speaker: Kallirroi Georgila, Ning Wang, Jonathan Gratch
Host: 11th Annual SIGdial Meeting on Discourse and Dialogue

We build a model for speech disfluency detection based on conditional random fields (CRFs) using the Switchboard corpus. This model is then applied to a new domain without any adaptation. We show that a technique for detecting speech disfluencies based on Integer Linear Programming (ILP) significantly outperforms CRFs. In particular, in terms of F-score and NIST Error Rate the absolute improvement of ILP over CRFs exceeds 20% and 25% respectively. We conclude that ILP is an approach with great potential for speech disfluency detection when there is a lack or shortage of in-domain data for training.