Improving Black-box Speech Recognition using Semantic Parsing (2017)
Speech is a natural channel for human-computer interaction in robotics and consumer applications. Natural language understanding pipelines that start with speech can have trouble recovering from speech recognition er- rors. Black-box automatic speech recognition (ASR) systems, built for general purpose use, are unable to take advantage of in-domain language models that could otherwise ameliorate these errors. In this work, we present a method for re-ranking black-box ASR hypotheses using an in-domain language model and semantic parser trained for a particular task. Our re-ranking method significantly improves both transcription accuracy and seman- tic understanding over a state-of-the-art ASR’s vanilla output.
To Appear In Proceedings of the 8th International Joint Conference on Natural Language Processing (IJCNLP-17), Taipei, Taiwan, November 2017.

Rodolfo Corona Undergraduate Student rcorona [at] utexas edu
Raymond J. Mooney Faculty mooney [at] cs utexas edu
Jesse Thomason Ph.D. Student jesse [at] cs utexas edu