Improving Black-box Speech Recognition using Semantic Parsing

Improving Black-box Speech Recognition using Semantic Parsing (2017)

Rodolfo Corona, Jesse Thomason, and Raymond J. Mooney

Speech is a natural channel for human-computer interaction in robotics and consumer applications. Natural language understanding pipelines that start with speech can have trouble recovering from speech recognition errors. Black-box automatic speech recognition (ASR) systems, built for general purpose use, are unable to take advantage of in-domain language models that could otherwise ameliorate these errors. In this work, we present a method for re-ranking black-box ASR hypotheses using an in-domain language model and semantic parser trained for a particular task. Our re-ranking method significantly improves both transcription accuracy and semantic understanding over a state-of-the-art ASR’s vanilla output.

View:

PDF

Citation:

In Proceedings of the 8th International Joint Conference on Natural Language Processing (IJCNLP-17), pp. 122-127, Taipei, Taiwan, November 2017.

Bibtex:

Presentation:

Poster

People

Rodolfo Corona	Undergraduate Alumni	rud721 [at] gmail com
Raymond J. Mooney	Faculty	mooney [at] cs utexas edu
Jesse Thomason	Ph.D. Alumni	thomason DOT jesse AT gmail

Areas of Interest

Learning for Semantic Parsing Semi-Supervised Learning

Software/Data

Improving Black-box Speech Recognition using Semantic Parsing	The data used in the paper Improving Black-box Speech Recognition using Semantic Parsing, IJCNLP 2017 can be downloaded ...	2019

Labs

Machine Learning