- Active Learning for Natural Language Parsing and Information Extraction
Cynthia A. Thompson, Mary Elaine Califf and Raymond J. Mooney
Nominated for Best Paper Award
Proceedings of the Sixteenth International Machine Learning Conference (ICML-99) , Bled, Slovenia, pp. 406-414, June 1999.
Paper ID: 92
Category: Active Learning, Information Extraction, Natural Language Learning, Learning for Semantic Parsing
In natural language acquisition, it is difficult to gather the annotated data needed for supervised learning; however, unannotated data is fairly plentiful. Active learning methods attempt to select for annotation and training only the most informative examples, and therefore are potentially very useful in natural language applications. However, existing results for active learning have only considered standard classification tasks. To reduce annotation effort while maintaining accuracy, we apply active learning to two non-classification tasks in natural language processing: semantic parsing and information extraction. We show that active learning can significantly reduce the number of annotated examples required to achieve a given level of performance for these complex tasks.

mooney@cs.utexas.edu