- Active Learning for Probability Estimation using Jensen-Shannon Divergence
Melville, P., Yang, S. M., Saar-Tsechansky, M., and Mooney, R.J.
Proceedings of the 16th European Conference on Machine Learning, Porto, Portugal, pp. 268-279, October 2005.
Paper ID: 161
Category: Active Learning
Active selection of good training examples is an important approach to reducing data-collection costs in machine learning; however, most existing methods focus on maximizing classification accuracy. In many applications, such as those with unequal misclassification costs, producing good class probability estimates (CPEs) is more important than optimizing classification accuracy. We introduce novel variations of two extant active-learning algorithms, Boostrap-LV and ACTIVEDECORATE, by using Jensen-Shannon divergence (a similarity measure for probability distributions) to improve sample selection for optimizing CPEs. Comprehensive experimental results demonstrate the benefits of our enhancements.

mooney@cs.utexas.edu