Learning to Extract Proteins and their Interactions from Medline Abstracts (2003)
Razvan Bunescu, Ruifang Ge, Rohit J. Kate, Raymond J. Mooney, Yuk Wah Wong, Edward M. Marcotte, and Arun Kumar Ramani
We present results from a variety of learned information extraction systems for identifying human protein names in Medline abstracts and subsequently extracting interactions between the proteins. We demonstrate that machine learning approaches using support vector machines and hidden Markov models are able to identify human proteins with higher accuracy than several previous approaches. We also demonstrate that various rule induction methods are able to identify protein interactions with higher precision than manually developed rules.
In Proceedings of the ICML-03 Workshop on Machine Learning in Bioinformatics, pp. 46-53, Washington, DC, August 2003.

Razvan Bunescu Ph.D. Alumni bunescu [at] ohio edu
Ruifang Ge Ph.D. Alumni grf [at] cs utexas edu
Rohit Kate Postdoctoral Alumni katerj [at] uwm edu
Raymond J. Mooney Faculty mooney [at] cs utexas edu
Yuk Wah Wong Ph.D. Alumni ywwong [at] cs utexas edu