UT ML Group: Bioinformatics

Bioinformatics concerns the development of computer databases and algorithms for learning, managing and processing biological information. Currently we are focusing on extracting structured information such as protein names and relationships from biological documents using natural language learning for information extraction.

By mining over 750,000 Medline abstracts for human protein interactions and integrating the results with existing databases, we have developed a fairly comprehensive database of 31,609 known human protein interactions. The resulting database is accessible though a web interface at Human Gene ID-SERVE

Publications

  1. Discriminative Structure and Parameter Learning for Markov Logic Networks [Abstract] [PDF]
    Tuyen N. Huynh and Raymond J. Mooney
    In Proceedings of the 25th International Conference on Machine Learning (ICML) , Helsinki, Finland, July 2008.

  2. Integrating Co-occurrence Statistics with Information Extraction for Robust Retrieval of Protein Interactions from Medline [Abstract] [PDF]
    Razvan Bunescu, Raymond Mooney, Arun Ramani and Edward Marcotte
    In Proceedings of the HLT-NAACL Workshop on Linking Natural Language Processing and Biology: Towards deeper biological literature analysis (BioNLP-2006), pp. 49-56, New York City, NY, June 2006.

  3. Consolidating the Set of Known Human Protein-Protein Interactions in Preparation for Large-Scale Mapping of the Human Interactome [Abstract] [PDF]
    Ramani, A.K., Bunescu, R.C., Mooney, R.J. and Marcotte, E.M.
    Genome Biology, 6, 5, r40(2005).

  4. Using Biomedical Literature Mining to Consolidate the Set of Known Human Protein-Protein Interactions [Abstract] [PDF]
    Ramani, A., Marcotte E., Bunescu, R., and Mooney, R.J.
    Proceedings of the ACL-ISMB Workshop on Linking Biological Literature, Ontologies and Databases: Mining Biological Semantics, pp. 46--53, Detroit, MI, June 2005.

  5. Collective Information Extraction with Relational Markov Networks [Abstract] [PDF]
    Razvan Bunescu and Raymond J. Mooney
    Proceedings of the 42nd Annual Meeting of the Association for Computational Linguistics (ACL-2004), pp. 439-446, Barcelona, Spain, July 2004.

  6. Relational Markov Networks for Collective Information Extraction [Abstract] [PDF]
    Razvan Bunescu and Raymond J. Mooney
    Proceedings of the ICML-2004 Workshop on Statistical Relational Learning and its Connections to Other Fields (SRL-2004), Banff, Canada, July 2004.

  7. Comparative Experiments on Learning Information Extractors for Proteins and their Interactions [Abstract] [PDF]
    Razvan Bunescu, Ruifang Ge, Rohit J. Kate, Edward M. Marcotte, Raymond J. Mooney, Arun Kumar Ramani, and Yuk Wah Wong
    Artificial Intelligence in Medicine (Special Issue on Summarization and Information Extraction from Medical Documents), 33, 2 (2005), pp. 139-155.

  8. Learning to Extract Proteins and their Interactions from Medline Abstracts [Abstract] [PDF]
    Razvan Bunescu, Ruifang Ge, Rohit J. Kate, Raymond J. Mooney, Yuk Wah Wong, Edward M. Marcotte, and Arun Kumar Ramani
    Proceedings of the ICML-2003 Workshop on Machine Learning in Bioinformatics, pp.46-53, Washington DC, August 2003.

  9. Extracting Gene and Protein Names from Biomedical Abstracts [Abstract] [PDF]
    Razvan Bunescu, Ruifang Ge, Raymond J. Mooney, Edward Marcotte, and Arun Kumar Ramani
    Unpublished Technical Note, March 2002.


mooney@cs.utexas.edu