UTCS Artificial Intelligence
courses
talks/events
demos
people
projects
publications
software/data
labs
areas
admin
Mining Knowledge from Text Using Information Extraction (2005)
Raymond J. Mooney
and R. Bunescu
An important approach to text mining involves the use of natural-language information extraction. Information extraction (IE) distills structured data or knowledge from unstructured text by identifying references to named entities as well as stated relationships between such entities. IE systems can be used to directly extricate abstract knowledge from a text corpus, or to extract concrete data from a set of documents which can then be further analyzed with traditional data-mining techniques to discover more general patterns. We discuss methods and implemented systems for both of these approaches and summarize results on mining real text corpora of biomedical abstracts, job announcements, and product descriptions. We also discuss challenges that arise when employing current information extraction technology to discover knowledge in text.
View:
PDF
,
PS
Citation:
SIGKDD Explorations (special issue on Text Mining and Natural Language Processing)
, Vol. 7, 1 (2005), pp. 3-10.
Bibtex:
@Article{mooney:sigkdd05, title={Mining Knowledge from Text Using Information Extraction}, author={Raymond J. Mooney and R. Bunescu}, volume={7}, journal={SIGKDD Explorations (special issue on Text Mining and Natural Language Processing)}, number={1}, pages={3-10}, url="http://www.cs.utexas.edu/users/ai-lab?mooney:sigkdd05", year={2005} }
People
Razvan Bunescu
Ph.D. Alumni
bunescu [at] ohio edu
Raymond J. Mooney
Faculty
mooney [at] cs utexas edu
Areas of Interest
Information Extraction
Machine Learning
Text Data Mining
Labs
Machine Learning