Learning to "Read Between the Lines" using Bayesian Logic Programs (2012)
Most information extraction (IE) systems identify facts that are explicitly stated in text. However, in natural language, some facts are implicit, and identifying them requires "reading between the lines". Human readers naturally use common sense knowledge to infer such implicit information from the explicitly stated facts. We propose an approach that uses Bayesian Logic Programs (BLPs), a statistical relational model combining first-order logic and Bayesian networks, to infer additional implicit information from extracted facts. It involves learning uncertain commonsense knowledge (in the form of probabilistic first-order rules) from natural language text by mining a large corpus of automatically extracted facts. These rules are then used to derive additional facts from extracted information using BLP inference. Experimental evaluation on a benchmark data set for machine reading demonstrates the efficacy of our approach.
Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics (ACL-2012) (2012), pp. 349--358.

Slides (PPT)
Hyeonseo Ku Masters Alumni yorq [at] cs utexas edu
Raymond J. Mooney Faculty mooney [at] cs utexas edu
Sindhu Raghavan Ph.D. Alumni sindhu [at] cs utexas edu