Authorship Attribution Using Probabilistic Context-Free Grammars (2010)
Sindhu Raghavan, Adriana Kovashka and Raymond Mooney
In this paper, we present a novel approach for authorship attribution, the task of identifying the author of a document, using probabilistic context-free grammars. Our approach involves building a probabilistic context-free grammar for each author and using this grammar as a language model for classification. We evaluate the performance of our method on a wide range of datasets to demonstrate its efficacy.
In Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics (ACL-2010), pp. 38--42 2010.

Slides (PPT)
Raymond J. Mooney Faculty mooney [at] cs utexas edu
Sindhu Raghavan Ph.D. Alumni sindhu [at] cs utexas edu