Leveraging Discourse Information Effectively for Authorship Attribution (2017)
Elisa Ferracane, Su Wang, and Raymond J. Mooney
We explore techniques to maximize the effectiveness of discourse information in the task of authorship attribution. We present a novel method to embed discourse features in a Convolutional Neural Network text classifier, which achieves a state-of-the-art result by a significant margin. We empirically investigate several featurization methods to understand the conditions under which discourse features contribute non-trivial performance gains, and analyze discourse embeddings.
To appear in Proceedings of the 8th International Joint Conference on Natural Language Processing (IJCNLP-17), Taipei, Taiwan, November 2017.
