Leveraging Discourse Information Effectively for Authorship Attribution

Leveraging Discourse Information Effectively for Authorship Attribution (2017)

Elisa Ferracane, Su Wang, and Raymond J. Mooney

We explore techniques to maximize the effectiveness of discourse information in the task of authorship attribution. We present a novel method to embed discourse features in a Convolutional Neural Network text classifier, which achieves a state-of-the-art result by a significant margin. We empirically investigate several featurization methods to understand the conditions under which discourse features contribute non-trivial performance gains, and analyze discourse embeddings.

View:

PDF

Citation:

In In Proceedings of the 8th International Joint Conference on Natural Language Processing (IJCNLP-17), pp. 584–593, Taipei, Taiwan, November 2017.

Bibtex:

Presentation:

Slides (PDF) Video

People

Raymond J. Mooney

Faculty

mooney [at] cs utexas edu

Areas of Interest

Natural Language Processing

Labs

Machine Learning