Latent Variable Models of Concept-Attribute Attachment

Supplementary Material

Joseph Reisinger (joeraii@cs.utexas.edu)
Marius Pasca (mars@google.com)



Paper | Talk Slides | Supplemental derivations


Abstract

This paper presents a set of Bayesian methods for automatically extending the WordNet ontology with new concepts and annotating existing concepts with generic property fields, or attributes. We base our approach on Latent Dirichlet Allocation and evaluate along three dimensions: (1) the precision of the ranked lists of attributes, (2) the quality of the attribute assignments to WordNet concepts, and (3) the specificity of the attributes at each concept. In all cases we find that the principled LDA-based approaches outperform previously proposed heuristic methods.

Data sets

Coming soon