Publications: Unsupervised Learning, Clustering, and Self-Organization
Unsupervised learning does not require annotation or labeling from a human teacher; the idea is to learn the structure of the data from unlabeled examples. The most common unsupervised learning task is clustering, i.e., grouping instances into a discovered set of categories, each containing similar instances. Self-organizing maps additionally visualize the topology of the clusters on a map. Our work in this area includes applications to lexical semantics, topic modeling, and the discovery of latent class models, as well as methods for laterally connected, hierarchical, sequential-input, and growing self-organizing maps.
Unsupervised Code-Switching for Multilingual Historical Document Transcription [Details] [PDF] [Slides (PDF)] Dan Garrette and Hannah Alpert-Abrams and Taylor Berg-Kirkpatrick and Dan Klein In Proceedings of the 2015 Conference of the North American Chapter of the Association for Computational Linguistics -- Human Language Technologies (NAACL HLT 2015), 1036--1041, Denver, Colorado, June 2015.
A Mixture Model with Sharing for Lexical Semantics [Details] [PDF] [Slides (PDF)] Joseph Reisinger and Raymond J. Mooney In Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP-2010), 1173--1182, MIT, Massachusetts, USA, October 9--11 2010.
Cross-cutting Models of Distributional Lexical Semantics [Details] [PDF] [Slides (PDF)] Joseph S. Reisinger June 2010. Ph.D. proposal, Department of Computer Sciences, University of Texas at Austin.
Spherical Topic Models [Details] [PDF] [Slides (PDF)] Joseph Reisinger, Austin Waters, Bryan Silverthorn, and Raymond J. Mooney In Proceedings of the 27th International Conference on Machine Learning (ICML 2010), 2010.
Spherical Topic Models [Details] [PDF] Joseph Reisinger, Austin Waters, Bryan Silverthorn, and Raymond Mooney In NIPS'09 workshop: Applications for Topic Models: Text and Beyond, 2009.
Model-based Overlapping Clustering [Details] [PDF] A. Banerjee, C. Krumpelman, S. Basu, Raymond J. Mooney and Joydeep Ghosh In Proceedings of the Eleventh ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD-05), 2005.