UTCS Artificial Intelligence
Semisupervised Clustering for Intelligent User Management (2004)
Sugato Basu, Mikhail Bilenko, and Raymond J. Mooney
Grouping users automatically based on their system usage can be beneficial in an autonomic computing environment. Clustering algorithms can generate meaningful user groups that provide important insights to system administrators about user profiles and group policies. In particular, if a small amount of supervision is provided by the administrator to the clustering process, semi-supervised clustering algorithms can use this supervision to generate clusters which are more useful for user management. In this work, we demonstrate the utility of semi-supervised clustering in intelligent user management. We collect publicly available system usage data of users in a university computing environment, and cluster the users using semi-supervised hierarchical agglomerative clustering based on the profile of the processes they run. Initial supervision is provided in the form of a few users running a specific process. Semi-supervised clustering gives us more meaningful clusters than unsupervised clustering in this domain, demonstrating that our technique can find interesting and useful groups in data with minimal user intervention.
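To make the idea in the abstract concrete, here is a minimal sketch of seeded hierarchical agglomerative clustering. It is not the authors' algorithm: the abstract does not specify the linkage, the similarity measure, or how the supervision enters, so this sketch assumes average-link merging, cosine similarity over process-count vectors, and supervision applied by pre-merging the seed users into one starting cluster. The function names, user names, and process counts are all hypothetical.

```python
from itertools import combinations
from math import sqrt


def cosine(u, v):
    """Cosine similarity between two equal-length process-count vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = sqrt(sum(a * a for a in u)) * sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def average_link(c1, c2, profiles):
    """Mean pairwise similarity between the members of two clusters."""
    total = sum(cosine(profiles[a], profiles[b]) for a in c1 for b in c2)
    return total / (len(c1) * len(c2))


def seeded_hac(profiles, seeds, k):
    """Merge users bottom-up into k clusters.

    Assumption: supervision is encoded by starting with all seed users
    already merged into a single cluster; every other user starts alone.
    """
    seed_set = frozenset(seeds)
    clusters = [frozenset([u]) for u in profiles if u not in seed_set]
    if seed_set:
        clusters.append(seed_set)
    while len(clusters) > k:
        # Pick the most similar pair of clusters and merge it.
        c1, c2 = max(
            combinations(clusters, 2),
            key=lambda pair: average_link(pair[0], pair[1], profiles),
        )
        clusters = [c for c in clusters if c not in (c1, c2)] + [c1 | c2]
    return clusters


# Hypothetical toy data: counts of (compiler, editor, matlab) processes per user.
profiles = {
    "alice": [9, 1, 0],
    "bob": [8, 2, 0],
    "carol": [0, 1, 9],
    "dave": [0, 2, 8],
    "erin": [7, 1, 1],
}

# Supervision: the administrator marks two users known to run the compiler.
clusters = seeded_hac(profiles, seeds=["alice", "bob"], k=2)
print(sorted(sorted(c) for c in clusters))
```

On this toy data the seed pair attracts the remaining compiler-heavy user, while the two matlab-heavy users form the second group. With real usage data the choice of linkage and similarity measure would matter considerably more than it does here.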
View: PDF, PS
Citation: In Proceedings of the IBM Austin Center for Advanced Studies 5th Annual Austin CAS Conference, Austin, TX, February 2004.
Bibtex:
@inproceedings{basu:ibm-cas04,
  title={Semisupervised Clustering for Intelligent User Management},
  author={Sugato Basu and Mikhail Bilenko and Raymond J. Mooney},
  booktitle={Proceedings of the IBM Austin Center for Advanced Studies 5th Annual Austin CAS Conference},
  month={February},
  address={Austin, TX},
  url="http://www.cs.utexas.edu/users/ai-lab/?basu:ibm-cas04",
  year={2004}
}
People
Sugato Basu, Alumni, sugato@cs.utexas.edu
Mikhail Bilenko, Alumni, mbilenko@microsoft.com
Raymond J. Mooney, Professor, mooney@cs.utexas.edu
Areas of Interest
Semi-Supervised Learning
Machine Learning
Labs
Machine Learning