PhD Thesis
- Semi-supervised Clustering: Probabilistic Models, Algorithms and
Experiments [PDF]
Sugato Basu
Ph.D. thesis, Department of Computer Sciences, University of Texas at
Austin, 2005.
Book
- Constrained Clustering: Advances in Algorithms, Theory, and
Applications [Book website]
[Buy on Amazon]
Editors: Sugato Basu, Ian Davidson, Kiri L. Wagstaff
Publisher: CRC Press
Year: 2008
Book Chapter
- Constrained Partitional Clustering of Text Data: An Overview
Sugato Basu and Ian Davidson
In Text Mining: Classification, Clustering, and Applications
Editors: Ashok Srivastava and Mehran Sahami
Publisher: CRC Press
Year: 2009
- Probabilistic Semi-supervised Clustering with Constraints [PDF]
[Code]
Sugato Basu, Mikhail Bilenko, Arindam Banerjee, Raymond Mooney
In Semi-Supervised Learning
Editors: Olivier Chapelle, Bernhard Schölkopf, Alexander Zien
Publisher: MIT Press
Year: 2006
Computational Advertising
- PLANET: Massively Parallel Learning of Tree Ensembles with MapReduce
[PDF]
Biswanath Panda, Joshua S. Herbach, Sugato Basu, Roberto J. Bayardo
Proceedings of the 35th International Conference on Very Large Data Bases
(VLDB-2009) (to appear).
- Predicting Bounce Rates in Sponsored Search Advertisements
[PDF]
D. Sculley, Robert Malkin, Sugato Basu, Roberto J. Bayardo
Proceedings of the 15th International ACM-SIGKDD Conference on Knowledge Discovery
and Data Mining, 2009, pp. 1325-1334.
Record Linkage
- Adaptive Product Normalization: Using Online Learning for Record Linkage
in Comparison Shopping
[Abstract]
[Gzipped PS] [PDF]
Mikhail Bilenko, Sugato Basu, Mehran Sahami
Proceedings of the 5th IEEE International Conference on
Data Mining (ICDM-2005)
Social Networks
- A Social Query Model for Decentralized Search [PDF]
Arindam Banerjee, Sugato Basu
Second ACM Workshop on Social Network Mining and Analysis at the KDD
Conference (SNAKDD-08), Las Vegas, Nevada, August 2008.
- iLink: Search and Routing in Social Networks [PDF]
Jeffrey Davitz, Jiye Yu, Sugato Basu, David Gutelius, Alexandra Harris
Proceedings of the Knowledge Discovery and Data Mining (KDD) Conference
(KDD-2007), San Jose, California, August 2007.
Clustering
- Multi-way Clustering on Relation Graphs [PDF]
Arindam Banerjee, Sugato Basu, Srujana Merugu
Proceedings of the SIAM International Conference on Data Mining
(SDM-2007), Minneapolis, Minnesota, April 2007.
(Best of SDM'07 Award)
- Topic Models over Text Streams: A Study of Batch and Online
Unsupervised Learning [PDF]
Arindam Banerjee, Sugato Basu
Proceedings of the SIAM International Conference on Data Mining
(SDM-2007), Minneapolis, Minnesota, April 2007.
Longer version (with more details in model description, experiments and
related work sections): [PDF]
- Measuring Constraint-Set Utility for Partitional Clustering
Algorithms [PDF]
Ian Davidson, Kiri L. Wagstaff, and Sugato Basu
Proceedings of the Tenth European Conference on Principles and
Practice of Knowledge Discovery in Databases (PKDD-2006), Berlin,
Germany, September 2006.
- Model-based Overlapping Clustering [Abstract]
[Gzipped PS] [PDF]
Arindam Banerjee, Chase Krumpelman, Sugato Basu, Raymond Mooney, Joydeep Ghosh
Proceedings of the 11th ACM SIGKDD
International Conference on Knowledge Discovery and Data Mining
(KDD-2005), Chicago, IL, August 2005.
- Semi-supervised Graph Clustering: A Kernel Approach [Abstract]
[Gzipped PS] [PDF]
Brian Kulis, Sugato Basu, Inderjit Dhillon, Raymond J. Mooney
Proceedings of the 22nd
International Conference on Machine Learning
(ICML-2005), Bonn, Germany, August 2005.
(Distinguished Student Paper Award)
- A Probabilistic Framework for Semi-Supervised Clustering [Abstract]
[Gzipped PS] [PDF]
[Code]
Sugato Basu, Mikhail Bilenko, and Raymond J. Mooney
Proceedings of the 10th ACM SIGKDD
International Conference on Knowledge Discovery and Data Mining
(KDD-2004), Seattle, WA, August 2004.
(Best Research Paper Award)
- Integrating Constraints and Metric Learning in Semi-Supervised
Clustering [Abstract]
[Gzipped PS] [PDF]
Mikhail Bilenko, Sugato Basu, and Raymond J. Mooney
Proceedings of the 21st International Conference on
Machine Learning (ICML-2004), Banff, Canada, July 2004.
Earlier workshop version: Comparing and Unifying Search-Based and Similarity-Based Approaches to
Semi-Supervised Clustering [Abstract]
[Gzipped PS] [PDF]
Sugato Basu, Mikhail Bilenko, and Raymond J. Mooney
Proceedings of the ICML-2003 Workshop on the Continuum from Labeled to
Unlabeled Data in Machine Learning and Data Mining, pp.42-49,
Washington DC, August 2003.
- A Comparison of Inference Techniques for Semi-supervised Clustering with
Hidden Markov Random Fields
[Abstract]
[Gzipped PS] [PDF]
Mikhail Bilenko and Sugato Basu
Proceedings of the ICML-2004 Workshop on Statistical Relational Learning
and its Connections to Other Fields (SRL-2004),
Banff, Canada, July 2004.
- Semi-supervised Clustering with Limited Background Knowledge [Abstract]
[Gzipped PS] [PDF]
Sugato Basu
Proceedings of the 9th AAAI/SIGART Doctoral Consortium,
San Jose, California, July 2004.
- Active Semi-Supervision for Pairwise Constrained Clustering [Abstract]
[Gzipped PS] [PDF]
Sugato Basu, Arindam Banerjee, and Raymond J. Mooney
Proceedings of the SIAM International Conference on Data
Mining (SDM-2004), Lake Buena Vista, FL, April 2004.
- Semi-supervised Clustering for Intelligent User Management [Abstract]
[Gzipped PS] [PDF]
Sugato Basu, Mikhail Bilenko, and Raymond J. Mooney
Proceedings of the IBM Austin Center for Advanced Studies 5th Annual
Austin CAS Conference, Austin, TX, February 2004.
- Semi-supervised Clustering by Seeding [Abstract]
[Gzipped PS] [PDF]
Sugato Basu, Arindam Banerjee, and Raymond J. Mooney
Proceedings of the 19th International Conference on Machine
Learning (ICML-2002), pp. 19-26, Sydney, Australia, July 2002.
Rule Mining
- Evaluating the Novelty of Text-Mined Rules using Lexical Knowledge [Abstract]
[Gzipped PS] [PDF]
Sugato Basu, Raymond J. Mooney, Krupakar V. Pasupuleti, and Joydeep Ghosh
Proceedings of the 7th ACM SIGKDD International Conference on
Knowledge Discovery and Data Mining (KDD-2001), pp.
233-238, San Francisco, CA, August 2001.
Earlier workshop version: Using Lexical Knowledge to Evaluate the Novelty of
Rules Mined from Text [Abstract] [Gzipped PS] [PDF]
Sugato Basu, Raymond J. Mooney, Krupakar V. Pasupuleti and Joydeep Ghosh
Proceedings of NAACL 2001 Workshop on WordNet and Other
Lexical Resources: Applications, Extensions and
Customizations, Pittsburgh, PA, June 2001.
Optimization in VLSI Testing
- Synthesis of Low Power CED Circuits Based on Parity Codes [PDF]
Shalini Ghosh, Sugato Basu and Nur A. Touba
Proceedings of the VLSI Test
Symposium (VTS-2005), Palm Springs, CA, May 2005.
- Reducing Power Consumption in Memory ECC Checkers [PDF]
Shalini Ghosh, Sugato Basu and Nur A. Touba
Proceedings of the IEEE International Test
Conference (ITC-2004), Charlotte, NC, October 2004.
Followup journal version: Selecting Error Correcting Codes to Minimize Power
in Memory Checker Circuits [PDF]
Shalini Ghosh, Sugato Basu and Nur A. Touba
Journal of Low Power Testing, 2005.
- Low-power Weighted Pseudo-random BIST Using Special Scan Cells [PDF]
Shalini Ghosh, Eric McDonald, Sugato Basu and Nur A. Touba
Proceedings of the Great Lakes Symposium on VLSI
(GLSVLSI-2004), Boston, MA, April 2004.
- Joint Minimization of Power and Area in Scan Testing by Scan Cell
Re-ordering [PDF]
Shalini Ghosh, Sugato Basu and Nur A. Touba
Proceedings of the IEEE Computer Society Annual Symposium on
VLSI (ISVLSI-2003), Tampa, FL, February 2003.
PhD Proposal
- Semi-supervised Clustering: Learning with Limited User Feedback [Abstract]
[Gzipped PS] [PDF]
Sugato Basu
Ph.D. proposal, Department of Computer Sciences, University of Texas at
Austin, Nov 2003.
Also appears as Technical Report UT-AI-TR-03-307, Artificial Intelligence
Lab, University of Texas at Austin, January 2004.
MS Thesis
- Data Structures For A Mini-Threading Algorithm For Protein Structure
Prediction [Gzipped PS]
[PDF]
Sugato Basu
M.S. thesis, Computer Engineering Department, University of California at
Santa Cruz, July 2000.