Computational Advertising
- Hierarchical Mixtures of GLMs for Combining Multiple Ground Truths
[PDF]
Joseph Reisinger, Sugato Basu, Roberto Bayardo
Proceedings of NIPS Workshop on Domain Adaptation, 2011.
- User Browsing Models: Relevance versus Examination
[PDF]
Ramakrishnan Srikant, Sugato Basu, Ni Wang, Daryl Pregibon
Proceedings of 16th International ACM-SIGKDD Conference on Knowledge Discovery
and Data Mining (KDD-2010), 2010.
- PLANET: Massively Parallel Learning of Tree Ensembles with MapReduce
[PDF]
Biswanath Panda, Joshua S. Herbach, Sugato Basu, Roberto J. Bayardo
Proceedings of the 35th International Conference on Very Large Data Bases
(VLDB-2009), 2009.
- Predicting Bounce Rates in Sponsored Search Advertisements
[PDF]
D. Sculley, Robert Malkin, Sugato Basu, Roberto J. Bayardo
Proceedings of the 15th International ACM-SIGKDD Conference on Knowledge Discovery
and Data Mining (KDD-2009), 2009, pp. 1325-1334.
Social Networks
- A Social Query Model for Decentralized Search [PDF]
Arindam Banerjee, Sugato Basu
Second ACM Workshop on Social Network Mining and Analysis at the KDD
Conference (SNAKDD-08), Las Vegas, Nevada, August 2008.
- iLink: Search and Routing in Social Networks [PDF]
Jeffrey Davitz, Jiye Yu, Sugato Basu, David Gutelius, Alexandra Harris
Proceedings of the Knowledge Discovery and Data Mining (KDD) Conference
(KDD-2007), San Jose, California, August 2007.
Book
- Constrained Clustering: Advances in Algorithms, Theory, and
Applications [Book website]
[Buy on Amazon]
Editors: Sugato Basu, Ian Davidson, Kiri L. Wagstaff
Publisher: CRC Press
Year: 2008
Book Chapters
- Constrained Partitional Clustering of Text Data: An Overview
Sugato Basu and Ian Davidson
In Text Mining: Classification, Clustering, and Applications
Editors: Ashok Srivastava and Mehran Sahami
Publisher: CRC Press
Year: 2009
- Probabilistic Semi-supervised Clustering with Constraints [PDF]
[Code]
Sugato Basu, Mikhail Bilenko, Arindam Banerjee, Raymond Mooney
In Semi-Supervised Learning
Editors: Olivier Chapelle, Bernhard Schölkopf, Alexander Zien
Publisher: MIT Press
Year: 2006
PhD Thesis
- Semi-supervised Clustering: Probabilistic Models, Algorithms and
Experiments [PDF]
Sugato Basu
Ph.D. thesis, Department of Computer Sciences, University of Texas at
Austin, 2005.
Clustering
- A Survey of Clustering with Instance Level Constraints [PDF]
Ian Davidson and Sugato Basu
In ACM Transactions on Knowledge Discovery from Data, 1-41, 2007.
- Multi-way Clustering on Relation Graphs [PDF]
Arindam Banerjee, Sugato Basu, Srujana Merugu
Proceedings of the SIAM International Conference on Data Mining
(SDM-2007), Minneapolis, Minnesota, April 2007.
(Best of SDM'07 Award)
- Topic Models over Text Streams: A Study of Batch and Online
Unsupervised Learning [PDF]
Arindam Banerjee, Sugato Basu
Proceedings of the SIAM International Conference on Data Mining
(SDM-2007), Minneapolis, Minnesota, April 2007.
Longer version (with more details in model description, experiments and
related work sections): [PDF]
- Measuring Constraint-Set Utility for Partitional Clustering
Algorithms [PDF]
Ian Davidson, Kiri L. Wagstaff, and Sugato Basu
Proceedings of the Tenth European Conference on Principles and
Practice of Knowledge Discovery in Databases (PKDD-2006), Berlin,
Germany, September 2006.
- Model-based Overlapping Clustering [PDF]
Arindam Banerjee, Chase Krumpelman, Sugato Basu, Raymond Mooney, Joydeep Ghosh
Proceedings of the 11th ACM SIGKDD
International Conference on Knowledge Discovery and Data Mining
(KDD-2005), Chicago, IL, August 2005.
- Semi-supervised Graph Clustering: A Kernel Approach [PDF]
Brian Kulis, Sugato Basu, Inderjit Dhillon, Raymond J. Mooney
Proceedings of the 22nd
International Conference on Machine Learning
(ICML-2005), Bonn, Germany, August 2005.
(Distinguished Student Paper Award)
- A Probabilistic Framework for Semi-Supervised Clustering [PDF]
[Code]
Sugato Basu, Mikhail Bilenko, and Raymond J. Mooney
Proceedings of the 10th ACM SIGKDD
International Conference on Knowledge Discovery and Data Mining
(KDD-2004), Seattle, WA, August 2004.
(Best Research Paper Award)
- Integrating Constraints and Metric Learning in Semi-Supervised
Clustering [PDF]
Mikhail Bilenko, Sugato Basu, and Raymond J. Mooney
Proceedings of the 21st International Conference on
Machine Learning (ICML-2004), Banff, Canada, July 2004.
Earlier workshop version: Comparing and Unifying Search-Based and Similarity-Based Approaches to
Semi-Supervised Clustering [PDF]
Sugato Basu, Mikhail Bilenko, and Raymond J. Mooney
Proceedings of the ICML-2003 Workshop on the Continuum from Labeled to
Unlabeled Data in Machine Learning and Data Mining, pp. 42-49,
Washington DC, August 2003.
- A Comparison of Inference Techniques for Semi-supervised Clustering with
Hidden Markov Random Fields
[PDF]
Mikhail Bilenko and Sugato Basu
Proceedings of the ICML-2004 Workshop on Statistical Relational Learning
and its Connections to Other Fields (SRL-2004),
Banff, Canada, July 2004.
- Semi-supervised Clustering with Limited Background Knowledge [PDF]
Sugato Basu
Proceedings of the 9th AAAI/SIGART Doctoral Consortium,
San Jose, California, July 2004.
- Active Semi-Supervision for Pairwise Constrained Clustering [PDF]
Sugato Basu, Arindam Banerjee, and Raymond J. Mooney
Proceedings of the SIAM International Conference on Data
Mining (SDM-2004), Lake Buena Vista, FL, April 2004.
- Semi-supervised Clustering for Intelligent User Management [PDF]
Sugato Basu, Mikhail Bilenko, and Raymond J. Mooney
Proceedings of the IBM Austin Center for Advanced Studies 5th Annual
Austin CAS Conference, Austin, TX, February 2004.
- Semi-supervised Clustering by Seeding [PDF]
Sugato Basu, Arindam Banerjee, and Raymond J. Mooney
Proceedings of the 19th International Conference on Machine
Learning (ICML-2002), pp. 19-26, Sydney, Australia, July 2002.
Record Linkage
- Adaptive Product Normalization: Using Online Learning for Record Linkage
in Comparison Shopping [PDF]
Mikhail Bilenko, Sugato Basu, Mehran Sahami
Proceedings of the 5th IEEE International Conference on
Data Mining (ICDM-2005)
Rule Mining
- Evaluating the Novelty of Text-Mined Rules using Lexical Knowledge [PDF]
Sugato Basu, Raymond J. Mooney, Krupakar V. Pasupuleti, and Joydeep Ghosh
Proceedings of the 7th ACM SIGKDD International Conference on
Knowledge Discovery and Data Mining (KDD-2001), pp. 233-238,
San Francisco, CA, August 2001.
Earlier workshop version: Using Lexical Knowledge to Evaluate the Novelty of
Rules Mined from Text [PDF]
Sugato Basu, Raymond J. Mooney, Krupakar V. Pasupuleti and Joydeep Ghosh
Proceedings of NAACL 2001 Workshop on WordNet and Other
Lexical Resources: Applications, Extensions and
Customizations, Pittsburgh, PA, June 2001.
Optimization in VLSI Testing
- Synthesis of Low Power CED Circuits Based on Parity Codes [PDF]
Shalini Ghosh, Sugato Basu and Nur A. Touba
Proceedings of the VLSI Test
Symposium (VTS-2005), Palm Springs, CA, May 2005.
- Reducing Power Consumption in Memory ECC Checkers [PDF]
Shalini Ghosh, Sugato Basu and Nur A. Touba
Proceedings of the IEEE International Test
Conference (ITC-2004), Charlotte, NC, October 2004.
Followup journal version: Selecting Error Correcting Codes to Minimize Power
in Memory Checker Circuits [PDF]
Shalini Ghosh, Sugato Basu and Nur A. Touba
Journal of Low Power Testing, 2005.
- Low-power Weighted Pseudo-random BIST Using Special Scan Cells [PDF]
Shalini Ghosh, Eric McDonald, Sugato Basu and Nur A. Touba
Proceedings of the Great Lakes Symposium on VLSI
(GLSVLSI-2004), Boston, MA, April 2004.
- Joint Minimization of Power and Area in Scan Testing by Scan Cell
Re-ordering [PDF]
Shalini Ghosh, Sugato Basu and Nur A. Touba
Proceedings of the IEEE Computer Society Annual Symposium on
VLSI (ISVLSI-2003), Tampa, FL, February 2003.
PhD Proposal
- Semi-supervised Clustering: Learning with Limited User Feedback [PDF]
Sugato Basu
Ph.D. proposal, Department of Computer Sciences, University of Texas at
Austin, Nov 2003.
Also appears as Technical Report UT-AI-TR-03-307, Artificial Intelligence
Lab, University of Texas at Austin, January 2004.
MS Thesis
- Data Structures For A Mini-Threading Algorithm For Protein Structure
Prediction [PDF]
Sugato Basu
M.S. thesis, Computer Engineering Department, University of California at
Santa Cruz, July 2000.