Publications in Progress

Estimating the Global PageRank of Web Communities
J. Davis and I. S. Dhillon
Proceedings of the Twelfth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pages 116-125, August 2006.
Download: [pdf]

A Unified View of Kernel k-means, Spectral Clustering and Graph Partitioning
I. S. Dhillon, Y. Guan, and B. Kulis
Submitted for publication, January 2005.
(Also appears as UTCS Technical Report #TR-04-25, June 30, 2004. [Abstract & Download])

Matrix Nearness Problems using Bregman Divergences
I. S. Dhillon and J. A. Tropp
Submitted for publication, 2006.
Download: [pdf]

Co-clustering of Human Cancer Microarrays using Minimum Sum-Squared Residue Co-clustering
H. Cho and I. S. Dhillon
Submitted for publication, 2006.

On The Existence of Equiangular Uniform Frames
M. A. Sustik, J. A. Tropp, I. S. Dhillon, and R. W. Heath Jr.
Submitted for publication to Linear Algebra and its Applications, 2005.
Download: [ps, pdf]

Constructing Grassmannian Packings via Alternating Projections
J. A. Tropp, I. S. Dhillon, R. W. Heath Jr., and T. Strohmer
In preparation, 2005.

Journal Publications

Clustering with Bregman Divergences
A. Banerjee, S. Merugu, I. S. Dhillon, and J. Ghosh
Journal of Machine Learning Research, vol. 6, pages 1705-1749, October 2005.
Download: [pdf]

Clustering on the Unit Hypersphere using von Mises-Fisher Distributions
A. Banerjee, I. S. Dhillon, J. Ghosh, and S. Sra
Journal of Machine Learning Research, vol. 6, pages 1345-1382, September 2005.
Download: [pdf]

Generalized Finite Algorithms for Constructing Hermitian Matrices with Prescribed Diagonal and Spectrum
I. S. Dhillon, R. W. Heath Jr., M. A. Sustik, and J. A. Tropp
SIAM Journal on Matrix Analysis and Applications, vol. 27, no. 1, pages 61-71, June 2005.
Download: [ps, pdf]
(Also appears as UTCS Technical Report #TR-03-49, December 2003. [Abstract & Download])

Designing Structured Tight Frames Via An Alternating Projection Method
J. A. Tropp, I. S. Dhillon, R. W. Heath Jr., and T. Strohmer
IEEE Transactions on Information Theory, vol. 51, no. 1, pages 188-209, January 2005.
Download: [pdf]
(Also appears as ICES Report 03-50, UT, December 2003. [Abstract & Download])

Finite-Step Algorithms for Constructing Optimal CDMA Signature Sequences
J. A. Tropp, I. S. Dhillon, and R. W. Heath Jr.
IEEE Transactions on Information Theory, vol. 50, no. 11, pages 2916-2921, November 2004.
Download: [pdf]
(Also appears as WNCG-TR-2003-05-08, UT, June 2003. [Abstract & Download])

Diametrical Clustering for identifying Anti-Correlated Gene Clusters
I. S. Dhillon, E. M. Marcotte, and U. Roshan
Bioinformatics, vol. 19, no. 13, pages 1612-1619, 2003.
Download: [ps, pdf]
(Also appears as UTCS Technical Report #TR-02-49, September 2002. [Abstract & Download])

A Divisive Information-Theoretic Feature Clustering Algorithm for Text Classification
I. S. Dhillon, S. Mallela, and R. Kumar
Journal of Machine Learning Research (JMLR), vol. 3, pages 1265-1287, March 2003.
Download: [ps, pdf]

Class Visualization of High-Dimensional Data with Applications
I. S. Dhillon, D. S. Modha, and W. S. Spangler
Computational Statistics & Data Analysis (Special issue on Matrix Computations & Statistics), vol. 41:1, pages 59-90, 2002.
Download: [ps, pdf]
Software is available here.

Concept Decompositions for Large Sparse Text Data using Clustering
I. S. Dhillon and D. S. Modha
Machine Learning, vol. 42:1, pages 143-175, January 2001.
[Abstract] Download: [ps, pdf]
(An earlier version appears as IBM Research Report RJ 10147, July 8, 1999.)

Conference Publications

Learning Low-Rank Kernel Matrices
B. Kulis, M. A. Sustik, and I. S. Dhillon
Proceedings of the Twenty-third International Conference on Machine Learning (ICML), pages 505-512, June 2006.
Download: [pdf]

Generalized Nonnegative Matrix Approximations with Bregman Divergences
I. S. Dhillon and S. Sra
NIPS, pages 283-290, Vancouver Canada, December 2005.
(Also appears as UTCS Technical Report #TR-05-31, June 1, 2005. [Abstract & Download])

A Fast Kernel-based Multilevel Algorithm for Graph Clustering
I. S. Dhillon, Y. Guan, and B. Kulis
Proceedings of the Eleventh ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD), pages 629-634, August 2005.
Download: [ps, pdf]

Semi-supervised Graph Clustering: A Kernel Approach
B. Kulis, S. Basu, I. S. Dhillon, and R. Mooney
Proceedings of the Twenty-second International Conference on Machine Learning (ICML), pages 457-464, August 2005.
Download: [ps, pdf]

Triangle Fixing Algorithms for the Metric Nearness Problem
I. S. Dhillon, S. Sra, and J. A. Tropp
Eighteenth Annual Conference on Neural Information Processing Systems (NIPS), pages 361-368, December 2004.
Download: [ps, pdf]
(Also appears as UTCS Technical Report #TR-04-22, June 2004. [Abstract & Download])

Construction of Equiangular Signatures for Synchronous CDMA Systems
R. W. Heath Jr., J. A. Tropp, I. S. Dhillon, and T. Strohmer
Proceedings of IEEE International Symposium on Spread Spectrum Techniques and Applications, August 2004.
Download: [pdf]

Optimal CDMA Signatures: A Finite-Step Approach
J. A. Tropp, I. S. Dhillon, and R. W. Heath Jr.
Proceedings of IEEE International Symposium on Spread Spectrum Techniques and Applications, August 2004.
Download: [pdf]

Kernel k-means, Spectral Clustering and Normalized Cuts
I. S. Dhillon, Y. Guan, and B. Kulis
Proceedings of the Tenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD), pages 551-556, August 2004.
Download: [ps, pdf]
(A longer version appears as UTCS Technical Report #TR-04-25, June 30, 2004. [Abstract & Download])

A Generalized Maximum Entropy Approach to Bregman Co-Clustering and Matrix Approximations
A. Banerjee, I. S. Dhillon, J. Ghosh, S.
Merugu, and D. S. Modha
Proceedings of the Tenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD), pages 509-514, August 2004.
Download: [ps, pdf]

An Information Theoretic Analysis of Maximum Likelihood Mixture Estimation for Exponential Families
A. Banerjee, I. S. Dhillon, J. Ghosh, and S. Merugu
Proceedings of the Twenty-First International Conference on Machine Learning (ICML), pages 57-64, July 2004.
Download: [ps, pdf]

Clustering with Bregman Divergences
A. Banerjee, S. Merugu, I. S. Dhillon, and J. Ghosh
Proceedings of the Fourth SIAM International Conference on Data Mining, pages 234-245, April 2004.
Download: [ps, pdf]

Minimum Sum-Squared Residue Co-clustering of Gene Expression Data
H. Cho, I. S. Dhillon, Y. Guan, and S. Sra
Proceedings of the Fourth SIAM International Conference on Data Mining, pages 114-125, April 2004.
Download: [ps, pdf]

CDMA Signature Sequences with Low Peak-To-Average Ratio via Alternating Projection
J. A. Tropp, I. S. Dhillon, R. W. Heath Jr., and T. Strohmer
Proceedings of the Thirty-Seventh IEEE Asilomar Conference on Signals, Systems, and Computers, pages 475-479, November 2003.
Download: [pdf]

Information Theoretic Clustering of Sparse Co-Occurrence Data
I. S. Dhillon and Y. Guan
Proceedings of the Third IEEE International Conference on Data Mining, pages 517-520, November 2003.
Download: [ps, pdf]
(A longer version appears as UTCS Technical Report #TR-03-39, September 2003. [Abstract & Download])
(Also appears as "Clustering Large and Sparse Co-Occurrence Data", Workshop on Clustering High-Dimensional Data and its Applications at the Third SIAM International Conference on Data Mining, May 2003. Download: [ps, pdf])

Information-Theoretic Co-clustering
I. S. Dhillon, S. Mallela, and D. S. Modha
Proceedings of the Ninth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD), pages 89-98, August 2003.
Download: [ps, pdf]
(Also appears as UTCS Technical Report #TR-03-12, April 2003. [Abstract & Download])

Generative Model-based Clustering of Directional Data
A. Banerjee, I. S. Dhillon, J. Ghosh, and S. Sra
Proceedings of the Ninth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD), pages 19-28, August 2003.
Download: [ps, pdf]

Iterative Clustering of High Dimensional Text Data Augmented by Local Search
I. S. Dhillon, Y. Guan, and J. Kogan
Proceedings of the 2nd IEEE International Conference on Data Mining, pages 131-138, Maebashi, Japan, December 2002.
Download: [ps, pdf]

Enhanced Word Clustering for Hierarchical Text Classification
I. S. Dhillon, S. Mallela, and R. Kumar
Proceedings of the Eighth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD), July 2002.
Download: [ps, pdf]
(Also appears as UTCS Technical Report #TR-02-17, March 2002. [Abstract & Download])

Refining Clusters in High-dimensional Text Data
I. S. Dhillon, Y. Guan, and J. Kogan
2nd SIAM International Conference on Data Mining (Workshop on Clustering High-Dimensional Data and its Applications), April 2002.
Download: [ps, pdf]
(Also appears as UTCS Technical Report #TR-02-03, January 2002. [Abstract & Download])

Co-Clustering Documents and Words Using Bipartite Spectral Graph Partitioning
I. S. Dhillon
Proceedings of the Seventh ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD), August 26-29, 2001, San Francisco, California, USA.
Download: [ps, pdf]
(Also, a longer version appears as UTCS Technical Report #TR-01-05, March 2001. [Abstract & Download])

Visualizing Class Structure of Multidimensional Data
I. S. Dhillon, D. S. Modha, and W. S. Spangler
Proceedings of the Thirtieth Symposium on the Interface: Computing Science and Statistics, Interface Foundation of North America, vol. 30, pages 488-493, Minneapolis, May 1998.
Download: [ps]

Book Chapters

Feature Selection and Document Clustering
I.
S. Dhillon, J. Kogan, and M. Nicholas
Book chapter in Text Data Mining and Applications, 2002.

Efficient Clustering of Very Large Document Collections
I. S. Dhillon, Y. Guan, and J. Fan
Invited book chapter in Data Mining for Scientific and Engineering Applications, pages 357-381, 2001.
Download: [ps, pdf, HTML]

A Data Clustering Algorithm on Distributed Memory Multiprocessors
I. S. Dhillon and D. S. Modha
In Large-Scale Parallel Data Mining, Lecture Notes in Artificial Intelligence, vol. 1759, Springer-Verlag, pages 245-260, March 2000.
Download: [ps, pdf]
(Also appears as IBM Research Report RJ 10134)

Books

Technical Reports

Generalized Nonnegative Matrix Approximations with Bregman Divergences
I. S. Dhillon and Suvrit Sra
To appear in NIPS, Vancouver Canada, December 2005.
(Also appears as UTCS Technical Report #TR-05-31, June 1, 2005. [Abstract & Download])

A Unified View of Kernel k-means, Spectral Clustering and Graph Cuts
I. S. Dhillon, Yuqiang Guan, and Brian Kulis
UTCS Technical Report #TR-04-25, June 2004. [Abstract & Download]

Triangle Fixing Algorithms for the Metric Nearness Problem
I. S. Dhillon, S. Sra, and J. A. Tropp
Eighteenth Annual Conference on Neural Information Processing Systems (NIPS), pages 361-368, December 2004.
Download: [ps, pdf]
(Also appears as UTCS Technical Report #TR-04-22, June 2004. [Abstract & Download])

Generalized Finite Algorithms for Constructing Hermitian Matrices with Prescribed Diagonal and Spectrum
I. S. Dhillon, R. W. Heath Jr., M. A. Sustik, and J. A. Tropp
SIAM Journal on Matrix Analysis and Applications, vol. 27, no. 1, June 2005.
Download: [ps, pdf]
(Also appears as UTCS Technical Report #TR-03-49, December 2003. [Abstract & Download])

Designing Structured Tight Frames Via An Alternating Projection Method
J. A. Tropp, I. S. Dhillon, R. W. Heath Jr., and T. Strohmer
IEEE Transactions on Information Theory, vol. 51, no. 1, pages 188-209, January 2005.
Download: [pdf]
(Also appears as ICES Report 03-50, UT, December 2003. [Abstract & Download])

Information Theoretic Clustering of Sparse Co-Occurrence Data
I. S. Dhillon and Y. Guan
Proceedings of the Third IEEE International Conference on Data Mining, pages 517-521, November 2003.
Download: [ps, pdf]
(A longer version appears as UTCS Technical Report #TR-03-39, September 2003. [Abstract & Download])
(Also appears as "Clustering Large and Sparse Co-Occurrence Data", Workshop on Clustering High-Dimensional Data and its Applications at the Third SIAM International Conference on Data Mining, May 2003. Download: [ps, pdf])

Finite-Step Algorithms for Constructing Optimal CDMA Signature Sequences
J. A. Tropp, I. S. Dhillon, and R. W. Heath Jr.
IEEE Transactions on Information Theory, vol. 50, no. 11, pages 2916-2921, November 2004.
Download: [pdf]
(Also appears as WNCG-TR-2003-05-08, UT, June 2003. [Abstract & Download])

The Metric Nearness Problem with Applications
I. S. Dhillon, S. Sra, and J. A. Tropp
UTCS Technical Report #TR-03-23, July 2003. [Abstract & Download]

Information-Theoretic Co-clustering
I. S. Dhillon, S. Mallela, and D. S. Modha
Proceedings of the Ninth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD), pages 89-98, August 2003.
Download: [ps, pdf]
(Also appears as UTCS Technical Report #TR-03-12, April 2003. [Abstract & Download])

Expectation Maximization for Clustering on Hyperspheres
A. Banerjee, I. S. Dhillon, J. Ghosh, and S. Sra
UTCS Technical Report #TR-03-07, February 2003. [Abstract & Download]

Modeling data using Directional Distributions
I. S. Dhillon and S. Sra
UTCS Technical Report #TR-03-06, January 2003. [Abstract & Download]

Diametrical Clustering for identifying Anti-Correlated Gene Clusters
I. S. Dhillon, E. M. Marcotte, and U. Roshan
Bioinformatics, vol. 19, no. 13, pages 1612-1619, 2003.
Download: [ps, pdf]
(Also appears as UTCS Technical Report #TR-02-49, September 2002.
[Abstract & Download])

Refining Clusters in High-dimensional Text Data
I. S. Dhillon, Y. Guan, and J. Kogan
2nd SIAM International Conference on Data Mining (Workshop on Clustering High-Dimensional Data and its Applications), April 2002.
Download: [ps, pdf]
(Also appears as UTCS Technical Report #TR-02-03, January 2002. [Abstract & Download])

Enhanced Word Clustering for Hierarchical Text Classification
I. S. Dhillon, S. Mallela, and R. Kumar
Proceedings of the Eighth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD), July 2002.
Download: [ps, pdf]
(Also appears as UTCS Technical Report #TR-02-17, March 2002. [Abstract & Download])

Co-Clustering Documents and Words Using Bipartite Spectral Graph Partitioning
I. S. Dhillon
Proceedings of the Seventh ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD), August 26-29, 2001, San Francisco, California, USA.
Download: [ps, pdf]
(Also, a longer version appears as UTCS Technical Report #TR-01-05, March 2001. [Abstract & Download])

A Data Clustering Algorithm on Distributed Memory Multiprocessors
I. S. Dhillon and D. S. Modha
KDD, August 1999.
Download: [ps, pdf]
(Also, Large-Scale Parallel Data Mining, Lecture Notes in Artificial Intelligence, Volume 1759, pages 245-260, 2000.)
(Also appears as IBM Research Report RJ 10134)

Concept Decompositions for Large Sparse Text Data using Clustering
I. S. Dhillon and D. S. Modha
Machine Learning, vol. 42:1, pages 143-175, January 2001.
Download: [ps, pdf]
(An earlier version appears as IBM Research Report RJ 10147, July 8, 1999.)