Publications by TYPE

Publications in Progress

  • Estimating the Global PageRank of Web Communities
    J. Davis and I. S. Dhillon
    Proceedings of the Twelfth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pages 116-125, August 2006.
    Download: [pdf]
  • A Unified View of Kernel k-means, Spectral Clustering and Graph Partitioning
    I. S. Dhillon, Y. Guan, and B. Kulis
    Submitted for publication, January 2005.
    (Also, appears as UTCS Technical Report #TR-04-25, June 30, 2004. [Abstract & Download])
  • Matrix Nearness Problems using Bregman Divergences
    I. S. Dhillon and J. A. Tropp
    Submitted for publication, 2006.
    Download: [pdf]
  • Co-clustering of Human Cancer Microarrays using Minimum Sum-Squared Residue Co-clustering
    H. Cho and I. S. Dhillon
    Submitted for publication, 2006.
  • On The Existence of Equiangular Uniform Frames
    M. A. Sustik, J. A. Tropp, I. S. Dhillon, and R. W. Heath Jr.
    Submitted for publication to Linear Algebra and its Applications, 2005.
    Download: [ps, pdf]
  • Constructing Grassmannian Packings via Alternating Projections
    J. A. Tropp, I. S. Dhillon, R. W. Heath Jr., and T. Strohmer
    In preparation, 2005.

Journal Publications

  • Clustering with Bregman Divergences
    A. Banerjee, S. Merugu, I. S. Dhillon, and J. Ghosh
    Journal of Machine Learning Research, vol. 6, pages 1705-1749, October 2005.
    Download: [pdf]
  • Clustering on the Unit Hypersphere using von Mises-Fisher Distributions
    A. Banerjee, I. S. Dhillon, J. Ghosh, and S. Sra
    Journal of Machine Learning Research, vol. 6, pages 1345-1382, September 2005.
    Download: [pdf]
  • Generalized Finite Algorithms for Constructing Hermitian Matrices with Prescribed Diagonal and Spectrum
    I. S. Dhillon, R. W. Heath Jr., M. A. Sustik, and J. A. Tropp
    SIAM Journal on Matrix Analysis and Applications, vol. 27, no. 1, pages 61-71, June 2005.
    Download: [ps, pdf]
    (Also, appears as UTCS Technical Report #TR-03-49, December 2003. [Abstract & Download])
  • Designing Structured Tight Frames Via An Alternating Projection Method
    J. A. Tropp, I. S. Dhillon, R. W. Heath Jr., and T. Strohmer
    IEEE Transactions on Information Theory, vol. 51, no. 1, pages 188-209, January 2005.
    Download: [pdf]
    (Also, appears as ICES Report 03-50, UT, December 2003. [Abstract & Download])
  • Finite-Step Algorithms for Constructing Optimal CDMA Signature Sequences
    J. A. Tropp, I. S. Dhillon, and R. W. Heath Jr.
    IEEE Transactions on Information Theory, vol. 50, no. 11, pages 2916-2921, November 2004.
    Download: [pdf]
    (Also, appears as WNCG-TR-2003-05-08, UT, June 2003. [Abstract & Download])
  • Diametrical Clustering for identifying Anti-Correlated Gene Clusters
    I. S. Dhillon, E. M. Marcotte, and U. Roshan
    Bioinformatics, vol. 19, no. 13, pages 1612-1619, 2003.
    Download: [ps, pdf]
    (Also, appears as UTCS Technical Report #TR-02-49, September 2002. [Abstract & Download])
  • A Divisive Information-Theoretic Feature Clustering Algorithm for Text Classification
    I. S. Dhillon, S. Mallela, and R. Kumar
    Journal of Machine Learning Research (JMLR), vol. 3, pages 1265-1287, March 2003.
    Download: [ps, pdf]
  • Class Visualization of High-Dimensional Data with Applications
    I. S. Dhillon, D. S. Modha, and W. S. Spangler
    Computational Statistics & Data Analysis (Special issue on Matrix Computations & Statistics), vol. 41:1, pages 59-90, 2002.
    Download: [ps, pdf]
    Software is available here.
  • Concept Decompositions for Large Sparse Text Data using Clustering
    I. S. Dhillon and D. S. Modha
    Machine Learning, vol. 42:1, pages 143-175, January 2001. [Abstract]
    Download: [ps, pdf]
    (An earlier version appears as IBM Research Report RJ 10147, July 8, 1999.)

Conference Publications

  • Learning Low-Rank Kernel Matrices
    B. Kulis, M. A. Sustik, and I. S. Dhillon
    Proceedings of the Twenty-third International Conference on Machine Learning (ICML), pages 505-512, June 2006.
    Download: [pdf]
  • Generalized Nonnegative Matrix Approximations with Bregman Divergences
    I. S. Dhillon and S. Sra
    NIPS, pages 283-290, Vancouver, Canada, December 2005.
    (Also, appears as UTCS Technical Report #TR-05-31, June 1, 2005. [Abstract & Download])
  • A Fast Kernel-based Multilevel Algorithm for Graph Clustering
    I. S. Dhillon, Y. Guan, and B. Kulis
    Proceedings of the Eleventh ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD), pages 629-634, August 2005.
    Download: [ps, pdf]
  • Semi-supervised Graph Clustering: A Kernel Approach
    B. Kulis, S. Basu, I. S. Dhillon, and R. Mooney
    Proceedings of the Twenty-second International Conference on Machine Learning (ICML), pages 457-464, August 2005.
    Download: [ps, pdf]
  • Triangle Fixing Algorithms for the Metric Nearness Problem
    I. S. Dhillon, S. Sra, and J. A. Tropp
    Eighteenth Annual Conference on Neural Information Processing Systems (NIPS), pages 361-368, December 2004.
    Download: [ps, pdf]
    (Also, appears as UTCS Technical Report #TR-04-22, June 2004. [Abstract & Download])
  • Construction of Equiangular Signatures for Synchronous CDMA Systems
    R. W. Heath Jr., J. A. Tropp, I. S. Dhillon, and T. Strohmer
    Proceedings of IEEE International Symposium on Spread Spectrum Techniques and Applications, August 2004.
    Download: [pdf]
  • Optimal CDMA Signatures: A Finite-Step Approach
    J. A. Tropp, I. S. Dhillon, and R. W. Heath Jr.
    Proceedings of IEEE International Symposium on Spread Spectrum Techniques and Applications, August 2004.
    Download: [pdf]
  • Kernel k-means, Spectral Clustering and Normalized Cuts
    I. S. Dhillon, Y. Guan, and B. Kulis
    Proceedings of the Tenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD), pages 551-556, August 2004.
    Download: [ps, pdf]
    (A longer version appears as UTCS Technical Report #TR-04-25, June 30, 2004. [Abstract & Download])
  • A Generalized Maximum Entropy Approach to Bregman Co-Clustering and Matrix Approximations
    A. Banerjee, I. S. Dhillon, J. Ghosh, S. Merugu, and D. S. Modha
    Proceedings of the Tenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD), pages 509-514, August 2004.
    Download: [ps, pdf]
  • An Information Theoretic Analysis of Maximum Likelihood Mixture Estimation for Exponential Families
    A. Banerjee, I. S. Dhillon, J. Ghosh, and S. Merugu
    Proceedings of the Twenty-First International Conference on Machine Learning (ICML), pages 57-64, July 2004.
    Download: [ps, pdf]
  • Clustering with Bregman Divergences
    A. Banerjee, S. Merugu, I. S. Dhillon, and J. Ghosh
    Proceedings of the Fourth SIAM International Conference on Data Mining, pages 234-245, April 2004.
    Download: [ps, pdf]
  • Minimum Sum-Squared Residue Co-clustering of Gene Expression Data
    H. Cho, I. S. Dhillon, Y. Guan, and S. Sra
    Proceedings of the Fourth SIAM International Conference on Data Mining, pages 114-125, April 2004.
    Download: [ps, pdf]
  • CDMA Signature Sequences with Low Peak-To-Average Ratio via Alternating Projection
    J. A. Tropp, I. S. Dhillon, R. W. Heath Jr., and T. Strohmer
    Proceedings of the Thirty-Seventh IEEE Asilomar Conference on Signals, Systems, and Computers, pages 475-479, November 2003.
    Download: [pdf]
  • Information Theoretic Clustering of Sparse Co-Occurrence Data
    I. S. Dhillon and Y. Guan
    Proceedings of the Third IEEE International Conference on Data Mining, pages 517-520, November 2003.
    Download: [ps, pdf]
    (A longer version appears as UTCS Technical Report #TR-03-39, September 2003. [Abstract & Download])
    (Also, appears as "Clustering Large and Sparse Co-Occurrence Data", Workshop on Clustering High-Dimensional Data and its Applications at the Third SIAM International Conference on Data Mining, May 2003. Download: [ps, pdf])
  • Information-Theoretic Co-clustering
    I. S. Dhillon, S. Mallela, and D. S. Modha
    Proceedings of the Ninth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD), pages 89-98, August 2003.
    Download: [ps, pdf]
    (Also, appears as UTCS Technical Report #TR-03-12, April 2003. [Abstract & Download])
  • Generative Model-based Clustering of Directional Data
    A. Banerjee, I. S. Dhillon, J. Ghosh, and S. Sra
    Proceedings of the Ninth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD), pages 19-28, August 2003.
    Download: [ps, pdf]
  • Iterative Clustering of High Dimensional Text Data Augmented by Local Search
    I. S. Dhillon, Y. Guan, and J. Kogan
    Proceedings of the 2nd IEEE International Conference on Data Mining, pages 131-138, Maebashi, Japan, December 2002.
    Download: [ps, pdf]
  • Enhanced Word Clustering for Hierarchical Text Classification
    I. S. Dhillon, S. Mallela, and R. Kumar
    Proceedings of the Eighth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD), July 2002.
    Download: [ps, pdf]
    (Also, appears as UTCS Technical Report #TR-02-17, March 2002. [Abstract & Download])
  • Refining Clusters in High-dimensional Text Data
    I. S. Dhillon, Y. Guan, and J. Kogan
    2nd SIAM International Conference on Data Mining (Workshop on Clustering High-Dimensional Data and its Applications), April 2002.
    Download: [ps, pdf]
    (Also, appears as UTCS Technical Report #TR-02-03, January 2002. [Abstract & Download])
  • Co-Clustering Documents and Words Using Bipartite Spectral Graph Partitioning
    I. S. Dhillon
    Proceedings of the Seventh ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD), August 26-29, 2001, San Francisco, California, USA.
    Download: [ps, pdf]
    (Also, a longer version appears as UTCS Technical Report #TR-01-05, March 2001. [Abstract & Download])
  • Visualizing Class Structure of Multidimensional Data
    I. S. Dhillon, D. S. Modha, and W. S. Spangler
    Proceedings of the Thirtieth Symposium on the Interface: Computing Science and Statistics, Interface Foundation of North America, vol. 30, pages 488-493, Minneapolis, May 1998.
    Download: [ps]

Book Chapters

  • Feature Selection and Document Clustering
    I. S. Dhillon, J. Kogan, and M. Nicholas
    Book chapter in Text Data Mining and Applications, 2002.
  • Efficient Clustering of Very Large Document Collections
    I. S. Dhillon, Y. Guan, and J. Fan
    Invited book chapter in Data Mining for Scientific and Engineering Applications, pages 357-381, 2001.
    Download: [ps, pdf, HTML]
  • A Data Clustering Algorithm on Distributed Memory Multiprocessors
    I. S. Dhillon and D. S. Modha
    In Large-Scale Parallel Data Mining, Lecture Notes in Artificial Intelligence, vol. 1759, Springer-Verlag, pages 245-260, March 2000.
    Download: [ps, pdf]
    (Also, appears as IBM Research Report RJ 10134)

Books

Technical Reports

  • Generalized Nonnegative Matrix Approximations with Bregman Divergences
    I. S. Dhillon and S. Sra
    NIPS, pages 283-290, Vancouver, Canada, December 2005.
    (Also, appears as UTCS Technical Report #TR-05-31, June 1, 2005. [Abstract & Download])
  • A Unified View of Kernel k-means, Spectral Clustering and Graph Cuts
    I. S. Dhillon, Y. Guan, and B. Kulis
    UTCS Technical Report #TR-04-25, June 2004. [Abstract & Download]
  • Triangle Fixing Algorithms for the Metric Nearness Problem
    I. S. Dhillon, S. Sra, and J. A. Tropp
    Eighteenth Annual Conference on Neural Information Processing Systems (NIPS), pages 361-368, December 2004.
    Download: [ps, pdf]
    (Also, appears as UTCS Technical Report #TR-04-22, June 2004. [Abstract & Download])
  • Generalized Finite Algorithms for Constructing Hermitian Matrices with Prescribed Diagonal and Spectrum
    I. S. Dhillon, R. W. Heath Jr., M. A. Sustik, and J. A. Tropp
    SIAM Journal on Matrix Analysis and Applications, vol. 27, no. 1, pages 61-71, June 2005.
    Download: [ps, pdf]
    (Also, appears as UTCS Technical Report #TR-03-49, December 2003. [Abstract & Download])
  • Designing Structured Tight Frames Via An Alternating Projection Method
    J. A. Tropp, I. S. Dhillon, R. W. Heath Jr., and T. Strohmer
    IEEE Transactions on Information Theory, vol. 51, no. 1, pages 188-209, January 2005.
    Download: [pdf]
    (Also, appears as ICES Report 03-50, UT, December 2003. [Abstract & Download])
  • Information Theoretic Clustering of Sparse Co-Occurrence Data
    I. S. Dhillon and Y. Guan
    Proceedings of the Third IEEE International Conference on Data Mining, pages 517-520, November 2003.
    Download: [ps, pdf]
    (A longer version appears as UTCS Technical Report #TR-03-39, September 2003. [Abstract & Download])
    (Also, appears as "Clustering Large and Sparse Co-Occurrence Data", Workshop on Clustering High-Dimensional Data and its Applications at the Third SIAM International Conference on Data Mining, May 2003. Download: [ps, pdf])
  • Finite-Step Algorithms for Constructing Optimal CDMA Signature Sequences
    J. A. Tropp, I. S. Dhillon, and R. W. Heath Jr.
    IEEE Transactions on Information Theory, vol. 50, no. 11, pages 2916-2921, November 2004.
    Download: [pdf]
    (Also, appears as WNCG-TR-2003-05-08, UT, June 2003. [Abstract & Download])
  • The Metric Nearness Problem with Applications
    I. S. Dhillon, S. Sra, and J. A. Tropp
    UTCS Technical Report #TR-03-23, July 2003. [Abstract & Download]
  • Information-Theoretic Co-clustering
    I. S. Dhillon, S. Mallela, and D. S. Modha
    Proceedings of the Ninth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD), pages 89-98, August 2003.
    Download: [ps, pdf]
    (Also, appears as UTCS Technical Report #TR-03-12, April 2003. [Abstract & Download])
  • Expectation Maximization for Clustering on Hyperspheres
    A. Banerjee, I. S. Dhillon, J. Ghosh, and S. Sra
    UTCS Technical Report #TR-03-07, February 2003. [Abstract & Download]
  • Modeling data using Directional Distributions
    I. S. Dhillon and S. Sra
    UTCS Technical Report #TR-03-06, January 2003. [Abstract & Download]
  • Diametrical Clustering for identifying Anti-Correlated Gene Clusters
    I. S. Dhillon, E. M. Marcotte, and U. Roshan
    Bioinformatics, vol. 19, no. 13, pages 1612-1619, 2003.
    Download: [ps, pdf]
    (Also, appears as UTCS Technical Report #TR-02-49, September 2002. [Abstract & Download])
  • Refining Clusters in High-dimensional Text Data
    I. S. Dhillon, Y. Guan, and J. Kogan
    2nd SIAM International Conference on Data Mining (Workshop on Clustering High-Dimensional Data and its Applications), April 2002.
    Download: [ps, pdf]
    (Also, appears as UTCS Technical Report #TR-02-03, January 2002. [Abstract & Download])
  • Enhanced Word Clustering for Hierarchical Text Classification
    I. S. Dhillon, S. Mallela, and R. Kumar
    Proceedings of the Eighth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD), July 2002.
    Download: [ps, pdf]
    (Also, appears as UTCS Technical Report #TR-02-17, March 2002. [Abstract & Download])
  • Co-Clustering Documents and Words Using Bipartite Spectral Graph Partitioning
    I. S. Dhillon
    Proceedings of the Seventh ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD), August 26-29, 2001, San Francisco, California, USA.
    Download: [ps, pdf]
    (Also, a longer version appears as UTCS Technical Report #TR-01-05, March 2001. [Abstract & Download])
  • A Data Clustering Algorithm on Distributed Memory Multiprocessors
    I. S. Dhillon and D. S. Modha
    KDD, August 1999.
    Download: [ps, pdf]
    (Also, Large-Scale Parallel Data Mining, Lecture Notes in Artificial Intelligence, Volume 1759, pages 245-260, 2000.)
    (Also, appears as IBM Research Report RJ 10134)
  • Concept Decompositions for Large Sparse Text Data using Clustering
    I. S. Dhillon and D. S. Modha
    Machine Learning, vol. 42:1, pages 143-175, January 2001.
    Download: [ps, pdf]
    (An earlier version appears as IBM Research Report RJ 10147, July 8, 1999.)