Workshop on Clustering High Dimensional Data and its Applications
April 23, 2005
Sutton Place Hotel
Newport Beach, California

To be Held in Conjunction with the
Fifth SIAM International Conference on Data Mining (SDM 2005)



Workshop Theme

Applications in many domains, such as text and web mining and bioinformatics, often produce very high-dimensional data, with dimensions in the hundreds or thousands. These data sets are also often sparse. Clustering such high-dimensional data sets is a contemporary challenge: successful algorithms must avoid the curse of dimensionality while remaining computationally efficient.

A one-day workshop on Clustering High Dimensional Data and its Applications will be held in conjunction with SDM 2005 in California (April '05). It will bring together researchers to present their current approaches and results in clustering high-dimensional data that arise in various applications. Particular applications of interest are bioinformatics, text mining, market-basket analysis, and web log analysis. This workshop continues a series of workshops on this topic held at previous SDM conferences -- see URL for the workshop at SDM 2004.

Topics of interest include:


Registration

Attendees are required to register for SDM 2005, but no separate registration is needed for this workshop.

Submission Requirements

Original papers on clustering high-dimensional data are solicited. For consideration, send an electronic submission (PostScript or PDF only, printable on 8.5 x 11 inch paper) to Amy Levin:

An email with the title, authors, and abstract of the paper should be sent separately in plain ASCII format (no HTML tags, please).

To guarantee consideration, manuscripts must be received by January 10, 2005, and must be no more than 10 pages excluding figures, tables, and references. Submission of work in progress is also encouraged.

All accepted papers whose camera-ready copies are received by the February 25, 2005 deadline (see below) will be distributed as photocopied proceedings available at the conference for purchase by attendees. Electronic copies will also be put on a SIAM web site.

Important Dates

Papers Due:
Jan 10, 2005

Notification of Acceptance:
Feb 4, 2005

Camera ready:
Feb 25, 2005

Workshop:
April 23, 2005

Workshop Schedule

Keynote Speaker: Prof. Kenneth Rose (University of California, Santa Barbara)

Program Committee

Devasis Bassu, Telcordia Research
Cliff Behrens, Telcordia Technologies
Mikhail Belkin, University of Chicago
Paul Bradley, Bradley Data Consulting, LLC
Ian Davidson, SUNY, Albany
Chris Ding, NERSC, Lawrence Berkeley Lab
Jennifer Dy, Northeastern University
Efstratios Gallopoulos, University of Patras
Efim Gendler,
Thomas Hofmann, Brown University
Latifur Khan, University of Texas, Dallas
Andrew Knyazev, University of Colorado
Jon Kettenring, Telcordia Research
Mei Kobayashi, IBM Research
Shailesh Kumar, Fair Isaac
Dharmendra Modha, IBM Almaden Research Center
Nick Street, University of Iowa
Zeev (Vladimir) Volkovich, Ort Braude College, Israel
Shi Zhong, Florida Atlantic University
Leonid Zhukov, Yahoo! Research Labs

Organizing Committee

Inderjit Dhillon
Department of Computer Science
University of Texas
Austin, TX 78712-1188
Phone: (512) 471-9725
Fax: (512) 471-8885

Joydeep Ghosh
Department of Electrical & Computer Engineering
University of Texas
Austin, TX 78712-1188
Phone: (512) 471-8980
Fax: (512) 471-2893

Jacob Kogan
Department of Mathematics and Statistics
Univ. of Maryland, Baltimore County
Baltimore, MD 21250
Phone: (410) 455-3297
Fax: (410) 455-1066

Last modified on Oct 19, 2004.