CS 395T: Large-Scale Data Mining

Instructor:

Inderjit Dhillon

Homepage:

cs.utexas.edu/users/inderjit/courses/datamining.html

Description

This graduate course will focus on scalable algorithms for data mining. A special emphasis will be on information retrieval, especially for the World Wide Web. Topics covered will include (i) link analysis on the internet (like Google), (ii) content analysis of documents using SVD (Singular Value Decomposition), SVMs (Support Vector Machines), and linear discriminant analysis, (iii) image segmentation using graph partitioning, (iv) face detection in images, (v) clustering and classification algorithms, (vi) visualization of high-dimensional data, etc. We may study other application areas, such as bioinformatics, if there is sufficient interest and need.

Prerequisites

Graduate standing required.