Craig Corcoran
Jan 24, 2013 11:00am - 12:00pm

Kinect-style depth cameras offer real-time synchronized color and depth data in a convenient package at a consumer price. Such RGB-D cameras are dramatically changing the research and application landscapes of vision, robotics and HCI. I will take you through our journey of investigating and promoting the joint use of color and depth toward rich sensing solutions under real-world conditions, from 3D modeling of indoor environments to fine-grained recognition of objects, scenes and activities. Our main approach is feature learning: designing and learning rich features in hierarchical structures that apply seamlessly to both color and depth. Our work on hierarchical matching pursuit uses efficient sparse coding algorithms, namely Orthogonal Matching Pursuit and K-SVD, as building blocks to extract rich features at varying scales and deformations, outperforming hand-designed features by large margins on both color and RGB-D object recognition. Such learned features also help to improve the state of the art on a variety of tasks such as scene classification, labeling and segmentation. RGB-D perception shines in both robustness and efficiency, and is on the fast track to becoming the general sensing solution for future pervasive and context-aware systems.