To UTCS home To LASR

 
  « Distinguished Lecture Series on Internet and Grid Computing
 
Ian Foster   Grid Computing and the Open Grid Services Architecture

Thursday, November 7, 2002  ¤  10:30am Coffee   11:00am-12:00pm  ¤  ACES 2.402 (seminar)

The Virtual Data Grid: A New Model and Architecture for Data-Intensive Collaboration

Friday, November 8, 2002  ¤  10:30am Coffee   11:00am-12:00pm  ¤  Taylor 3.128
Ian Foster     foster@mcs.anl.gov
Associate Director, Mathematics & Computer Science Division, and Head,
Distributed Systems Lab, Argonne National Laboratory
Professor of Computer Science, University of Chicago
 
Abstract    Grid Computing and the Open Grid Services Architecture
In both e-business and e-science, we often need to integrate services across distributed, heterogeneous, dynamic "virtual organizations" formed from the disparate resources within a single enterprise and/or via external resource sharing relationships. This integration can be technically challenging due to the need to achieve various qualities of service in heterogeneous environments. I introduce this "Grid problem," discuss the origins and applications of Grid technologies in the world of science, and present recent work on an Open Grid Services Architecture that seeks to generalize Grid computing concepts to create a powerful framework for distributed resource sharing and management.
 
Abstract    The Virtual Data Grid: A New Model and Architecture for Data-Intensive Collaboration
It is increasingly common to encounter communities engaged in the collaborative analysis and transformation of large quantities of data over extended periods of time. We argue that these communities require a scalable system for managing, tracing, exploring and communicating the derivation and analysis of diverse data objects. Such a system could bring significant productivity increases facilitating discovery, understanding, assessment, and sharing of both data and transformation resources, as well as facilitating the productive use of distributed resources for computation, storage, and collaboration. Thus, we define a model and architecture for a virtual data grid capable of addressing this requirement. We define a broadly applicable model of a "typed dataset" as the unit of derivation tracking, and simple constructs for describing how datasets are derived from transformations and from other datasets. We also define mechanisms for integrating with, and adapting to, existing data management systems and transformation and analysis tools, as well as Grid mechanisms for distributed resource management and computation planning. We report on successful application results obtained with a prototype implementation called Chimera, involving challenging analyses of high-energy physics and astronomy data.
 
Biography   Ian Foster received his PhD in Computer Science from Imperial College, England. Dr. Foster's research interests include algorithms and programming languages for scalable parallel computers, software engineering, and the application of parallel processing to problems in computational science. In 1989 Dr. Foster was awarded the British Computer Society Award for Technical Innovation for his work on the Strand parallel programming language. Dr. Foster has written 3 books, including "The Grid: Blueprint for a New Computing Infrastructure," and "Designing and Building Parallel Programs," and he has authored numerous technical papers. Dr. Foster has served as program chair for over 10 conferences, including the ACM Symposium on Principles and Practices of Parallel Programming and Supercomputing, and he currently is on the editorial board for IEEE Parallel and Distributed Technology.
« 10/14 Vahdat 11/14-15 Kaashoek »