Colloquia: Yuanyuan Zhou/University of Illinois at Urbana-Champaign Department of Computer Science Online Software Error Detection Recovery and Diagnosis during Production Runs in ACES 2.302

Contact Name: 
Jenna Whitney
Date: 
Feb 23, 2006 11:00am - 12:00pm


There is a signup schedule for this event.

Spea

ker Name/Affiliation: Yuanyuan Zhou/University of Illinois at Urbana-Champ

aign Department of Computer Science

Talk Title: Online Software Er

ror Detection Recovery and Diagnosis during Production Runs

Date/Ti

me: Thursday February 23 2006 at 11:00 am

Coffee: 10:45 am
Location: ACES 2.302

Host: Lorenzo Alvisi

Talk Abstract:<

br>As computers have been widely used in almost every aspect of
our dail

y life the issues related to computer system
dependability is becoming

increasingly important.
Unfortunately software errors significantly aff

ect system
reliability and availability. Although many software errors<

br>can be eliminated using various techniques before software
release d

ue to the increasing software complexity many errors
still inevitably e

scape into production runs making system
unavailable and causing severe
damages. Therefore it is
critically important to devise practical and

low-overhead
solutions to automatically detect diagnose and recover fro

m
these errors.

In this talk I will present one important aspect
of our ARTS
project namely online software error detection recovery a

nd
diagnosis during productions runs. Specifically I will
describe a
general system framework that synergistically
combines hardware-assiste

d efficient software error detection
with innovative proactive softwar

e failure recovery via
controlled execution environmental changes. In a

ddition this
framework also provides an effective way for online error<

br>diagnosis during production runs to provide programmers useful
clues

for postmortem analysis. At the end of my talk I will
briefly summarize
our other research work in software
reliability energy management and

database storage as well as
our future research plan.

Speaker Bi

o:
Yuanyuan Zhou is an assistant professor in the
Department of Com

puter Science at Univ of Illinois at Urbana
Champaign since 2002. Prior
to UIUC she worked at NEC
Research Institute as a scientist after comp

leting her Ph.D at
Princeton in 2000. Her research interests span the ar

eas of
operating systems architecture storage systems and software

reliability. She was the recipient for the NSF Career-2004
award the C

RA-W Anita-Borg Early Career Award 2005 the DOE
Early Career Principle

Investigator Award 2005 the IBM Faculty
Award 2004 & 2005 and the IBM

SUR-2003 award. Two of her
papers were among the 14 papers selected int

o the IEEE Micro
Special Issue on Top Picks from architecture conference

s of
2004. Recently one of her SOSP papers was selected for fast
for

warding to ACM Transactions on Computer Systems Special
Issue on Best Pa

pers from SOSP 2005. She was also selected into
the Incomplete List of T

eachers Ranked as Excellent by Their
Students in 2003 at UIUC.