Summer 2000 Project Ideas June 13,2000 David Colby (utdave@cs.utexas.edu) Daniel Kim (dankim@cs.utexas.edu) Shiraz Laeeq (lucky@cs.utexas.edu) Joe Lenert (joelen@cs.utexas.edu) Jaime Napier (napier@cs.utexas.edu) Brian Quicksall (brianq@cs.utexas.edu) Pierre Tsai (ptsai@cs.utexas.edu) Chris Edmondson-Yurkanan (chris@cs.utexas.edu) ----------------------------------------------------------------------------- 1) To support Goals A and B: what does it mean to "create online, searchable archives" 1.1 what is required from a digital library perspective (answer? some sort of indexing design (title, date, authors, historic designation, unique id, category, keywords...) plus the usage of supporting software, such as XML, XSL, bibtex, DBMS) 1.2 should the documents be scanned and OCRed or retyped? perform an experiment to help determine the answer (what's the best scanner? what's the best OCR software?) 1.3 what is required to create online archives? steps, time, equipment(hw/sw), personnel, $, other questions such as: will sources require that this typically be performed onsite or will they let the materials be sent to a production facility? 1.4 what is required to create *searchable* archives? sw (what features would be most helpful? --- I like the UC Berkeley CS dept search engine, but may not be required for this...) hw ???? (will the load ever be such that we need to purchase a server?) 1.5 hosting: one vs multiple sites? under whose auspices: Univ of Texas, Internet Society, SIGCOMM, Charles Babbage Institute, ACM, IEEE, ISI, Society for the History of Technology, History of Science Society what sysadmin or network administration is needed? 1.6 format of files (searchable files: html with mostly text and occastional links to pdf figures?) (non searchable files: pdf?) 1.7 study related projects (such as history of computing, history of programming languages) CCEY * contact UT HofS, GSLIS, UT Libraries TARO * contact key people in History of computing, archivists, 2) To support Goal C: 2.1 Everyone should read Where Wizards stay up Late (by Hafner and Lyons) 2.2 Build an annotated web page on each of the following network archi- tectures: Baran's, Davies', Arpanet, Early days of the Internet, Cyclades Find all the resources that you can that relate to your topic; papers from old proceedings (study their reference lists); biblio- graphy from the network history books, include the Charles Babbage Institute interviews by name. Resources will be mostly offline, but some may be online; create a bibliography of as many documents as possible, and identify which you think are important enough for us to order, purchase, request copies of....) Build an annotated web page on History of computing what type of work has been done? what people are doing it and where are the archives? What types of technical descriptions are being written? Build an annotated web page on Digital Archiving and Science of History Build an umbrella project web page (where we will archive everything from the summer) 2.3 Divide up the other network history books and write a report on each: strengths of the book; goals and audience of the book. comparison with Where Wizards .... 2.4 Create the taxonomy of network protocols to use in C1/C2/C3 (by surveying terms used by Gouda, Peterson/Davie, Keshav, Tanenbaum, Stallings, with insights from Schulzrinne and Ammar) 2.5 As a group, critique my proposal of the type of writing: C1, C2, C3, and create your own proposal of what needs to be written and how.... 2.6 Write drafts of C1 and C2 for each of Davies, Baran, Arpanet (multiple iterations?), Early Internet days (if time Cyclades). Critique each others writing. Critique each others technical summaries. Now, what? do we still like the definitions of C1 and C2???? or does it make sense to rework this? 2.7 As you work on the above, keep thinking about C4 (any interviews) 3) RE Goal D: I suggest that at least two students work on one or more of the following architectures over the summer Baran's Distributed Communication architecture (??? was it ever built) Donald Davies' NPL architecture (??? was it ever built) Arpanet (iteration 1)... does this make sense? what timeframe? Arpanet (iteration 2) Internet beginnings (through about 1980?) Pouzin's Cyclades 4) To support Goal E: see http://www-ninch.cni.org/PROJECTS/Building/may3.99.html... for a description of a project "BUILDING BLOCKS" funded by nsf with an interesting structure: steering committee, "field committees", workshop groups, multiple mtgs a year CCE-Y: contact Worthy Martin, CS Dept, U of Va. (a steering committee member of Building Blocks) 5) Write a grant proposal.