This course will cover traditional material as well as recent advances in information retrieval (IR), the study of the indexing, processing, and querying of textual data. The focus will be on newer techniques that try to move beyond keyword search and bring some "intelligence" to the task of processing and retrieving textual information, including hypertext documents available on the world-wide-web. See the course syllabus for further details.
Periodical reading assignments from the text and recent research papers will be given and should be read before the corresponding lecture. Students will be held responsible for all information presented either in class or in the reading assignments.
Class lectures will use Powerpoint presentations available on the course homepage . This should allow students to focus on understanding the material during class and reduce the need for taking notes; however, simply reading the slides is no substitute for attending class in which additional explanation and discussion is presented.
Prompt attendance at class is mandatory. Attendance will be taken at the very beginning of every class period and a student's attendance record will count as part of the course grade. Leaving during the middle of class in unacceptable and will count as missing two class periods.
There will be four programming assignments throughout the semester. These will involve using and modifying existing Java programs for various IR and web search tasks. If you do not already know Java you will be expected to learn it on your own. You can use your student account on the department Unix workstations or any other Java platform available to you (however, we will only provide support for running on departmental Unix machines). If you are not a CS student and need a temporary department account, apply on the web at https://udb.cs.utexas.edu/amut/acut/.
There will be a midterm exam the last class before spring break that will cover the material in the first half of the course. We will review for the test the class before.
The final will be held at 7-10PM on Sat, May 15 in the normal classroom. It will be comprehensive test covering all material presented in the course. We will review for the final on the last class day.
Read the department's academic policy page at http://www.cs.utexas.edu/users/ear/CodeOfConduct.html. Students who demonstrably violate the Academic Honesty policy will receive a failing grade in the class. We will be using the Moss system to screen submited programs for plagiarism. Over the past several years, I unfortunately had to fail about 10 students for copying on programming assignments. To avoid problems, limit any discussion of assignments with other students to clarification of the requirements or definitions of the problems, or to understanding the existing programs or general course material. Never discuss issues directly relevant to problem solutions.
46% Assignments 18% Midterm 28% Final 8% Attendance