Course Specifications for CS 371R:
Information Retrieval and Web Search
When & Where: Fall, 2017; TuTh 9:30-11:00 AM; GDC 4.302
Unique Number: 51735
Professor: Ray Mooney, GDC 3.512, 471-9558, firstname.lastname@example.org
Professor's Office Hours: 11am-12pm Mon & Wed or by appointment
TA: Dongguang You (Nick), email@example.com
TA's Office Hours: 4:00pm-5:00pm Tue at TA station desk one, GDC 1st floor; 10:30am-11:30am Fri at TA station desk three, GDC 1st floor
Proctor: TBA, firstname.lastname@example.org
- Prerequisites: Basic knowledge of Java programming, data
structures, and algorithms.
Introduction to Information Retrieval by
Christopher D. Manning
Schütze (see linked website for on-line version,
published by Cambridge University Press in 2008).
- Recommended Supplementary Texts:
Search Engines: Information Retrieval in Practice by
W. B. Croft, D. Metzler, and T. Strohman, Pearson Education, 2009.
Modern Information Retrieval by
Ricardo Baeza-Yates and
Finding Out About by R.K. Belew, Cambridge Univ. Press, 2000.
Readings in Information Retrieval Edited by Karen Sparck Jones and Peter Willett
Mining the Web: Discovering Knowledge from Hypertext Data
Soumen Chakrabarti, Morgan-Kaufmann Publishers, 2003.
Managing Gigabytes : Compressing and Indexing Documents and Images by
Ian H. Witten, Alistair Moffat, and Timothy C. Bell.
- Recommended Java Books:
Java in a Nutshell by David Flanagan
The Java Programming Language
Core Servlets and JavaServer Pages (JSP) by Marty Hall.
- Recommended HTML Books:
by Ian S. Graham.
This course will cover traditional material as well as recent advances in
information retrieval (IR), the study of the processing, indexing, querying,
organization, and classification of textual documents, including hypertext
documents available on the world-wide-web. See the course syllabus for further details.
Course Requirements and Grading
Periodical reading assignments from the text and recent research papers will be
given and should be read before the corresponding lecture.
Students will be held responsible for all information presented either in class
or in the reading assignments.
Class lectures will use Powerpoint presentations available on the
course homepage . This should allow students to focus on understanding the
material during class and reduce the need for taking notes; however, simply
reading the slides is no substitute for attending class in which additional
explanation and discussion is presented.
Prompt attendance at class is required. Attendance will be taken at the very
beginning of every class period and a student's attendance record will count as
part of the course grade.
There will be four programming assignments throughout the semester. These will
involve using and modifying existing Java programs for various IR and web
search tasks. You can use your student account on the department Linux
workstations or any other Java platform available to you (however, we will only
provide support for running on departmental Linux machines). If you are not a
CS student and need a temporary department account, apply on the web
There will be a midterm exam on Thu. Oct. 19 that will cover the material in the
first half of the course. We will review for the test the class before. A
sample former midterm is available on the course home page.
The final will be held Monday, December 18, 2:00-5:00 pm in the normal classroom. It
will be comprehensive test covering all material presented in the course. We
will review for the final on the last class day. A sample former final is
available on the course home page.
We will be using Piazza for class
discussion. If you are not already enrolled for the class from the UT roster,
send me your email to enroll you. Class announcments will be posted here and
please post general class questions to Piazza.
Late Submission and Cheating Policies
Assignments, projects, and exams should be completed independently by
each student and any program code should always be appropriately commented.
Assignments should be
submitted electronically at least 30 minutes before the
start of class on the due date. Be sure to hand in assignments on time, late
penalties are a loss of a percentage of the original overall points for the
assignment: 1 Day: 15%, 2 Days: 40%; 3 Days: 75%; past 3 days: 100%. A day is
a 24 hour period starting 30 minutes before the beginning of class and includes
all weekend days and holidays.
Read the department's
academic policy page. Students who demonstrably violate the Academic
Honesty policy will receive a failing grade in the class. We will be using the
Moss system to
screen submited programs for plagiarism. Over the years, I
unfortunately had to fail about 20 students (6 last year) for copying
on programming assignments. To avoid problems, limit any discussion
of assignments with other students to clarification of the
requirements or definitions of the problems, or to understanding the
existing programs or general course material. Never discuss issues
directly relevant to problem solutions.
Provisions for Students with Disabilities
Students with disabilities may request appropriate academic accommodations from
the Division of Diversity and Community Engagement, Services for Students with
The final grade will be determined as follows: