CS 371R:
Information Retrieval and Web Search
Instructor
Raymond J. Mooney
, office hours: Tu. & Th. 10-11AM, CSA 1.102
Teaching Assistant
??
, office hours: ??,
ENS
Lower Basement open lab (across the hall from the elevators).
Time and Place
Spring, 2010; TuTh 11:00-12:30PM; MEZ 1.120 (unique 54285)
General Course Information
Basic course information
Course syllabus
Information on course Java code
Information on submitting projects
Textbook
Introduction to Information Retrieval
by
Christopher D. Manning
,
Prabhakar Raghavan
and
Hinrich Schütze
Programming Projects
Project 0:
Optional
(due Jan. ??)
Project 1: Vector Space Retrieval
(due Feb. ??)
Project 2: Evaluating Performance of Query Operations
(due Feb. ??)
Project 3: Web Spidering and Link Analysis
(due Apr. ??)
Project 4: Text Categorization
(due Apr. ??)
Exams
Midterm (March 11) (
Equation sheet 1
) (
Perl Regex sheet
)
Last year's midterm in PDF
Last year's midterm sample solution in PDF
Final (May 15) (
Equation sheet 2
) (
Equation sheet 3
) (
Link analysis algorithms sheet
)
Last year's final in PDF
Last year's final solution notes
Powerpoint Presentations
Introduction
Boolean and Vector-Space Retrieval Models
Basic Tokenizing, Indexing, and Implementation of Vector-Space Retrieval
Performance Evaluation of Information Retrieval Systems
Query Operations (Relevance Feedback / Query Expansion)
Query Languages
Text Properties and Languages
Web Search: Introduction
Web Search: Spidering
Web Search: Interfaces
Web Search: Link Analysis
Automated Text Categorization
Automated Text Clustering
Recommender Systems
Information Extraction
(
Sequence Labeling
)
Servlet Demos
Simple Search Engine
Java Course Code
A jar file for the course Java code is available
here
.
JavaDoc for Course Code
All packages
Vector-Space Retrieval
Performance Evaluation
Web Utilities
Text Classifers
Utilities
Java Information
Sun Java Tutorial
For Javadoc for the standard Java libraries see the
Sun Java Platform API Specs
.
Java Servlet/JSP Tutorial
Related Courses
Information Retrieval Course at UMass
Web Search and Mining Course at Stanford
Information Retrieval and Web Agents Course at Johns Hopkins
Intelligent Information Retrieval Course at DePaul
Miscellaneous Links
ACM Special Interest Group on Information Retrieval (SIGIR)
Text REtrieval Conference (TREC)
World-Wide Web Consortium (W3C)
On-line textbook on Information Retrieval by C. J. van Rijsbergen (1979)
Information Retrieval Links
UMass Center for Intelligent Information Retrieval
Bibliography on Zipf's Law
Web Robots Pages
Prosecuting Bots for Trespassing (e.g. Ebay vs. Bidder's Edge)
(or try a Google search on "robots.txt lawsuit")
Search Engine Watch
Search Tools for Web Sites
History of Search Engines
Scientific American articles on
XML
and the
Semantic Web
Web IR and IE
Reading List on Machine Learning and Information Retrieval
Repository of Online Information Sources Used in Information Extraction Tasks
Bibliography on Automated Text Categorization
Recommender Systems Links
NY Times article on Text Mining
Relevant Books Written for the General Public
Weaving the Web: The original design and ultimate destiny of the World Wide Web, by its inventor
, Tim Berners-Lee with Mark Fischetti, 1999.
Speeding the Net: The Inside Story of Netscape and How It Challenged Microsoft
, Joshua Quittner, Michelle Slatalla, 1998.
The Search: How Google and Its Rivals Rewrote the Rules of Business and Transformed Our Culture
, John Battelle, 2005.
The Google Story
,David Vise and Mark Malseed , 2005.
Linked: The New Science of Networks: How Everything is Connected to Everything Else and What it Means for Science, Business and Everyday Life
, A.L. Barabasi, 2002. (book on the statistical properties of the Web and other graph structures in nature)
mooney@cs.utexas.edu