|
|||||||||
| PREV PACKAGE NEXT PACKAGE | FRAMES NO FRAMES | ||||||||
See:
Description
| Class Summary | |
|---|---|
| Document | Docment is an abstract class that provides for tokenization of a document with stop-word removal and an iterator-like interface similar to StringTokenizer. |
| DocumentIterator | An object for iterating over a set of documents in a directory. |
| DocumentReference | A simple data structure for storing a reference to a document file that includes information on the length of its document vector. |
| Feedback | Gets and stores information about relevance feedback from the user and computes an updated query based on original query and retrieved documents that are rated relevant and irrelevant. |
| FileDocument | A Document stored as a file. |
| HashMapVector | A data structure for a term vector for a document stored as a HashMap that maps tokens to Weight's that store the weight of that token in the document. |
| HTMLFileDocument | An HTML file document where HTML commands are removed from the token stream. |
| InvertedIndex | An inverted index for vector-space information retrieval. |
| Retrieval | A lightweight object for storing information about a retrieved Document. |
| TextFileDocument | A normal ASCII text file Document |
| TextStringDocument | A simple document represented by a String |
| TokenInfo | A lightweight object for storing information about a token (a.k.a word, term) in an inverted index. |
| TokenOccurrence | A lightweight object for storing information about an occurrence of a token (a.k.a word, term) in a Document. |
Provides basic vector-space information retrieval system.
For command line interfaces see the main methods of the following classes:
|
|||||||||
| PREV PACKAGE NEXT PACKAGE | FRAMES NO FRAMES | ||||||||