public final class SafeHTMLPage extends HTMLPage
SafeHTMLPageRetriever to allow
clients to facilitate writing spiders that obey both the Robots
Exclusion Protocol and the Robots META tags.| Constructor and Description |
|---|
SafeHTMLPage(Link link,
java.lang.String text,
boolean index)
Constructs an
SafeHTMLPage with the given link,
text, and indication whether or not indexing is allowed. |
| Modifier and Type | Method and Description |
|---|---|
boolean |
indexAllowed()
Indicates whether or not indexing has been disallowed by a
Robots META tag.
|
addEndSlash, empty, getLink, getOutLinks, getText, setOutLinks, writepublic SafeHTMLPage(Link link, java.lang.String text, boolean index)
SafeHTMLPage with the given link,
text, and indication whether or not indexing is allowed.link - A Link object representing the given page.text - The text of the page.index - Should be true iff. the page can be
indexed.public boolean indexAllowed()
SafeHTMLPageRetriever that
supports Robots META tags, such as SafeHTMLPageRetriever.indexAllowed in class HTMLPagetrue iff. the page can be indexed.