Attributes supported by the Harvest Gatherer
- Abstract
- Brief abstract about the object.
- Author
- Author(s) of the object.
- Description
- Brief description about the object.
- File-Size
- Number of bytes in the object.
- Full-Text
- Entire contents of the object.
- Gatherer-Host
- Host on which the Gatherer ran to extract information from
the object.
- Gatherer-Name
- Name of the Gatherer that extracted information from the
object (e.g., Full-Text, Selected-Text, or Terse).
- Gatherer-Port
- Port number on the Gatherer-Host that serves the Gatherer's
information.
- Gatherer-Version
- Version number of the Gatherer.
- Keywords
- Searchable keywords extracted from the object.
- Last-Modification-Time
- The time that the object was last modified
(in seconds since the epoch).
- MD5
- 16-byte MD5 checksum of the object.
- Partial-Text
- Only the selected contents from the object.
- Refresh-Rate
- How often the Broker attempts to update the content summary
(in seconds relative to Update-Time).
- Time-to-Live
- How long the content summary is valid (in seconds relative to Update-Time).
- Title
- Title of the object.
- Type
- The object's type. Some example types are:
Archive, Audio, Awk, Backup, Binary, C, CHeader, Command, Compressed,
CompressedTar, Configuration, Data, Directory, DotFile, Dvi, FAQ, FYI,
Font, FormattedText, GDBM, GNUCompressed, GNUCompressedTar, HTML, Image,
Internet-Draft, MacCompressed, Mail, Makefile, ManPage, Object, OtherCode,
PCCompressed, Patch, Perl, PostScript, RCS, README, RFC, SCCS,
ShellArchive, Tar, Tcl, Tex, Text, Troff, Uuencoded, and WaisSource.
- Update-Time
- The time that the Gatherer updated (generated) the content summary
from the object (in seconds since the epoch).
- URL
- The original URL of the object.
- URL-References
- Any URL references present within HTML objects.
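
The time-related attributes above fit together by simple arithmetic, and the MD5 attribute is an ordinary digest of the object's bytes. The Python sketch below illustrates one reading of those definitions. It is not Harvest code: the dictionary layout, the function names, the sample values, and the hexadecimal rendering of the checksum are all assumptions made for illustration. It also assumes that "relative to Update-Time" means an offset in seconds added to Update-Time.

```python
import hashlib
from datetime import datetime, timezone


def md5_attribute(data: bytes) -> str:
    """Hex rendering (an assumption) of the 16-byte MD5 digest of an object."""
    return hashlib.md5(data).hexdigest()


def to_utc(seconds: int) -> datetime:
    """Convert a seconds-since-the-epoch value to a UTC datetime."""
    return datetime.fromtimestamp(seconds, tz=timezone.utc)


def summary_times(attrs: dict) -> dict:
    """Derive absolute times from Update-Time, Time-to-Live and Refresh-Rate.

    Assumes both intervals are offsets in seconds from Update-Time.
    """
    update = int(attrs["Update-Time"])
    return {
        "expires_at": update + int(attrs["Time-to-Live"]),
        "refresh_at": update + int(attrs["Refresh-Rate"]),
    }


def is_valid(attrs: dict, now: int) -> bool:
    """True while the content summary has not outlived its Time-to-Live."""
    return now < summary_times(attrs)["expires_at"]


# Hypothetical content summary; the values are made up for the example.
example = {
    "Update-Time": "800000000",
    "Time-to-Live": "2419200",   # 28 days
    "Refresh-Rate": "604800",    # 7 days
}

times = summary_times(example)
print(times["refresh_at"], times["expires_at"])
print(to_utc(times["expires_at"]).isoformat())
print(md5_attribute(b""))
```

Under this reading, a Broker would try to refresh the summary once the current time passes refresh_at, and would treat the summary as stale once it passes expires_at.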