The HathiTrust repository was created according to the framework for Open Archival Information Systems [1] (OAIS). Definitions from this framework are used in the discussion of specifications for Digital Objects (Archival Information Packages) below.
Definitions
Specifications
Provenance, Reference, and Fixity Information for Content Information in HathiTrust are stored in one or more files conforming to the Metadata Encoding and Transmission Standard [2] (METS). Digital objects (Archival Information Packages, or AIPs) from all digitization sources include a "HathiTrust" METS file. AIPs from the Internet Archive and Google include an additional "Source" METS file. These two files are constituted as follows:
Preservation information included in the METS file is recorded using Preservation Metadata Implementation Strategies [3] (PREMIS).
HathiTrust has defined a METS profile for the Google-digitized content archived in the repository, and has defined a generalized policy and specification framework for book and journal content (including image header metadata, resolution, identifiers, etc.). The framework is available at Getting Content Into HathiTrust [4]. The METS profile for Google-digitized content is given below, along with a summary of its structure. Examples of Google and Internet Archive "Source" and "HathiTrust" METS follow.
The PREMIS implementation used for most volumes in the repository is PREMIS 1.0, though the implementation used for the more recently added Internet Archive-digitized volumes is PREMIS 2.0. A description of HathiTrust's PREMIS 1.0 usage is provided; a description of PREMIS 2.0 is forthcoming. HathiTrust plans to migrate preservation information for all content to PREMIS 2.0 in the near future.
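As an illustration of how PREMIS preservation events can be read out of a METS file, the following is a minimal Python sketch. It assumes that the PREMIS data is wrapped in a METS digiprovMD section and uses the PREMIS 2.0 namespace. The sample document, its event values, and the function name premis_events are hypothetical and are not taken from an actual HathiTrust file; real files may be organized differently, and PREMIS 1.0 records use a different namespace.

```python
import xml.etree.ElementTree as ET

# Namespace URIs published with the METS and PREMIS 2.0 schemas.
METS_NS = "http://www.loc.gov/METS/"
PREMIS2_NS = "info:lc/xmlns/premis-v2"
NS = {"mets": METS_NS, "premis": PREMIS2_NS}

# Hypothetical, heavily abbreviated METS document used only for illustration.
SAMPLE_METS = """
<mets:mets xmlns:mets="http://www.loc.gov/METS/"
           xmlns:premis="info:lc/xmlns/premis-v2">
  <mets:amdSec ID="AMD1">
    <mets:digiprovMD ID="DP1">
      <mets:mdWrap MDTYPE="PREMIS">
        <mets:xmlData>
          <premis:event>
            <premis:eventType>ingestion</premis:eventType>
            <premis:eventDateTime>2010-01-01T00:00:00</premis:eventDateTime>
          </premis:event>
        </mets:xmlData>
      </mets:mdWrap>
    </mets:digiprovMD>
  </mets:amdSec>
</mets:mets>
"""


def premis_events(mets_xml):
    """Return (eventType, eventDateTime) pairs for PREMIS 2.0 events.

    Assumption: events sit inside mets:digiprovMD/mets:mdWrap/mets:xmlData.
    """
    root = ET.fromstring(mets_xml)
    events = []
    # Locate each xmlData wrapper inside a digital provenance section.
    for xml_data in root.iterfind(
        ".//mets:digiprovMD/mets:mdWrap/mets:xmlData", NS
    ):
        # Collect every PREMIS event element at any depth below the wrapper.
        for event in xml_data.iter("{%s}event" % PREMIS2_NS):
            event_type = event.findtext(
                "premis:eventType", default="", namespaces=NS
            )
            event_date = event.findtext(
                "premis:eventDateTime", default="", namespaces=NS
            )
            events.append((event_type, event_date))
    return events


if __name__ == "__main__":
    for event_type, event_date in premis_events(SAMPLE_METS):
        print(event_type, event_date)
```

Because the lookup is keyed on the PREMIS 2.0 namespace, a file that records its events in PREMIS 1.0 would return an empty list from this sketch; handling both versions would require matching on the PREMIS 1.0 namespace as well.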
Links:
[1] http://nssdcftp.gsfc.nasa.gov/standards/nost/isoas/int05/CCSDS-650.0-W-2.pdf
[2] http://www.loc.gov/standards/mets/
[3] http://www.loc.gov/standards/premis/
[4] http://www.hathitrust.org/ingest
[5] http://www.hathitrust.org/documents/hathitrust-mets-profile.xml
[6] http://www.hathitrust.org/documents/hathitrust-mets-profile2.0.xml
[7] http://www.hathitrust.org/documents/hathitrust-mets-structure.pdf
[8] http://www.hathitrust.org/documents/example-hathitrust-google-mets.xml
[9] http://www.hathitrust.org/documents/example-source-google-mets.xml
[10] http://www.hathitrust.org/documents/example-hathitrust-ia-mets.xml
[11] http://www.hathitrust.org/documents/example-source-ia-mets.xml
[12] http://www.hathitrust.org/documents/hathitrust-premis-recommendations.pdf