HathiTrust requires print holdings information from partner institutions to:
- Support analysis of the overlap of institutions' print holdings with digital holdings in HathiTrust, which will form the basis of yearly cost calculations beginning in 2013 (see graphs showing currently partner overlap).
- Provide a foundation for the expansion of legal uses of materials in HathiTrust by partner institutions, including special access to materials under Section 108 of U.S. copyright law, and access to in-copyright works for users with print disabilities.
- Facilitate collaborative collection development and management operations.
These data are required at the time of joining in the form described below.
Fee estimates: For institutions considering membership, we can provide fee estimates using a slightly streamlined version of the data, including only OCLC numbers for print materials, provided in two files: one for monographs and one for serials. Please contact feedback@issues.hathitrust.org with any questions.
Data Elements
Data element | Material types | Required or optional | Use and notes |
1. OCLC # | SPM MPM S | Required | Used for:
Requirements:
|
2. Partner’s Local System ID | SPM MPM S | Required | Used for: Tracking updated holdings submissions over time. Requirements: Either the bibliographic system ID or holdings ID is acceptable, as long as it is used consistently in submissions over time. |
3. Holding status | SPM MPM | If available | Used for:
Values accepted: ‘CH’=current holding Indication of current or withdrawn status will facilitate collective collection management and development, particularly in relation to the print monograph archive (knowing who has what) ‘WD’=withdrawn Holding status specifically does not impact services to users who have print disabilities; as long as the volume is reported in the print holdings, users will have access. ‘LM’=lost or missing Lost or missing status facilitates uses under Section 108 |
4. Condition | SPM MPM | If available | Used for: Expansion of legal uses of materials in HathiTrust by partner institution (specifically Section 108 uses) Value accepted: ‘BRT’=brittle, damaged and/or deteriorating In conjunction with in-print status, will indicate volumes eligible to be accessed by partner institutions under preservation provisions (Section 108 of US Copyright Law or similar laws in other countries) |
5. Item-specific enumeration and chronology | MPM | If available | Used for: Volume-level overlap analysis for multi-part monographs. Providing enumeration chronology information will enable more precise matching for multi-part items, likely reducing calculated fees. It will also increase precision for possible holdings-based services. Requirements: There is no standard format requirement for enumeration and chronology. |
6. ISSN | S | If available | Used for: May be used in the future as a secondary check on overlap analysis for serials |
Submission Notes
- We would like holdings information only for items that are print and bound, and have an OCLC number. Any scores or maps that fall into these categories (e.g., a book of maps) should be included in the holdings files. We do not want holdings records for micoform, eBooks, or other non-print materials.
- The information above should be provided in tab-separated text format. The data can be sent by email or made available for download. Please contact feedback@issues.hathitrust.org when the data is ready to submit.
- Single-part monographic holdings, multi-part monographic holdings, and serial holdings should each be submitted in separate files (one file for each, 3 files total).
- Please include all columns in the order listed, including empty columns for NULL values.
- Examples (<TAB> is included to denote a tab character (\t) for readability):
- single part monographs (include data elents 1, 2, 3, 4):
- "ocn000000001<TAB>bib000001<TAB>CH<TAB>BRT" [all fields supplied]
- "ocn000000001,ocn000000002<TAB>bib000001<TAB>CH<TAB>" [multiple OCLC numbers, null condition field]
- multi-part monographs (include data elements 1, 2, 3, 4, 5):
- "ocn000000001,ocn000000002<TAB>bib000001<TAB>CH<TAB>BRT<TAB>v.1 1923" [multiple OCLC numbers, all fields supplied]
- "ocn000000001<TAB>bib000001<TAB>CH<TAB><TAB>v.1 1923" [null condition field]
- serials (include data elements 1, 2, 6):
- "ocn000000001<TAB>bib000001<TAB>1234-5678" [all fields supplied]
- "ocn000000001<TAB>bib000001<TAB>" [null ISSN]
- single part monographs (include data elents 1, 2, 3, 4):
- Examples (<TAB> is included to denote a tab character (\t) for readability):
- For monographic holdings:
- Please include multiple copies of titles that are held. Multiple copies should be included as separate rows in the file (since they can have separate conditions and statuses).
- For serial holdings:
- Please include one row per title
We will request full updates of the holdings data at least annually, and will be willing to accept updates more frequently. Any updates submitted must be full updates of all holdings files (we are not accepting incremental updates at this time).
Sharing with OCLC
In addition to loading your data into the print holdings database, we would like to request your permission to share this data with OCLC. OCLC is strongly committed to providing support to HathiTrust and is currently engaged in research on how they might store, manage, and provide appropriate access to the kind of item-level print holdings data that is needed for the HathiTrust print holding database. OCLC staff are also engaged in research on the potential for developing systems that could aid libraries in making print holdings disposition decisions. Given its position as a central repository for library holdings data, these seem like natural roles for OCLC and, to the extent possible, we would like to support this work. OCLC staff has requested access to the print holdings data that we are collecting for use in their research. We will share your data with OCLC only if you explicitly grant us permission to do so, but we would appreciate receiving your response to this question in either case (whether you choose to grant permission to share your data or not).