Available Indexes

HathiTrust OAI feed

The Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH) is a protocol used in libraries and archives for the automated delivery of structured bibliographic metadata. You can use this option to retrieve metadata in MARC21 or unqualified Dublin Core formats in XML structure. The OAI feed allows you to access new and updated records and to discover if any records have been deleted. For best practices related to OAI, and a list of potential harvesters, see http://webservices.itcs.umich.edu/mediawiki/oaibp/.

The HathiTrust OAI feed is maintained by the University of Michigan and is a set of the broader University of Michigan feed which contains other digital collections. This link  describes the University of Michigan OAI repository.

Notes for working with the HathiTrust OAI feed:

  • The MARC version of the feed does not provide complete MARC records. View the list of fields that are included in the HathiTrust MARC OAI feed at http://bit.ly/2E0Jp8S.
  • Records for a small number of limited-view content may be included in the OAI feed. The OAI feed has been stable and has not been modified in a long time, during which time additional access control codes have been added to the repository. If using the MARC set, filter out any records where the 856 subfield r contains the codes “pd-pvt” or “supp.”
  • Records for limited-view content have been primarily excluded from the OAI feed, despite the previous note. If you need records for limited-view content, please contact HathiTrust User Support.

Retrieve all HathiTrust full-view records

All HathiTrust full-view records can be harvested through the following URLs. The set provides you with access to all of the HathiTrust full-view records, including materials that are only available to users located in the United States and works that are opened with Creative Commons licenses.

  • MARC records: https://quod.lib.umich.edu/cgi/o/oai/oai?verb=ListRecords&metadataPrefix=marc21&set=hathitrust
  • Dublin Core records: https://quod.lib.umich.edu/cgi/o/oai/oai?verb=ListRecords&metadataPrefix=oai_dc&set=hathitrust

This set is best for libraries and organizations that are located within the United States and wish to incorporate records for all content that their users can access.

Retrieve a limited set of records for US access

To retrieve records for materials that are viewable only to users located in the United States use "set=hathitrust:pdus". This set is narrowly scoped to only include records that are viewable in the United States. It does not include materials that are opened to users worldwide or under a Creative Commons license.

  • MARC records: https://quod.lib.umich.edu/cgi/o/oai/oai?verb=ListRecords&metadataPrefix=marc21&set=hathitrust:pdus
  • Dublin Core records: https://quod.lib.umich.edu/cgi/o/oai/oai?verb=ListRecords&metadataPrefix=oai_dc&set=hathitrust:pdus

This set is best for libraries and organizations that are located in the United States and want a smaller collection to include in their systems. For libraries in the US that wish to incorporate records for all content that their users can read online, we recommend the set above for all HathiTrust full-view records; however, it is a significantly bigger set.

Retrieve a limited set of records for non-US access

To retrieve records for materials that are viewable to users located outside the United States, use “set=hathitrust:pd". This set includes materials that are opened to users worldwide, including those opened with a Creative Commons license, but excludes materials that are only viewable to users located in the United States. 

  • MARC records: https://quod.lib.umich.edu/cgi/o/oai/oai?verb=ListRecords&metadataPrefix=marc21&set=hathitrust:pd 
  • Dublin Core records: https://quod.lib.umich.edu/cgi/o/oai/oai?verb=ListRecords&metadataPrefix=oai_dc&set=hathitrust:pd

This set is best for libraries and organizations that are located outside the United States. It excludes records for content that is available only to users located within the United States.

Retrieve individual records

To retrieve a small number of individual records, as opposed to the sets above, we recommend using the Bibliographic API instead. Retrieving records through OAI is best for ongoing syncing with the HathiTrust catalog. The Bibliographic API is best used for live querying against the HathiTrust catalog or to retrieve a small set of bibliographic records.