Navigation

Ingest Reports Description

HathiTrust makes ingests reports available for institutions that are contributing content. Below is a description of the information in the reports.

No digital object Items that have bibliographic data in HathiTrust, but have not been queued for ingest, ingested, or failed ingest
Not in GRIN Items that have bibliographic data in HathiTrust, but do not exist in GRIN (subset of "No digital object")
No bib data Items that are available on GRIN and not high error or surrogate (i.e., "ingestible"), but have not had bib data loaded into HathiTrust
Queued/in process Items that have entered the queue to be ingested. The typical ingest rate is up to 30,000 volumes per day; items are ingested in the order enqueued.
Delayed in process Items that are queued for ingest but have not had action in at least 5 days. Normally this indicates items are delayed "IN PROCESS" at Google.
NAFD Items in GRIN (with or without bibliographic data in HathiTrust) with status "NOT_AVAILABLE_FOR_DOWNLOAD"
CHECKED_IN Items in GRIN (with or without bibliographic data in HathiTrust) with status "CHECKED_IN"
High error Items in GRIN (with or without bibliographic data in HathiTrust) with greater than 15% overall error. These may be ingested if quality rises above the error threshold.
Surrogates Items in GRIN (with or without bibliographic data in HathiTrust) with condition 31 and src_library_bibkey set. These are "Surrogate" items, returned when a physical copy of a volume from one library has already been scanned by Google from another.
Failing ingest Items that have failed ingest in the last 7 days (subset of "Failed ingest"). Items in this category will not be automatically retried, but they will be examined and retried if the error was transient.
Failed ingest Items we have attempted to ingest that have failed ingest for any reason (see Ingest Logs) at any time. Items in this category will not be automatically retried, but they will be periodically reviewed & retried if conditions warrant.
Ingested in last week Rough count of items ingested in the preceding week (count from previous week's ingest log)