The "Hathifiles" are tab-delimited text files that describe every item in the HathiTrust collection. They include information derived from the bibliographic record (e.g., title, publisher, language, commonly used identifiers, etc.), rights and access codes, and information about the source of the item.
A description of the fields included in the hathifiles as well as potential use cases is provided in the “Hathifiles Description” page.
Files provided below
A monthly file is uploaded on the first of every month with a row for every item that is in the HathiTrust collection at the moment the file is created. The filename begins with “hathi_full_”. These files tend to be large and may be difficult to open with standard spreadsheet software or text editors. You may need to work with the files programmatically (e.g., using Python to extract desired data).
An update file is uploaded every day and contains a row for every item that has changed in the previous 24 hours. The filename begins with “hathi_upd_”. Items are included in the update files if any of the following has occurred: the item was newly deposited into the collection, a new copy of the digital item overrode the previous copy, the rights and access status has changed, or a new bibliographic record was provided by the contributor.
A “header” file is also included below. This file contains one row of labels for the data elements included in the hathifiles. It can be combined with the regular hathifiles for ease of working with the data. This header file is only updated when a new data element is added to the hathifiles.
Display name | created | size | modified | Mime type![]() | |
---|---|---|---|---|---|
![]() | hathi_upd_20230129.txt.gz | January 29, 2023 | 2.07 MB | January 29, 2023 | application/octet-stream |
![]() | hathi_upd_20230201.txt.gz | February 1, 2023 | 2.21 MB | February 1, 2023 | application/octet-stream |
![]() | hathi_upd_20230204.txt.gz | February 4, 2023 | 1.92 MB | February 4, 2023 | application/octet-stream |
![]() | hathi_upd_20230311.txt.gz | March 11, 2023 | 2.52 MB | March 11, 2023 | application/octet-stream |
![]() | hathi_upd_20230113.txt.gz | January 13, 2023 | 539.61 KB | January 13, 2023 | application/octet-stream |
![]() | hathi_upd_20230301.txt.gz | March 1, 2023 | 2.68 MB | March 1, 2023 | application/octet-stream |
![]() | hathi_upd_20230318.txt.gz | March 18, 2023 | 1.97 MB | March 18, 2023 | application/octet-stream |
![]() | hathi_upd_20230312.txt.gz | March 12, 2023 | 2.58 MB | March 12, 2023 | application/octet-stream |
![]() | hathi_upd_20230225.txt.gz | February 25, 2023 | 1.63 MB | February 25, 2023 | application/octet-stream |
![]() | hathi_upd_20230322.txt.gz | March 22, 2023 | 2.95 MB | March 22, 2023 | application/octet-stream |
![]() | hathi_upd_20230308.txt.gz | March 8, 2023 | 2.33 MB | March 8, 2023 | application/octet-stream |
![]() | hathi_upd_20230120.txt.gz | January 20, 2023 | 1.78 MB | January 20, 2023 | application/octet-stream |
![]() | hathi_upd_20230207.txt.gz | February 7, 2023 | 104.03 KB | February 7, 2023 | application/octet-stream |
![]() | hathi_upd_20230123.txt.gz | January 23, 2023 | 77.48 KB | January 23, 2023 | application/octet-stream |
![]() | hathi_upd_20230307.txt.gz | March 7, 2023 | 2.69 MB | March 7, 2023 | application/octet-stream |
![]() | hathi_upd_20230304.txt.gz | March 4, 2023 | 1.83 MB | March 4, 2023 | application/octet-stream |
![]() | hathi_upd_20230314.txt.gz | March 14, 2023 | 2.83 MB | March 14, 2023 | application/octet-stream |
![]() | hathi_full_20230201.txt.gz | February 2, 2023 | 1.05 GB | February 1, 2023 | application/octet-stream |
![]() | hathi_upd_20230215.txt.gz | February 15, 2023 | 3.17 MB | February 15, 2023 | application/octet-stream |
![]() | hathi_full_20230301.txt.gz | March 2, 2023 | 1.05 GB | March 1, 2023 | application/octet-stream |
![]() | hathi_upd_20230227.txt.gz | February 27, 2023 | 991.83 KB | February 27, 2023 | application/octet-stream |
![]() | hathi_upd_20230309.txt.gz | March 9, 2023 | 2.2 MB | March 9, 2023 | application/octet-stream |
![]() | hathi_upd_20230224.txt.gz | February 24, 2023 | 395.63 KB | February 24, 2023 | application/octet-stream |
![]() | hathi_upd_20230223.txt.gz | February 23, 2023 | 1008.28 KB | February 23, 2023 | application/octet-stream |
![]() | hathi_upd_20230310.txt.gz | March 10, 2023 | 1.59 MB | March 10, 2023 | application/octet-stream |
![]() | hathi_upd_20230305.txt.gz | March 5, 2023 | 2.88 MB | March 5, 2023 | application/octet-stream |
![]() | hathi_upd_20230125.txt.gz | January 25, 2023 | 2.9 MB | January 25, 2023 | application/octet-stream |
![]() | hathi_upd_20230212.txt.gz | February 12, 2023 | 100.37 KB | February 12, 2023 | application/octet-stream |
![]() | hathi_upd_20230226.txt.gz | February 26, 2023 | 2.91 MB | February 26, 2023 | application/octet-stream |
![]() | hathi_upd_20230303.txt.gz | March 3, 2023 | 1.08 MB | March 3, 2023 | application/octet-stream |
![]() | hathi_upd_20230203.txt.gz | February 3, 2023 | 2.39 MB | February 3, 2023 | application/octet-stream |
![]() | hathi_upd_20230214.txt.gz | February 14, 2023 | 7.04 KB | February 14, 2023 | application/octet-stream |
![]() | hathi_upd_20230127.txt.gz | January 27, 2023 | 3.14 MB | January 27, 2023 | application/octet-stream |
![]() | hathi_field_list.txt | November 15, 2022 | 307 bytes | April 1, 2022 | text/plain |
![]() | hathi_upd_20230315.txt.gz | March 15, 2023 | 3.59 MB | March 15, 2023 | application/octet-stream |
![]() | hathi_upd_20230112.txt.gz | January 12, 2023 | 62.65 KB | January 12, 2023 | application/octet-stream |
![]() | hathi_upd_20230306.txt.gz | March 6, 2023 | 2.28 MB | March 6, 2023 | application/octet-stream |
![]() | hathi_upd_20230126.txt.gz | January 26, 2023 | 2.15 MB | January 26, 2023 | application/octet-stream |
![]() | hathi_upd_20230210.txt.gz | February 10, 2023 | 1.24 MB | February 10, 2023 | application/octet-stream |
![]() | hathi_upd_20230211.txt.gz | February 11, 2023 | 79.44 KB | February 11, 2023 | application/octet-stream |
![]() | hathi_upd_20230220.txt.gz | February 20, 2023 | 1.28 MB | February 20, 2023 | application/octet-stream |
![]() | hathi_upd_20230319.txt.gz | March 19, 2023 | 5.94 MB | March 19, 2023 | application/octet-stream |
![]() | hathi_upd_20230317.txt.gz | March 17, 2023 | 2.58 MB | March 17, 2023 | application/octet-stream |
![]() | hathi_upd_20230302.txt.gz | March 2, 2023 | 2.51 MB | March 2, 2023 | application/octet-stream |
![]() | hathi_upd_20230117.txt.gz | January 17, 2023 | 1.74 MB | January 17, 2023 | application/octet-stream |
![]() | hathi_upd_20230206.txt.gz | February 6, 2023 | 6.1 KB | February 6, 2023 | application/octet-stream |
![]() | hathi_upd_20230313.txt.gz | March 13, 2023 | 2.49 MB | March 13, 2023 | application/octet-stream |
![]() | hathi_upd_20230216.txt.gz | February 16, 2023 | 806.68 KB | February 16, 2023 | application/octet-stream |
![]() | hathi_upd_20230208.txt.gz | February 8, 2023 | 927.22 KB | February 8, 2023 | application/octet-stream |
![]() | hathi_upd_20230219.txt.gz | February 19, 2023 | 670.52 KB | February 19, 2023 | application/octet-stream |
![]() | hathi_upd_20230320.txt.gz | March 20, 2023 | 3.08 MB | March 20, 2023 | application/octet-stream |
![]() | hathi_file_list.json | March 22, 2023 | 17.73 KB | March 22, 2023 | application/octet-stream |
![]() | hathi_upd_20230217.txt.gz | February 17, 2023 | 537.44 KB | February 17, 2023 | application/octet-stream |
![]() | hathi_upd_20230111.txt.gz | January 11, 2023 | 50.12 KB | January 11, 2023 | application/octet-stream |
![]() | hathi_upd_20230221.txt.gz | February 21, 2023 | 479.97 KB | February 21, 2023 | application/octet-stream |
![]() | hathi_upd_20230115.txt.gz | January 15, 2023 | 2.97 MB | January 15, 2023 | application/octet-stream |
![]() | hathi_upd_20230202.txt.gz | February 2, 2023 | 1.44 MB | February 2, 2023 | application/octet-stream |
![]() | hathi_upd_20230228.txt.gz | February 28, 2023 | 2.2 MB | February 28, 2023 | application/octet-stream |
![]() | hathi_upd_20230205.txt.gz | February 5, 2023 | 1.27 MB | February 5, 2023 | application/octet-stream |
![]() | hathi_upd_20230130.txt.gz | January 30, 2023 | 3.29 MB | January 30, 2023 | application/octet-stream |
![]() | ucal_barcodes_dollarified_201403.txt.gz | March 13, 2014 | 686.82 KB | March 13, 2014 | application/octet-stream |
![]() | hathi_upd_20230209.txt.gz | February 9, 2023 | 1.59 MB | February 9, 2023 | application/octet-stream |
![]() | hathi_upd_20230118.txt.gz | January 18, 2023 | 976.85 KB | January 18, 2023 | application/octet-stream |
![]() | hathi_upd_20230128.txt.gz | January 28, 2023 | 2.88 MB | January 28, 2023 | application/octet-stream |
![]() | hathi_upd_20230114.txt.gz | January 14, 2023 | 2.31 MB | January 14, 2023 | application/octet-stream |
![]() | hathi_upd_20230213.txt.gz | February 13, 2023 | 6.99 KB | February 13, 2023 | application/octet-stream |
![]() | hathi_upd_20230122.txt.gz | January 22, 2023 | 2.84 MB | January 22, 2023 | application/octet-stream |
![]() | hathi_upd_20230131.txt.gz | January 31, 2023 | 1.77 MB | January 31, 2023 | application/octet-stream |
![]() | hathi_upd_20230121.txt.gz | January 21, 2023 | 2.12 MB | January 21, 2023 | application/octet-stream |
![]() | hathi_upd_20230119.txt.gz | January 19, 2023 | 1.01 MB | January 19, 2023 | application/octet-stream |
![]() | hathi_upd_20230116.txt.gz | January 16, 2023 | 2.01 MB | January 16, 2023 | application/octet-stream |
![]() | hathi_upd_20230321.txt.gz | March 21, 2023 | 3.39 MB | March 21, 2023 | application/octet-stream |
![]() | hathi_upd_20230124.txt.gz | January 24, 2023 | 2.81 MB | January 24, 2023 | application/octet-stream |
![]() | hathi_upd_20230218.txt.gz | February 18, 2023 | 576.94 KB | February 18, 2023 | application/octet-stream |
![]() | hathi_upd_20230316.txt.gz | March 16, 2023 | 2.16 MB | March 16, 2023 | application/octet-stream |
![]() | hathi_upd_20230222.txt.gz | February 22, 2023 | 442.65 KB | February 22, 2023 | application/octet-stream |