HathiTrust released a new bibliographic management system, Zephir, developed by the California Digital Library. Zephir is custom-made to support the particular needs of bibliographic management in HathiTrust. A full announcement is available at http://www.hathitrust.org/zephir_announcement. See http://www.hathitrust.org/zephir for background on the project and system documentation. From this time, institutions submitting bibliographic metadata to HathiTrust need only submit metadata to Zephir (see http://www.hathitrust.org/bib_data_submission for details).
HathiTrust and DPN
HathiTrust announced its intention to become a “replicating node” in the Digital Preservation Network (DPN). The formal announcement can be read at http://www.hathitrust.org/hathitrust_dpn_announcement.
Call for US Government Documents Records
HathiTrust is issuing a broad call for bibliographic records for US federal government publications from HathiTrust partner and non-partner institutions alike, in support of its initiative to expand and enhance access to US federal government documents. Further information about the initiative and details about the call for records are available at http://www.hathitrust.org/usgovdocs. Records requirements, an information sheet to accompany bibliographic record submissions, and instructions on submission, are available at http://tinyurl.com/kyw26fo.
Board of Governors
The HathiTrust Board of Governors held an in-person meeting in Arlington, VA on October 11. The Board discussed a full agenda, including the ballot initiatives passed at the 2011 constitutional convention, proposals to expand the availability of open access materials in HathiTrust and to broaden access to users who have print disabilities, the HathiTrust Research Center, the Executive Director search, and member issues arising from the bylaws, including an approval process for new members and an annual meeting. Actions on these items will be reported on in this and upcoming monthly updates.
Executive Director Search
The Executive Director search committee worked to narrow a rich pool of candidates and will be conducting phone interviews with selected individuals in November.
Validation service for locally-digitized materials
Completion of a web-based service to validate single image files and a cloud storage-based service to validate entire volumes was delayed in October. These services are now planned for release in November, and December or January, respectively.The creation of these services is the result of conversations held with partners who plan to deposit locally-digitized content in HathiTrust, and a general survey about the usefulness of these services in facilitating local packaging of materials prior to submission to HathiTrust.
HathiTrust answered questions about content ingest from the University of Delaware and Vanderbilt University, reviewed sample content from Texas A&M university, and prepared for ingest of files from the University Press of Florida.
Working Groups and Committees
Program Steering Committee
The Program Steering Committee is developing charges for a Government Documents Initiative Planning and Advisory Group and a Rights and Access Working Group, and expects to recruit members for these groups in the coming weeks. Work continues on charges for groups to advance the establishment of a distributed print archive on monographic holdings corresponding to the digital content in HathiTrust and to continue the work of the Collections Steering Committee setting priorities for expanding collections. The Committee is also reviewing a proposal for a distributed program to certify the quality of volumes within HathiTrust.
Government Documents Registry
The project team reviewed feedback received during focus groups on the government documents registry conducted in late September/early October, and began to develop functional requirements for the registry. The team also began identifying potential strategies for detection of duplicate records. A project timeline has been added to the project web page.
Public Domain Determinations
Public Domain Determinations
HathiTrust Research Center
Members of the HTRC Executive Team attended an in-person meeting of the HathiTrust Board of Governors and will be crafting a business plan for HTRC operations going forward for the Board to review. The HTRC is also making plans to expand the texts included in its research environment to all works in HathiTrust, including those that are in-copyright.
The HTRC held its second monthly HTRC Usergroup meeting, on educational materials related to the HTRC. Notes from the meeting are available on the HTRC wiki. These meetings are open to all who are interested (see this link to sign up; you can also sign up to participate in the HTRC wiki). More information about the HTRC, including directions to sign up for a general HTRC announcements list and announcements related to the HTRC UnCamp are available at http://www.hathitrust.org/htrc.
HathiTrust institutions performed the following work related to applications and Web interfaces:
Staff corrected a problem that resulted in long collection names being truncated.
Version 1 of the Data API was taken out of service on November 1, 2013. Version 2 is the current version.
Staff continued working on web server upgrades for the HathiTrust development environment, and selected developers began to test the servers.
Staff continued work on issues related to relevance ranking and began testing of new algorithms to measure document homogeneity. Staff also corrected an indexing issue that resulted in some Full view works being represented as Limited (search-only) in the online catalog.
HathiTrust users may have experienced slow page loading or errors in page loading on Friday, October 18 from 12:30-9:45pm due to a software release earlier in the day that left page viewing at the Indiana site in a non-working state that was subtle enough to be undetectable to monitoring systems.
As of November 1:
|Library of Congress||0||89,724|
|North Carolina State University||0||3,196|
|New York Public Library||3||288,367|
|Penn State University||526||65,312|
|University of California||16,125||3,435,459|
|The University of Chicago||1,842||35,387|
|University of Florida||0||9,587|
|University of Illinois||275||112,169|
|University of Michigan||5,551||4,662,752|
|University of Minnesota||1,831||112,169|
|University of North Carolina, Chapel Hill||0||17,025|
|University of Wisconsin||7||555,878|
|University of Virginia||4||50,821|
Public Domain (~32%)
* Includes volumes opened through copyright review and rights holder permissions
Summary of Issues Received by User Support
|Access and Use||164||107|
Print on Demand
Full-PDF or e-copy requests
Data Availability and APIs
Reuse of content
Problems with login specifically
General Questions about Login
Partners setting up login
Most Accessed Volumes
- Continue to work on support for indexing of JATS articles.
- Continue development to generate ePub and PDF formats from JATS XML.
- Continue to explore relevance ranking solutions.
Papers & Presentations
- Kevin S. Hawkins, “A Model for Integrating the Publishing and Preservation of Journal Articles (slides) (paper)”, RCDL, October 15, 2013.
- Seth Johnson, Bryan Smith, Kevin S. Hawkins, “mPach: Integrated Publishing and Archiving of Journals in HathiTrust”, Impromtu JATS Users Group Meeting, October 23, 2013