HTRC Collections and Tools


The HathiTrust Research Center (HTRC) will offer a number of subject-based collections drawn from the public domain materials in HathiTrust. These themed collections will be based on Library of Congress Classification numbers. Our initial collections will cover astronomy, Victorian literature, and biology. The themed collections are expected to be accessible for computational analysis through SEASR, web service interfaces, and downloads.


The HTRC will offer a suite of tools for computational text analysis. These tools will cover a wide variety of functions ranging from simple statistical analysis of words to complex algorithms relating concepts and meaning.
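As an illustration of what "simple statistical analysis of words" can mean at the low end of that range, the following is a minimal Python sketch that counts word frequencies in a passage. It is not an HTRC or SEASR tool; the function name word_frequencies and the sample text are hypothetical and chosen only for this example.

```python
import re
from collections import Counter


def word_frequencies(text, top_n=5):
    """Return the top_n most frequent lowercase word tokens in text.

    Hypothetical helper for illustration only; not part of HTRC or SEASR.
    """
    # Lowercase the text and keep runs of ASCII letters as tokens.
    tokens = re.findall(r"[a-z]+", text.lower())
    return Counter(tokens).most_common(top_n)


# Made-up sample passage, standing in for a page of a public domain volume.
sample = "The stars are distant; the planets are near. The stars shine."
print(word_frequencies(sample, top_n=3))
```

Real collections would need more careful tokenization (hyphenation, OCR errors, non-English text), which is where a dedicated tool suite becomes useful.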

The Software Environment for the Advancement of Scholarly Research

The Software Environment for the Advancement of Scholarly Research (SEASR, pronounced “Caesar”) provides a research and development environment capable of powering leading-edge digital humanities initiatives. It fosters collaboration by enabling scholars to share data and research in virtual work environments. SEASR was developed with funding from the Andrew W. Mellon Foundation.

Author Search

The Author Search demonstrates the integration of SEASR with the HTRC infrastructure. The tool provides several examples of text analysis and visualization, which are created by searching bibliographic information stored in the HathiTrust Research Center collections and calling SEASR web services for analytical processing.