Major library partners launch HathiTrust, a shared digital repository
A group of the nation's largest research libraries are collaborating to create a repository of their vast digital collections, including millions of books. The holdings will be archived and preserved in a single repository called the HathiTrust. Materials in the public domain will be available for reading online.
Launched jointly by the 12-university consortium known as the Committee on Institutional Cooperation (CIC) and the 11 university libraries of the University of California system, the HathiTrust leverages the time-honored commitment to preservation and access to information that university libraries have valued for centuries. UC's participation will be coordinated by the California Digital Librar. (CDL), which brings its innovative experience in digital curation and online scholarship to the HathiTrust.
"This effort combines the expertise and resources of some of the nation's foremost research libraries and holds even greater promise as it seeks to grow beyond the initial partners," said John Wilkin, associate university librarian of the University of Michigan and the newly named executive director of HathiTrust. Hathi (pronounced HAH-tee), the Hindi word for elephant, is incorporated into the repository's name and underscores the immensity of this undertaking, Wilkin said. Elephants also evoke memory, wisdom and strength.
As of today, HathiTrust contains more than 2 million volumes and approximately a billion pages, about 16 percent of which are in the public domain. Public domain materials will be available for reading online. Materials protected by copyright, although not available for reading online, are given the full range of digital archiving services, thereby offering member libraries a reliable means to preserve their collections. Organizers also expect to use those materials in the research and development of the Trust.
Volumes are added to the repository daily, and content will grow rapidly as the University of California, CIC member libraries and other prospective partners contribute their digitized content. The founding partners have been joined by the University of Virginia since announcement of the repository was made in October.
Each of the founding partners brings extensive and highly regarded expertise in the areas of information technology, digital libraries and project management to this endeavor. Creation of the HathiTrust supports the digitization efforts of the CIC and the University of California, each of which has entered into collective agreements with Google to digitize portions of the collections of their libraries, more than 10 million volumes in total, as part of the Google Book Search project. Materials digitized through other means will also be made available through HathiTrust.
HathiTrust provides libraries a means to archive and provide access to their digital content, whether scanned volumes, special collections or born-digital materials. Preserving materials for the long term has long been a mission and driving force of leading research libraries. Their collections, accumulated over centuries, represent a treasury of cultural heritage and investment in the broad public good of promoting scholarship and advancing knowledge. The representation of these resources in digital form provides expanded opportunities for innovative use in research, teaching and learning, but must be done with careful attention to effective solutions for the curation and long-term preservation of digital assets.
"The CIC Libraries have always worked at a large scale, with big collections, big user communities and high expectations for service. They are not intimidated by big challenges, and will bring their comfort with this to the development of the shared digital repository," said Mark Sandler, director of the CIC Center for Library Initiatives.
"The University of California libraries have an unparalleled reputation for innovation in digital library development and inter-institutional collaboration," said Laine Farley, interim executive director of the California Digital Library. "Participation in the HathiTrust continues this tradition and will enable UC to provide its students and scholars with access to one of the most significant digital collections ever assembled."
"Researchers will benefit from the expert curation and consistent access they have long associated with the CIC research libraries," said Michael McRobbie, president of Indiana University. "Great libraries have long been essential to outstanding scholarship, and the HathiTrust collaboration among the CIC institutions, the University of California and others provides an essential tool for 21st-century scholars."
"Digitization of print texts has the promise of being transformative of scholarship and of library practice," said Paul Courant, University of Michigan librarian, dean of libraries and former provost. "In both areas, the ability to search many texts and to preserve texts accessibly creates tremendous opportunities for collaboration amongst scholars and universities. HathiTrust has made a good start, and like the elephant for which it is named, we expect that it will prove able to carry and deliver valuable resources with grace and reliability."
"Before this collaboration," Wilkin said, "the collections in each library existed in isolation. Now we are bringing them together, pooling resources and eliminating redundancies and producing a valuable research tool that will be greater than the sum of its parts."
The CIC and the University of California each produce an estimated 10 percent of the new Ph.D.'s granted in the U.S. each year and together serve more than 600,000 students.
The Midwest-based CIC is comprised of the universities of the Big Ten, plus the University of Chicago. Partner libraries represent IU, University of Illinois, University of Illinois at Chicago, University of Iowa, University of Michigan, Michigan State University, University of Minnesota, Northwestern University, Ohio State University, Penn State University, Purdue University and University of Wisconsin-Madison.
The University of California system includes ten research universities at Berkeley, Davis, Irvine, Los Angeles, Merced, Riverside, San Diego, San Francisco, Santa Barbara and Santa Cruz plus the systemwide California Digital Library.