External id: 535
Manager Email: waarala2@illinois.edu
Owner Email: dsears@illinois.edu
Start Date: 2016-02-11
Status: active


Digitization of the Cavagna Collection

Cavagna was an Italian public official and a recognized authority on the local history of Lombardy and Piedmont. His library contained a great many books on genealogy, biography, and local history, including materials on municipal governments. The manuscripts especially reflect the study of local history; most relate to Italian cities and towns, institutions, societies and families.

All aspects of Italian history, from the Middle Ages to the first years of the twentieth century, are prominently represented in the Cavagna collection, as is literature on Italian art and architecture. Other topics which are heavily represented are law, economics, biography, archaeology, chivalry, and records of Italian universities and academies.

Among the books in the collection, mostly written in Italian, are some incunabula, rare and early printed books, and first editions. Many of the historical documents are unique. In addition to books, pamphlets, and manuscripts, the Cavagna library includes several thousand maps, both ancient and modern.

Digital Content Creation is including the Cavagna Collection in its list of ongoing digital projects in partnership with the Rare Book & Manuscript Library. As of 2016, over 900 Cavagna titles have been digitized, processed, ingested into Medusa for preservation, and ingested into HathiTrust for access.


Moving forward, the library plans to pursue digitization of the collection on a larger scale through additional funding.



Cataloging status

  • RBML reviews the catalog record and the condition of the content when selecting and prioritizing items for the digitization queue.
  • The barcodes are scanned into the Medusa Project Tracking Database. Bibliographic fields including title, author, publisher, date, physical description, and unique bibliographic identifier are autopopulated by querying Voyager.
  • This creates a tracking record for the digitization lab to use during digitization. Digitization date, equipment, and photographer are recorded.
  • A status field is selected to notify the content provider that the item is done, in process, sent to conservation, or routed to another digitization workflow.
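
The tracking record described above can be pictured as a small data structure. The following Python sketch is illustrative only and is not the schema of the Medusa Project Tracking Database: the class names, field names, and the record_capture helper are assumptions made for the example, and the sample values are invented.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Optional


class Status(Enum):
    """Status values named in the list above (the labels are assumptions)."""
    DONE = "done"
    IN_PROCESS = "in process"
    SENT_TO_CONSERVATION = "sent to conservation"
    ROUTED_TO_OTHER_WORKFLOW = "routed to another digitization workflow"


@dataclass
class TrackingRecord:
    # Fields autopopulated from the catalog after the barcode is scanned.
    barcode: str
    bibid: str
    title: str
    author: str
    publisher: str
    date: str
    physical_description: str
    # Fields recorded by the digitization lab.
    digitization_date: Optional[str] = None
    equipment: Optional[str] = None
    photographer: Optional[str] = None
    status: Optional[Status] = None

    def record_capture(self, when: str, equipment: str, photographer: str) -> None:
        """Store the capture details and mark the item as in process."""
        self.digitization_date = when
        self.equipment = equipment
        self.photographer = photographer
        self.status = Status.IN_PROCESS
```

A record would be created when the barcode is scanned, updated with record_capture during digitization, and given a final status when the item is done or routed elsewhere.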

File Naming

Our book digitization file naming standards adhere to the HathiTrust Digital Library convention.

  • Name files with an 8-digit zero-padded number (ex. 00000001, 00000002, 00000003)
  • If multiple volumes are digitized during a session, add the bibid to the file name (it must be removed before ingest into Medusa) (ex. Bibid_00000001)
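
The naming convention above can be sketched in Python. This is a minimal illustration, not the lab's actual tooling: the function names page_filename and strip_bibid_prefix, the ".tif" extension default, and the sample bibid used below are assumptions made for the example.

```python
import re


def page_filename(sequence, bibid=None, extension=".tif"):
    """Return an 8-digit zero-padded file name, optionally prefixed with a bibid."""
    if not 1 <= sequence <= 99_999_999:
        raise ValueError("sequence must fit in 8 digits")
    stem = f"{sequence:08d}"
    if bibid is not None:
        # Temporary prefix used when several volumes are shot in one session.
        stem = f"{bibid}_{stem}"
    return stem + extension


def strip_bibid_prefix(filename):
    """Remove a leading 'bibid_' prefix, leaving the plain 8-digit name."""
    match = re.fullmatch(r"(?:[^_]+_)?(\d{8})(\.\w+)?", filename)
    if match is None:
        raise ValueError(f"unexpected file name: {filename!r}")
    return match.group(1) + (match.group(2) or "")
```

For example, page_filename(2, bibid="1234567") gives "1234567_00000002.tif", and strip_bibid_prefix turns that back into "00000002.tif" before ingest.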

Digitization Equipment 

  • BC100

Deliverables (file formats)

  • Preservation master TIFF for Medusa
  • Access TIFF for HathiTrust

Preservation Master

  • 600 DPI
  • Fixed pixel array
  • Adobe RGB
  • 16 bit
  • Include item-level target (beginning/end of volume)
  • Crop outside page edge


Access File

  • 400 DPI
  • 3000 pixels on the long edge
  • sRGB
  • 8 bit
  • Crop outside page edge
  • Include appropriate production note/blank 
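
For the 3000-pixel long-edge requirement in the list above, the target dimensions of a derivative can be worked out from the dimensions of the source image. The sketch below shows only that arithmetic; the function name access_dimensions is an assumption, and the actual resizing would be done in whatever imaging software the lab uses.

```python
def access_dimensions(width, height, long_edge=3000):
    """Scale (width, height) so the longer side equals long_edge.

    The aspect ratio is preserved; results are rounded to whole pixels.
    """
    if width <= 0 or height <= 0:
        raise ValueError("dimensions must be positive")
    scale = long_edge / max(width, height)
    return max(1, round(width * scale)), max(1, round(height * scale))
```

A 4000 x 6000 pixel capture would become 2000 x 3000 pixels. Note that a source smaller than 3000 pixels on its long edge would be scaled up by this calculation, which may not be desirable.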

Directory Structure:

  • Save preservation master and access files within a folder named with the bibid.

Destination of files:

Working files (prior to ingest into HathiTrust)

  • \\storage.library.illinois.edu\dcc\DCC Unprocessed Files\Digital_Rare_Book_Collections\QAqueue

Access files

  • \\storage.library.illinois.edu\hathitrust\packages\DCC

Preservation Master files per batch (ex. date_projectname_repositoryname)

  • \\storage.library.illinois.edu\dcc\MedusaStaging
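
As an illustration of how a batch folder name and a bibid folder might combine into a full staging path, here is a short Python sketch. Several details are assumptions and are not stated in this document: the YYYYMMDD date format, the sample project and repository names, and the nesting of the bibid folder directly inside the batch folder.

```python
from datetime import date
from pathlib import PureWindowsPath

# Staging share listed above for preservation masters.
MEDUSA_STAGING = PureWindowsPath(r"\\storage.library.illinois.edu\dcc\MedusaStaging")


def batch_folder_name(batch_date, project, repository):
    """Build a date_projectname_repositoryname folder name (date format assumed)."""
    return f"{batch_date:%Y%m%d}_{project}_{repository}"


def preservation_path(batch_date, project, repository, bibid, filename):
    """Return the assumed staging path: share / batch folder / bibid folder / file."""
    batch = batch_folder_name(batch_date, project, repository)
    return MEDUSA_STAGING / batch / bibid / filename
```

With the hypothetical values date(2016, 2, 11), "cavagna", "rbml", bibid "1234567", and file "00000001.tif", the batch folder would be named 20160211_cavagna_rbml and the file would sit in a 1234567 folder beneath it.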

JIRA is used to manage the internal digitization workflow. Steps are assigned to different members of the digitization team. Tasks are marked done as they are finished. Comments or issues are recorded if necessary. Tasks in the Cavagna Project include:

  • Verify title and bibliographic information in tracking database
  • Digitize content via the BC100. Transfer to network for processing of derivatives from raw capture files
  • Edit and process derivatives in batch
  • Deliver files to the server for quality assurance.
  • Quality assurance via batch checksum; verify color and exposure accuracy
  • Prep packages for ingest into HathiTrust Digital Library and Medusa Preservation Repository
  • Deliver Preservation Masters for ingest: notification is sent to preservation staff to prompt ingest of the batch.
  • Metadata creation: notification is sent to Content Access Management requesting required metadata for the batch. Batch metadata is sent to HathiTrust Digital Library. MARCXML files are added to folders for each individual title in the batch.
  • Deliver complete package to HathiTrust for ingest via FTP. Notify HathiTrust via email at feedback@issues.hathitrust.org. Enclose the submission and any signed permission-to-digitize forms.
  • Note the verification email of ingest and the estimated date of access to the digital object from HathiTrust support.
  • Quality assurance to verify successful ingest and assign digitized book to Rare Book & Manuscript Library Collection in HathiTrust.
  • Add access handle to tracking database for RBML
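
The batch checksum step in the task list above can be sketched as follows. This is one possible approach rather than the team's actual quality assurance script: the function names, the "*.tif" file pattern, and the choice of MD5 as the default algorithm are assumptions, and any algorithm supported by Python's hashlib could be substituted.

```python
import hashlib
from pathlib import Path


def file_checksum(path, algorithm="md5", chunk_size=1 << 20):
    """Return the hex digest of a file, reading it in chunks to limit memory use."""
    digest = hashlib.new(algorithm)
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def build_manifest(folder, pattern="*.tif"):
    """Map each matching file name in the folder to its checksum."""
    return {p.name: file_checksum(p) for p in sorted(Path(folder).glob(pattern))}


def verify_manifest(folder, manifest):
    """Return the names of files that are missing or whose checksum has changed."""
    problems = []
    for name, expected in manifest.items():
        candidate = Path(folder) / name
        if not candidate.is_file() or file_checksum(candidate) != expected:
            problems.append(name)
    return problems
```

A manifest would be built before files are transferred and checked again with verify_manifest after delivery; an empty list means every file arrived unchanged.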