Michigan State University

Datasets for Digital Research

The datasets listed below have been compiled from the Libraries' collections and are offered here as data sources for text mining or other "non-consumptive" research, that is, research conducted by computational methods which does not reproduce significant portions of text for personal or public display.

In addition to the materials below prepared by the MSU Libraries, also be aware of additional corpora available for linguistic research.

Recommendations for acquisition of new datasets, requests for assistance gathering and preparing data, and questions about how to use data may be directed to researchdata@lib.msu.edu.


Hand casting ballot
Comics as Data: North America
(open to all users)
Over 50,000 records.
Years covered: 1888-2018
Size: 29 MB
Hand casting ballot
Fannie Lou Hamer papers, 1966-1978
(open to MSU users)
Number of Works: 640 documents
Years covered: 1966-1978
Size: 14 MB
Sunday School Books in Nineteenth Century America  (open to all users)
Number of Works: 166 works
Years covered: 1809-1887
Size: 11.6 MB
The Michigan Tradesman 
(open to all users)
Number of Works: 1,179 issues
Years covered: 1883-1906
Size: 35 GB
The Grange Visitor 
(open to all users)
Number of Works: 429 issues
Years covered: 1875-1896
Size: 8.53 GB
Michigan Farming Journals 
(open to all users)
Number of Works: 1,954 issues
Years covered: 1878-1938
Size: ~60 GB
Feeding America 
(open to all users)
Number of Works: 76 books
Years covered: late 18th - early 20th century
Size: 78+ MB
MAC/MSC Record
M.A.C/M.S.C Record Dataset 
(open to all users)
Number of Works: 2694 works
Years covered: 1896-1955
Size: 24+ GB
Image Credits: Schoolhouse by Chris Cole, Newspaper by John Caserta, Book by Derrick Snider, Library designed by libberry, Congress by Martha Ormiston, Cooking by Rafael Farias Leao, UX Personas by Matt Wasser, Vote by Re Jean Soo; Newspaper by Trishul; Thor Hammer by Alen Krummenacher; Newspaper by Loïc Poivet; All via the Noun Project