Feeding America: The Historic American Cookbook Dataset
The Feeding America: The Historic American Cookbook dataset contains transcribed and encoded text from 76 influential American cookbooks held by MSU Libraries' Stephen O. Murray and Keelung Hong Special Collections. Features encoded within the text include, but are not limited to, recipes, types of recipes, cooking implements, and ingredients. The 76 texts were chosen from among the more than 7,000 cookbooks that MSU Libraries holds as representative of periods and themes in American cookbook history, spanning the late 18th to the early 20th century.
Feeding America: The Historic American Cookbook Dataset. East Lansing: Michigan State University Libraries Stephen O. Murray and Keelung Hong Special Collections. https://lib.msu.edu/feedingamericadata/
The Feeding America: The Historic American Cookbook project, from which this dataset was derived, was made possible with funds from a 2001 IMLS National Leadership Grant. The project began September 1, 2001 and was completed August 31, 2003.
The "Feeding America: The Historic American Cookbook" dataset contains 76 plain text files of transcribed cookbook text, 76 XML files of encoded cookbook text, 1 XML file that includes metadata records for each cookbook in the dataset, and 1 DTD file that describes the schema that was used to encode the cookbooks.
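As a quick sanity check after downloading, the file counts described above can be compared against what is actually on disk. The following is a minimal sketch, not part of the dataset itself: the function name `inventory` is hypothetical, and it assumes the archives have already been extracted into a single folder.

```python
from collections import Counter
from pathlib import Path


def inventory(root):
    """Count the files under `root`, grouped by lowercase file extension."""
    counts = Counter()
    for path in Path(root).rglob("*"):
        if path.is_file():
            counts[path.suffix.lower()] += 1
    return counts
```

Calling `inventory("feeding_america")`, where `feeding_america` is a placeholder folder name, returns a count per extension. If the plain text files use a `.txt` extension (an assumption), the description above suggests 76 `.txt` files, 77 `.xml` files (76 cookbooks plus the metadata file), and 1 `.dtd` file.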
File Naming Conventions
- content_type - e.g. cookbook_text.zip
- bookname - e.g. amem.xml
File Sizes
- metadata - 293 KB
- plain text - 16 MB compressed, 64 MB uncompressed
- encoded text - 17.6 MB compressed, 78.9 MB uncompressed
- dtd - 21 KB
The text is highly faithful to the original cookbooks because it was transcribed by hand rather than generated with optical character recognition (OCR). The text data is further enhanced by the encoding of features such as recipes, recipe types, ingredients, measurements, and cooking implements.
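One way to see which features are encoded in a given cookbook is to tally the element names in its XML. The sketch below uses only the Python standard library. The function name `count_elements` is hypothetical, and the element names in the sample string are invented for illustration; the actual names are defined in the dataset's DTD.

```python
import xml.etree.ElementTree as ET
from collections import Counter


def count_elements(xml_text):
    """Return a Counter mapping each element name to how often it occurs."""
    root = ET.fromstring(xml_text)
    return Counter(element.tag for element in root.iter())


# Invented sample markup; real tag names come from the dataset's DTD.
sample = (
    "<book><recipe>"
    "<ingredient>flour</ingredient><ingredient>butter</ingredient>"
    "</recipe></book>"
)
print(count_elements(sample).most_common())
```

For a file on disk, `ET.parse(path).getroot()` can take the place of `ET.fromstring`. If the encoded files rely on entities declared in the DTD, the standard-library parser may reject them, in which case a DTD-aware parser would be needed.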
Data description prepared by Thomas Padilla, Devin Higgins, and Lucas Mak.
Credit is also due to Ruth Ann Jones for writing the DTD that defines the schema used to encode the cookbooks, and to Amy Vance for leading the encoding process. Data is derived from the Feeding America: The Historic American Cookbook Collection.