Notice: The Records Management program is currently in the process of transitioning from the University Archives and Historical Collections (UAHC) to the Office of Audit, Risk and Compliance (OARC). If you have retention based records that will eventually be destroyed and need to be stored off-site, please send an email to OARC.temporaryrecords@msu.edu.
Notice: The Records Management program is currently in the process of transitioning from the University Archives and Historical Collections (UAHC) to the Office of Audit, Risk and Compliance (OARC). If you have retention based records that will eventually be destroyed and need to be stored off-site, please send an email to OARC.temporaryrecords@msu.edu.
Notice: Beginning Tuesday, January 21st, all MSU all students, staff, and faculty must scan their MSU ID to access the MSU Library building between 10:00 pm – 7:30 am on Sunday - Thursday.
Notice: Beginning Tuesday, January 21st, all MSU all students, staff, and faculty must scan their MSU ID to access the MSU Library building between 10:00 pm – 7:30 am on Sunday - Thursday.

Website Archive Scan Request

This form is for MSU Faculty and Staff to submit requests to scan a webpage or a site for archival purposes. The archive will be publicly available after scanning. There will be default duration and data limits set for every crawl.

Requests should be made at least one week in advance.

Contact Information
Name
Scan Details
What date do you need the crawling to start on? Depending on the quantity of content, it may take more than a day to scan.
It is possible to set up a recurring scan. By default, we’ll assume only a one-time scan is needed. If you’d like a recurring scan, please indicate the frequency.
Seed Type
Seed will be crawled and archived according to the default crawl scope. We recommend using this seed type for most websites. We will be using Brozzler.
Seed will be archived according to the default crawl scope, and will also include the first page of any URLs directly linked off of your seed. Examples of content that could be scoped in include social media feeds, YouTube channels, news articles on other platforms etc. This may significantly increase the amount of data the crawler is able to capture so a test crawl will be done first, if you choose to use this seed type.
Only the first page of your seed will be archived. Links to other pages will not be crawled. We recommend using this seed type for things like newspaper articles, blog posts, Wikipedia entries, and any other pages that you wish to archive without archiving their entire contexts.
The first page of your seed, as well as the first page of any URLs directly linked off of your seed, will be archived. We recommend using this seed type for things like news feeds or other pages that contain a list of links to pages on other domains that you would nevertheless like to capture. Examples of content that could be scoped in include social media feeds, YouTube channels, news articles on other platforms etc.
Add the URL(s) for the pages you would like us to archive. These are known as Seeds.
Website URL(s) to Archive (Seeds)
Please provide any extra information concerning your request.