Where does data come from? In this assignment, you will make a deep dive into the archives to find an original, data primary source that predates the era of mass digitization.
Part I: Identify and visit a dataset in the archives.
Find a pre-digital data source at a Boston-area archive or museum, and look at it. Take careful notes. You must go to the archive itself; use of online sources is prohibited, and only with special permission may you visit an object in the archives that already exists online.
What is a pre-digital data source? As we’ve discussed, in some ways almost anything could be seen as data; and data storage takes many different forms. But for the purposes of this assignment, you should choose something that most everyone would agree to align closely with modern quantitative data. That is, they should be structured stores of information that today would be kept in a spreadsheet or database. It is very likely that most good sources will be handwritten, though they may have been made on a typewriter. Without explicit prior permission, you may not use as a source any commercially printed books, any collections of letters or correspondence, or any printed maps. Some useful keywords to search for on finding aids may be “ledger”, “account book”, “logbook”, “log”, “tables.”
Take care in choosing a source. Double entry bookkeepping is notoriously difficult to understand, for example: and handwriting from before about 1860 can be extremely difficult to read. And don’t be afraid to change your source in the archives for a better one.
Planning your visit
It will take some time to get to your archive and explore the items there. Be sure to plan your time in advance, and know that most archives are open only Monday-Friday, 9am-5pm. We have left two full weeks for you to make your archival visit, so you should have time to make a visit. If you do not, contact us immediately.
The Boston area is full of archives, and in doing this assignment you should take the possibility to explore them. The point of this exercise is to explore sources around Boston, so only by special permission may you choose Northeastern’s own archives. We strongly encourage you to explore more interesting archives or sources; choosing an interesting source will be reflected well upon.
Some of the major archives within walking distance of Northeastern are:
- The Boston Public Library (Copley)
- The Massachusetts Historical Society (Fenway)
- The Countway Library for the History of Medicine (Harvard Medical School, Fenway)
Slightly farther afield, but accessible via public transportation, are:
- The Massachusetts State Archives (Dorchester; Red line.)
- The National Archives and Records Administration. (Waltham, Bus)
- Houghton Library (Harvard University, Red line).
- The MIT Institute Archives & Special Collections. (MIT, red line).
- The City of Boston Archives (West Roxbury, Commuter rail).
There are also countless smaller archives for particular institutions, from clubs and immigrant societies to political groups to churches. If you have any academic or extracurricular interests, you might be able to find a particularly interesting source.
The Appalachian Mountain Club keeps logbooks of everyone who has climbed a mountain in their network. The Boston Symphony Orchestra and the Museum of Fine Arts have their own archives. Most churches, synagogues, or mosques would have records about their early membership. If there is an institution or subject area you’re interested in, we’ll see what we can find.
You’ll want to contact the archive before you arrive to make sure they have the material on site, and that it’s open to the public.
Archival policies on computers, photography, and so forth vary. If you have a digital camera, it’s a good idea to bring it; ask whether you can post any pictures of the documents online.
Part II. Writeup.
You should write up your artifact as a 5-7 page paper. The page length does not include images. 5-7 pages should be about 1600 to 2300 words double-spaced in 12-point font. The point of this is not historical argumentation, but close and detailed description that points to the limits of what you know about the artifact and what can be known about it. Structure your paper in ways appropriate for the artifact.
In writing up, you should consider including the following elements or addressing the following questions.
- Who created and stored the information inside? (Was it an individual? A clerk for a larger institution? You should feel free to speculate and admit what you don’t know.)
- Images and/or representations that describe what the data looks like or how confusing elements appear.
- How is the data organized? What could you learn about the goals, preferences, and worldview of the people who created it from its organization?
- Are there idiosynracies in the way the data was collected? Unexpected features? Highlight these and describe why they might take the form they do.
- Is there something, or are there several things, you don’t understand about the data? Is something arranged strangely? Is there an abbreviation you don’t know? Ideally these questions should be open enough that someone in the class might have an idea: indeed, you should have a few suggestions yourself.
Do not include a description of the archive, except insofar as it actually effects the data you’re looking at.
Outline–but do not implement!–a plan for digitizing the data here into a form that could be used for further research. If you were going to store it in a digital spreadsheet or database, what sort of fields would you collect information on? Could you store the information in a spreadsheet or database? What sorts of questions could you answer by having the entire dataset digitized? (Assume that you have all the technical analysis capabilities needed to do so). How much time and effort would it take to create a digital version? What aspects of the document might be lost in your planned transition?
Turn in your paper as a PDF over blackboard, by Monday October 10 at 5pm.
Your paper will be graded on the basis of:
- The apparent effort you have put into finding and examining a distinctive item from an archive. Choosing a particularly interesting source, or a particularly archive to visit, can count on your behalf.
- Your ability to describe interesting features of the item you exam, and to raise questions about how the data was collected.
- The specificity of your description.
- The thoughtfulness of your digitization plan, and your ability to thoughtfully describe both the benefits and losses of the choices you make for digitizing.
- Whether your paper successfully addresses the questions about data laid out above.
Should you use any secondary works, you should cite the works that you quote and refer to in the text in a consistent format. We recommend the Chicago documentary note format: with it, you give a full citation the first time you use a text, and smaller ones later. For short papers like this, you may omit the final bibliography. If you prefer to use a social-science author-date format with final bibliography, that is also acceptable.
If you are worried about formatting your citations correctly or keeping track of the sources you use, I strongly recommend the open-source citation software Zotero. This will automatically pull citations from the web, and you can drag and drop into a paper to get a formatted citation. Just be aware that online library sources may give you extraneous information, such as the language or a URL. Edit the fields in the library until drag-and-drop gets you good results.
Your primary source for this assignment will be the archival document(s) that you describe: be sure to cite it according to the standards of the archive. This will mean, at a minimum, that you’ve described it comprehensively enough so that a future researcher could easily find it at the archives themselves.
You should also acknowledge any archivists or peers who help you to better understand the materials. Such acknowledgements would typically come either as a footnote to the first paragraph (for general assistance) or as a footnote to the specific place you received help. You don’t need to thank me!