Data for Research is a free service from JSTOR available to the broader research community, including independent scholars, computer scientists, digital humanists, and really anyone interested in the study of mining data for the purpose of uncovering new trends and patterns valuable to current scholarship and a deepening understanding of the humanities.
Created in 2008, DfR is a self-service tool that allows users to select and interact with content data in the JSTOR archive such as data from scholarly journal literature (more than 7 million articles) and a set of primary resources (26,000 19th Century British Pamphlets).
Specific web-based tools in the DfR interface include:
- a powerful faceted search interface that can be leveraged to define content of interest through an iterative process of searching and results filtering
- word frequencies, citations, key terms, and n-grams utilized for conducting analysis of document-level data
- topic modeling (classification of subject headings at the article level), a powerful tool for content selection and filtering
- downloadable datasets containing word frequencies, citations, key terms, or n-grams associated with the content selected
- visualization tools
The self-service data set options available to the user are limited to up to 1,000 articles.
Contact the DfR Staff
Please fill out the form below if you have any questions/would like to look into more expansive data sets.