Thanks to Judith, I have been playing with JSTOR’s Data for Research (DfR). It provides a faceted way of visualizing and searching the entire JSTOR database. Features include:
- Full-text and fielded searching of the entire JSTOR archive using a powerful faceted search interface. Using this interface one can quickly and easily define content of interest through an iterative process of searching and results filtering.
- Online viewing of document-level data including word frequencies, citations, key terms, and ngrams.
- Request and download datasets containing word frequencies, citations, key terms, or ngrams associated with the content selected.
- API for content selection and retrieval. (from the About page)
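To make "word frequencies" and "ngrams" concrete, here is a minimal Python sketch of how such counts can be computed from a piece of text. It does not call the DfR API or reproduce JSTOR's actual processing; the sample sentence, the tokenizer, and the function names are all my own assumptions for illustration.

```python
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into simple word tokens (hypothetical tokenizer)."""
    return re.findall(r"[a-z']+", text.lower())


def ngrams(tokens, n):
    """Return every run of n consecutive tokens, joined by a space."""
    return [" ".join(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


# Made-up sample text, not taken from JSTOR.
sample = "the archive exposes the archive data"

tokens = tokenize(sample)
word_freq = Counter(tokens)                # counts of single words
bigram_freq = Counter(ngrams(tokens, 2))   # counts of two-word sequences

print(word_freq.most_common(2))
print(bigram_freq.most_common(1))
```

A dataset of this kind is just these counts stored per document, which is why it can support queries that ordinary search cannot.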
I’m impressed by how much they expose. They even have a Submit Data Request option and an API. This is important – we are seeing a large-scale repository open its information to new types of queries that go beyond search.