VEP Early Modern Drama Collection
We have curated three corpora of drama-related texts. These corpora are differentiated by a widening definition of what constitutes ‘drama,’ and an extended cut-off date. Each corpus is released under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.
Before downloading the corpora, read about the format of the text files here, and about our text processing workflow. Corpora are generated from Text Creation Partnership (TCP) XML files.
Please note that our download corpora do not contain texts from EEBO-TCP Phase II, which will not be in the public domain until five years after the completion of the TCP project for Phase II. However, you can download and explore metadata and Ubiqu+ity token-counts from both the Phase I and Phase II texts–and you can use Ubiqu+Ity to tag all of the files in any corpus with your own rules.
Early Modern Drama Corpora
Core Drama 1660
The ‘core’ group of Early Modern dramatic texts: professional and other plays intended for performance. Includes translations of plays and closet drama. Cut-off date: 1660.
There are 554 plays in this corpus, of which 471 are EEBO-TCP Phase I texts.
Only EEBO-TCP Phase I texts are available for download.
However, metadata and statistical analysis is available for all plays in the corpus from the Metadata Builder.
- Download Core Drama 1660 SimpleText plain text files
- zip contents: 471 unrestricted SimpleText plain text files; README_SimpleText_files.txt; tcp_restricted.txt
- size: 18.7 MB zipped; 48 MB unzipped
- Download Core Drama 1660 Metadata (via the Metadata Builder)
- Download Core Drama 1660 Metadata README (PDF)
- Download Core Drama 1660 Ubiqu+Ity Tokens Files
- zip contents: 471 Ubiqu+Ity Tokens csv files, TextViewer.html, VEP_Core_Drama_1660_v2_ubiq_ds321.csv, README_Ubiquity_tokens_files.txt,
- size: 43.1 MB zipped; 241 MB unzipped
- Download Core Drama 1660 1-Grams (csv, right-click save as)
Expanded Drama 1660
This corpus expands the ‘Core Drama 1660’ corpus with a wider definition of what constitutes a dramatic text, to include masques and entertainments. Cut-off date: 1660.
There are 666 plays in this corpus, of which 569 are EEBO-TCP Phase I texts.
Only EEBO-TCP Phase I texts are available for download.
However, metadata and statistical analysis is available for all plays in the corpus from the Metadata Builder.
- Download Expanded Drama 1660 SimpleText plain text files
- zip contents: 569 unrestricted SimpleText plain text files; README_SimpleText_files.txt; tcp_restricted.txt
- size: 20.4 MB zipped; 52.4 MB unzipped
- Download Expanded Drama 1660 Metadata (via the Metadata Builder)
- Download Expanded Drama 1660 Metadata README (PDF)
- Download Expanded Drama 1660 Ubiqu+Ity Tokens Files
- zip contents: 569 Ubiqu+Ity Tokens csv files, TextViewer.html, VEP_Expanded_Drama_1660_v2_ubiq_ds321.csv, README_Ubiquity_tokens_files.txt
- size: 67.4 MB zipped; 263 MB unzipped
- Download Expanded Drama 1660 1-Grams (csv, right-click save as)
Expanded Drama 1700
This corpus aims to include one copy of all dramatic texts in print up to 1700: it contains professional and other plays intended for performance, translations, closet drama, masques, and entertainments.
There are 1,244 plays in this corpus, of which 1,008 are EEBO-TCP Phase I texts and 1 is an ECCO-TCP text.
Only EEBO-TCP Phase I texts and the ECCO-TCP text are available for download.
However, metadata and statistical analysis is available for all plays in the corpus from the Metadata Builder.
- Download Expanded Drama 1700 SimpleText plain text files
- zip contents: 1,009 unrestricted SimpleText plain text files; README_SimpleText_files.txt; tcp_restricted.txt
- size: 36.3 zipped; 93.2 MB unzipped
- Download Expanded Drama 1700 Metadata (via the Metadata Builder)
- Download Expanded Drama 1700 Metadata README (PDF)
- Download Expanded Drama 1700 Ubiqu+Ity Tokens Files
- zip contents: 1,009 Ubiqu+Ity Tokens csv files, TextViewer.html, VEP_Expanded_Drama_1700_v2_ubiq_ds321.csv, README_Ubiquity_tokens_files.txt
- size: 83.9 MB zipped; 468 MB unzipped
- Download Expanded Drama 1700 1-Grams (csv, right-click save as)
Additionally, you can download a list of All Known Texts. This is not a downloadable corpus, since in some cases it lists texts not available in the TCP. This list includes texts which appear more than once in the EEBO-TCP.
This list has 1,554 entries, of which 1,292 are TCP texts-of these, 1,046 are EEBO-TCP Phase I texts and 1 is an ECCO-TCP text.
- Download list of All Known Texts for the Early Modern Drama Collection (csv, right-click save as)
Credits: Metadata was prepared by Jonathan Hope and Beth Ralston. XML files were processed and curated by Deidre Stuffer for release as plain text files.