Corpora

VEP Early Modern 1080 Collection

Before downloading the Early Modern 1080 corpus, read about the format of the text files here, and about our text processing workflow. Corpora are generated from Text Creation Partnership (TCP) XML files. Our downloads do not contain texts from EEBO-TCP Phase II, which will not be in the public domain until five years after the completion of the TCP project for Phase II. The corpus is released under a Creative Commons Attribution-NonCommercial-ShareAlike 4. Read more…

VEP Early Modern Drama Collection

We have curated three corpora of drama-related texts. These corpora are differentiated by a widening definition of what constitutes ‘drama,’ and an extended cut-off date. Each corpus is released under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License. Before downloading the corpora, read about the format of the text files here, and about our text processing workflow. Corpora are generated from Text Creation Partnership (TCP) XML files. Please note that our download corpora do not contain texts from EEBO-TCP Phase II, which will not be in the public domain until five years after the completion of the TCP project for Phase II. Read more…

VEP Early Modern Science Collection

VEP proudly presents two corpora of early modern scientific writing, curated by Alan Hogarth. They are released under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License. Before downloading the corpora, read about the format of the text files here, and about our text processing workflow. Corpora are generated from Text Creation Partnership (TCP) XML files. Please note that our download corpora do not contain texts from EEBO-TCP Phase II, which will not be in the public domain until five years after the completion of the TCP project for Phase II. Read more…

VEP Shakespeare Collection

The following corpora are released under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License. Before downloading the corpora, read about format of the text files here, and about our text processing workflow. Corpora are generated from Text Creation Partnership (TCP) XML files. Our downloads do not contain texts from EEBO-TCP Phase II, which will not be in the public domain until five years after the completion of the TCP project for Phase II. Read more…

VEP TCP Collection

The following corpora are released under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License. Before downloading the corpora, read about the format of the text files here, and about our text processing workflow. Corpora are generated from Text Creation Partnership (TCP) XML files. Please note that our download corpora do not contain texts from EEBO-TCP Phase II, which will not be in the public domain until five years after the completion of the TCP project for Phase II. Read more…