We are happy to announce the release of the automatically annotated Sahidic Old Testament corpus (corpus identifier: sahidic.ot), based on the version of the available texts kindly provided by the CrossWire Bible Society SWORD Project thanks to work by Christian Askeland, Matthias Schulz and Troy Griffitts.
The corpus is available for search in ANNIS, much like the Sahidica New Testament corpus, together with word segmentation, morphological analysis, language of origin for loanwords, part of speech tagging and automatically aligned verse translations (except for parts of Jeremiah). Please expect some errors, due the fully automatic analysis in the corpus. The aligned translation is taken from the World English Bible. Here is an example search for the word ‘soul’:
You can also read entire chapters in ANNIS or at our repository, which look like this:
We hope that this resource will be helpful to Coptic scholars – please let us know if you have any questions or comments!
We have concluded our round of “startup” funding from the National Endowment for the Humanities Office of Digital Humanities. Our White Paper documents our activities and our outcomes for the period, including the following grant products:
- A Digitized Coptic Corpus in Multiple Formats and Visualizations
- Digital and Computational Tools (tokenizer, part of speech tagger, lemmatizer, and more and more)
- ANNIS Database instance to query and search the multilayer corpus
- Documentation in the toolsets, on our wiki, and on our blog
- Web application for users to reading and cite visualizations of textual data
- Symposium and workshop (“Digital Coptic 2,” March 2015) at Georgetown U + public tutorial and workshop at the Coptic Congress
- Articles and conference papers to distribute the results of our work
CHECK IT OUT! We heartily thank the NEH ODH for its support, as well as the NEH Preservation and Access division for their concurrent grant. We also thank all of our participants, contributors, and collaborators, who are numerous and are outlined in the White Paper.
White Paper for NEH ODH Startup Grant
See also our White Paper for the P&A grant submitted in August.
We at Coptic SCRIPTORIUM have been fortunate to have received three grants from the National Endowment for the Humanities for our work. We cannot thank the NEH enough for its support. So much of what we have done over the past 2+ years could not have happened without this funding.
We just completed a White Paper paper for a Foundations grant from the Humanities Collections and Reference Resources program in the Division of Preservation and Access. The grant, “Coptic SCRIPTORIUM: Digitizing a Corpus for Interdisciplinary Research in Ancient Egyptian,” ran from May 2104 until now.
Our White Paper documents our work and especially the standards and practices we developed for digitizing a pilot Coptic corpus.
If you want to know more about what truly interdisciplinary DH work looks like, check it out. We try to break down the complexities of creating a digital corpus for research in linguistics, history, religious studies, biblical studies, manuscript studies. We’ve got data models, workflows, digitization standards, transcription guidelines, and more all laid out for you here.
There is so much more to do; this is a only start. Thanks to everyone who has had faith in our work.
White Paper, NEH Grant PW-51672-14 (Preservation and Access): “Coptic SCRIPTORIUM: Digitizing a Corpus for Interdisciplinary Research in Ancient Egyptian” 29 August 2016
Amir Zeldes and Caroline T. Schroeder have recently published an article in Digital Humanities Quarterly about the need for digital tools and a digitized corpus for Coptic, and research questions that drive Coptic SCRIPTORIUM.
“Raiders of the Lost Corpus” is freely available on the DHQ website as part of a special issue on Digital Methods and Classical Studies edited by Neil Coffee and Neil W. Bernstein. Schroeder presented an earlier version of this paper at the Digital Classics conference at the University at Buffalo in 2013.