Coptic SCRIPTORIUM Blog

Page 2 of 10

Hiring for a 1-year Postdoctoral Fellow

Coptic Scriptorium is hiring for a 1-year, full-time Postdoctoral Fellow to work with us on expanding the number of Coptic texts we have available and on annotating those texts.

Full details are in the job ad. This position is remote with virtual meetings and occasional travel to the University of Oklahoma or Georgetown University or other work sites (travel funding covered by OU).

If you are considering applying but have a dissertation defense/completion date firmly set during August (rather than prior to August 1), we can consider your application; please note in your letter your specific timetable, and if we decide to interview you we can discuss the timetable in the interview.

If you applied for the summer position previously advertised and would like to apply for this position, as well, please do send a new full application for this position.

We will begin reviewing applications next week and will conduct virtual interviews either at the end of this month or very early June. This position is funded jointly by a grant from the National Endowment for the Humanities and the University of Oklahoma Office of the Vice Provost for Research and Partnerships.

OU-CS-Digital-Humanities-Research-Fellow-Job-Ad-Fall-2023 Download

Hiring for a part-time summer position!

May 2, 2023 / ctschroeder / 0 Comments

We are hiring for a summer part time position! The full description is below, but the highlights are:

must know Coptic
10-20 hours per week
remote work
supervised by Prof. Caroline T. Schroeder at the University of Oklahoma
position begins May 15 or as soon thereafter as the hiring paperwork etc. can be completed
send a letter, CV, and names/contact info for 2 references to WGS@ou.edu

In addition, because of the nature of the position, we can only hire someone who is in the US and eligible to work in the US.

We will begin reviewing applications May 8.

This is perfect for a grad student, recent PhD, or part-time academic looking for some extra income over the summer

Busy this summer but wish you could apply? We will be hiring for one or two more positions to start in August or September. Watch this space!

OU-Digital-Humanities-Specialist-Job-Ad-Summer-2023 Download

Coptic Scriptorium Awarded an NEH Grant to Expand Corpora and Add More Dialects

April 19, 2023 / ctschroeder / 0 Comments

The Coptic Scriptorium team is honored to have been awarded an NEH Preservation and Access/Humanities Collections and Reference Resources Implementation Grant in the amount of $349,887. This award will fund a 3-year project Expanding Coptic Digital Online Collections. You can read the press release and list of awarded grants on the NEH site. This initiative will enable Coptic Scriptorium to improve the user experience, to expand our digital database of richly annotated texts in the Sahidic dialect, and to develop natural language tools and searchable, annotated, digitized corpora for additional dialects, including Bohairic. Caroline T. Schroeder (University of Oklahoma) is PI, and Amir Zeldes (Georgetown University) is co-PI. The team also includes Rebecca Krawiec (Canisius College), Christine Luckritz Marquis (Union Presbyterian Seminary), and Hany Takla (St. Shenouda Society), as well as a diverse advisory board. We thank our whole team past and present for the work that led to this stage, and we are grateful to the National Endowment for the Humanities for their ongoing support.

New Corpora Release 4.4.0

September 30, 2022 / Amir Zeldes / 0 Comments

Searching for Greek words in Shenoute’s *So Concerning the Little Place*

We are pleased to announce release 4.4.0 of Coptic Scriptorium! Our data now includes over 1,267,000 tokens of searchable, linguistically analyzed Coptic data from dozens of ancient Coptic works (an increase of almost 100,000 tokens from the previous release). We are very grateful to all of our collaborators and contributors, without whom this project could not function.

This release corrects a large number of consistency errors identified in our existing data, and also adds some new documents:

Sections of three works by Shenoute of Artipe:
New documents added to existing works:
- Acephalous Work 22
- Apophthegmata Patrum
The remaining books 2-4, as well as the postscript of Pistis Sophia, which are now added to the previously released book 1 in our online interfaces
Newly treebanked data with syntactic gold standard annotations for the Life of John the Kalybites, part 1

We would like to thank the Marcion Project for making the underlying digitized text of Pistis Sophia available, and all of the annotators for their hard work. Tamara Siuda, Rebecca Krawiec, Philippe Zaher, and Lance Martin contributed, in addition to Amir and Carrie. As our current DHAG grant ends, we would like to give special thanks to Lance, who has been working as our DH specialist on the project since 2019, for doing an amazing job of keeping track of all the data and the various tasks he’s been in charge of over the past three years!

As with all releases, raw machine readable data for all corpora can be found, including morphological and syntactic analysis, as well as named entity recognition and entity linking, on our GitHub repository, in a variety of popular formats:

https://github.com/copticscriptorium/corpora

You can also search for complex linguistic annotations in the data using our ANNIS server – please see our new tutorial here to get started with some query tips and a helpful cheat sheet:

https://copticscriptorium.org/ANNIS_tutorial

We hope this release will be useful and look forward to the next one as always!

New links for tools and services

August 5, 2022 / ctschroeder / 0 Comments

After our recent server outage, we’ve been re-installing our tools and software. Some of our services are now available at new URLs.

The ANNIS database is now at https://annis.copticscriptorium.org/annis/scriptorium

Our Sahidic Coptic natural language processing tools are at https://tools.copticscriptorium.org/coptic-nlp

Our GitDox annotation tool is at https://tools.copticscriptorium.org/gitdox/scriptorium

The Coptic Dictionary online is still at https://coptic-dictionary.org, and our tool for browsing and reading texts is still at https://data.copticscriptorium.org

Thanks for your patience!

Coptic Scriptorium services are back online!

July 14, 2022 / ctschroeder / 0 Comments

Thank you to Amir and the staff at Georgetown University. Most of our public applications, such as the ANNIS database and the Coptic Dictionary Online, are back in service.

Coptic Dictionary and ANNIS database down

June 19, 2022 / ctschroeder / 0 Comments

We are sorry to report that the server that hosts the Coptic Dictionary Online and Coptic Scriptorium’s ANNIS database are down. (Likewise some of the NLP tools and internal tools like GitDox are down.)

We are working on fixing the problem, but for now we do not have a timeline for when they will be up and running.

In the meantime reading and browsing texts at http://data.copticscriptorium.org still work.

Thank you for your patience! We will let you know when the systems are up again.

New Corpora Release 4.3.0

May 3, 2022 / Amir Zeldes / 0 Comments

The opening lines of Pistis Sophia

It is our pleasure to announce release 4.3.0 of Coptic Scriptorium corpora, which currently cover over 1,175,000 tokens of searchable, linguistically analyzed Coptic data from dozens of ancient Coptic works. New in this release:

The History of Eustathius and Theopiste (hagiography, annotations by Lance Martin)
Pistis Sophia, book 1 (Gnosticism, annotations by Lance Martin, Tamara Siuda, Caroline T. Schroeder and Amir Zeldes)
Life of Pisentius, part 3 (hagiography, annotations by Tamara Siuda, Lance Martin, Caroline T. Schroeder)

Corrections and additional annotations:

Pilot work adding partial Arabic translations (work by Philippe Zaher)
- Apophthegmata Patrum
- Abraham our Father by Shenoute
Improvements and error corrections to a variety of works (including Because of You Too O Prince of Evil, Dormition of John, Book of Ruth and Homilies of Proclus)

The newly released material encompasses over 57,000 tokens of semi-automatically annotated data. We would like to give special thanks to the Marcion Project for making much of the underlying digitized text available, and the annotators whose hard work has made this release possible. As with all releases, raw machine readable data for all corpora can be found, including morphological and syntactic analysis, as well as named entity recognition and entity linking, on our GitHub repository, in a variety of popular formats:

https://github.com/copticscriptorium/corpora

We hope this release will be useful and look forward to the next one!

Example of research using the online Coptic Dictionary: standalone G Thomas transcription

December 21, 2021 / ctschroeder / 2 Comments

Martijn Linssen, an independent researcher, has been working on the Gospel of Thomas for some time and recently published a stand-alone “interactive Coptic-English translation” of the Gospel of Thomas on his Academia.edu site. The Coptic is linked to entries in the online Coptic Dictionary! We invite you to check it out!

We are always excited to see what kind of work people are doing with our project. Please get in touch if you’ve been using the dictionary or any of Coptic Scriptorium’s tools, corpora, annotations, etc., in your work!

The online dictionary is part of the KELLIA collaboration between Coptic Scriptorium (Georgetown University and the University of Oklahoma), the Berlin-Brandenburg Academy, the Goettingen Academy, the Free University in Berlin, and Goettingen University.

New Corpora Release 4.2.0

September 30, 2021 / Amir Zeldes / 0 Comments

**Automatic linguistic analysis and Entity Linking from I Samuel 25**

It is our pleasure to announce the latest data release from Coptic Scriptorium, version 4.2.0. This release contains both new Coptic material and additions to older datasets, as well as expanding our entity annotations and named-entity linking to all of our data, including the semi-automatically annotated Old Testament. The also means automatic updates to all of our interfaces, such as the recently added example usage functionality in the Coptic Dictionary Online, which is linked to the corpora.

The new material, including more digitized data courtesy of the Marcion project, as well as manually digitized and corrected OCR data from out of print editions includes:

Encomium of Pseudo-Celestinus on Victor (annotations by Mitchell Abrams and Lance Martin)
Encomium of Pseudo-Flavianus on Demetrius, Archbishop of Alexandria (annotations by Mitchell Abrams, Lance Martin and Amir Zeldes)
Added works by Shenoute of Atripe:
- In the Night (Canons 9, annotations by Lance Martin, Caroline T. Schroeder and Amir Zeldes)
- Because of You Too O Prince of Evil (Discourses 4, annotations by Tamara Siuda, Lance Martin and Caroline T. Schroeder)
Expansions and improvements of existing corpora:
- More Apophthegmata Patrum (work by Christine Luckritz Marquis, So Miyagawa, Caroline T. Schroeder and Amir Zeldes)
- Further material from Shenoute’s works:
  - God Says Through Those Who Are His (including parallel witnesses and new material, data courtesy of David Brakke, annotations by Rebecca Krawiec, Lance Martin, Dana Robinson, Caroline T. Schroeder)
  - Acephalous Work 22 (data courtesy of David Brakke, annotations by Elizabeth Davidson, Rebecca Krawiec, Elizabeth Platte, Caroline T. Schroeder, Amir Zeldes)
- More syntactically annotated gold treebanked data in the Coptic Treebank
- Completely re-annotated Old Testament corpus, based on the base text courtesy of the Digital Edition of the Coptic Old Testament (CoptOT) project – with improved segmentation and parsing, now complete with semi-automatic entity recognition and linking to Wikipedia entries for people and places

With this new release, the semi-automatically annotated data (excluding automatically processed Bible materials) in the project covers close to 300,000 words of Sahidic Coptic annotated for entities.

This release represents a tremendous amount of work over the past few months by the Coptic Scriptorium team. We would also like to thank individual contributors (which you can always find in the ‘annotation’ metadata for each document), and specifically So Miyagawa for help with Coptic OCR models, as well as the Marcion and CoptOT project for sharing their data with us, and the National Endowment for the Humanities for supporting us. We are continuing to work on more data, links to other resources and new kinds of annotations and tools. Please let us know if you have any feedback!