Tag: Luckritz Marquis

Spring 2019 Corpora Release 2.7.0

We at Coptic Scriptorium are pleased to version 2.7.0 of our corpora.  The release includes several new documents:

  • several more sayings in the Coptic Apophthegmata Patrum (edited & annotated by Marina Ghaly)
  • additional fragments of Shenoute’s sermon Some Kinds of People Sift Dirt (edited & annotated by Christine Luckritz Marquis, editions provided by David Brakke)
  • Besa’s letter On Vigilance (edited and annotated by So Miyagawa and others)
  • several more fragments of the monastic canons of Apa Johannes (annotated by Elizabeth Platte and Caroline T. Schroeder, digital edition provided by Diliana Atanassova)

All documents have metadata for word segmentation, tagging, and parsing to indicate whether those annotations are machine annotations only (automatic), checked for accuracy by an expert in Coptic (checked), or closely reviewed for accuracy, usually as a result of manual parsing (gold).

You can search all corpora at https://corpling.uis.georgetown.edu/annis/scriptorium and download the data in 4 formats (relANNIS database files, PAULA XML files, TEI XML files, and SGML files in Tree-tagger format).

Our total annotated corpora are now at over 780,000 words; corpora that have human editors who reviewed the machine annotations amount to over 100,000 words.

Enjoy!

Recent presentations by Coptic Scriptorium team members (post 1 of 2)!

This fall, Coptic Scriptorium team members have presented their work in a number of environments.

Research Talk, Georgetown University Linguistics Speaker Series

In September, as part of the Georgetown University Department of Linguistics Friday Speaker Series, the project presented a summary of our latest work and our goals for the new NEH Digital Humanities Advancement Grant we received. “A Linked Digital Environment for Coptic Studies”.  Caroline T. Schroeder provided an overview of the project. Amir Zeldes presented the technology required to machine-process Coptic text in order to produce an annotated, digital corpus and linked online lexicon. Rebecca Krawiec discussed the research potential of an annotated digital corpus for research in early monasticism. Elizabeth Platte introduced the concept of linked data and demonstrated our linked geographic data features. (Christine Luckritz Marquis was scheduled present research on space and place in monastic literature but was unfortunately sidelined by a hurricane.)

Rebecca Krawiec, Elizabeth Platte, Amir Zeldes, Caroline T. Schroeder at Georgetown University, 2018

Rebecca Krawiec, Elizabeth Platte, Amir Zeldes, Caroline T. Schroeder at Georgetown University, 2018

Material of Christian Apocrypha Conference

In December, Caroline T. Schroeder gave a paper at the Material of Christian Apocrypha Conference hosted at the University of Virginia, under the auspices of the North American Society for the Study of Christian Apocryphal Literature.  Dr. Schroeder’s paper, “The Materiality of Digital Apocryphal Studies,” addressed the role of digital humanities in studying the colonial history of manuscripts, people and places in early Christian literature, and public humanities.  It was part of a panel on Christian Apocrypha and the Digital Humanities, which also included papers by James Walters (Rochester College) on “The Digital Syriac Corpus: A New Resource for the Study of Syriac Texts” and  Brandon Hawk (Rhode Island College) on “The Medieval Social Network of the Gospel of Pseudo-Matthew”.  Datasets used in the presentation are available at Dr. Schroeder’s GitHub site.

Caroline T. Schroeder presenting about the manuscripts digitized by Coptic Scriptorium

Caroline T. Schroeder presenting about the manuscripts digitized by Coptic Scriptorium

Caroline T. Schroeder presenting visualizations of occurrences of proper names in some of Coptic Scriptorium's corpora

Caroline T. Schroeder presenting visualizations of occurrences of proper names in some of Coptic Scriptorium’s corpora

New Release of Corpora

We’re pleased to announce that we’ve released more texts in our corpora.

The Sayings of the Desert Fathers (Apophthegmata Patrum) corpus now contains 52 sayings/apophthegms (>7100 words).  We have edited previously published sayings for consistency in annotation, and we’ve released new sayings edited by Christine Luckritz Marquis, Elizabeth Platte, and our newest contributor, Dana Robinson.  Read or browse the Sayings online.  Click on the “Analytic” button to see read a saying in Coptic with a parallel English translation + part of speech tags for each Coptic word.

Or click on the “Norm” button (short for “normalized”) to read the Coptic.  Clicking on any Coptic word in the normalized visualization will take you to an online Coptic-English dictionary.  Hovering your cursor over a passage in the normalized visualization will show the English translation in a pop up window.

AP 96 Normalized view screenshot

AP 96 Normalized view screenshot

Shenoute’s I See Your Eagerness now has numerous new manuscript fragments published (over 16,000 words).  We also have edited previously published witnesses for consistency in annotation.  These documents were transcribed and collated from the manuscripts by David Brakke and annotated for digital publication by Rebecca Krawiec.  Now you can read Shenoute’s I See Your Eagerness in nearly its entirety in Coptic.  We provide several paths for you to explore this text:

  1. Read the text from start to end, beginning with the first manuscript fragment. Click “NEXT” to keep reading.
    MONB.GL fragment D diplomatic visualization

    MONB.GL fragment D diplomatic visualization

    (No English translation is provided, but in the “Note” metadata field below the Coptic, you can find page numbers for David Brakke’s and Andrew Crislip’s translation in their book, Discourses of Shenoute.)  “Next” and “Previous” buttons will take you through the path we consider optimal for reading the text. This path wanders through various manuscript witnesses, following the path with the fewest lacunae. Want to see parallel witnesses? Check out the “Witness” metadata field below the text.

    MONB.GL 29-30 metadata screenshot

    MONB.GL 29-30 metadata screenshot

  2. Read through all surviving pages in one codex/manuscript witness by filtering for a particular codex. Click through the documents in that codex.  For example, if you want to read through all the fragments of codex MONB.GL, go to data.copticscriptorium.org, and use the menu to filter by Corpus for the shenoute.eagerness corpus, and then filter by manuscript name for the MONB.GL codex.   Click through the documents in that codex.
  3. Perform a search/query in our ANNIS database.   For example, search for all occurrences of “wicked” (ⲡⲟⲛⲏⲣⲟⲛ) in the corpus.  Or, search for occurrences of “wicked” controlling for duplicate hits in parallel manuscript witnesses.  See our guide to queries in ANNIS  for more tips.

You also can download the entire corpus in TEI XML, PAULA XML, and relANNIS formats  from our GitHub site.

Coptic SCRIPTORIUM at the Coptic Congress

Much of the Coptic SCRIPTORIUM team is in Claremont this week for the Congress of the International Association of Coptic Studies.

We started out with a pre-conference, 2-day workshop with our KELLIA partners from Germany, where we worked on sharing data and technologies across digital Coptic projects.  Look here soon for an announcement about a really cool fruit of our labors.

Thursday there are two panels, and Friday there are two workshops.

Thursday 2-4 pm Coptic Digital Studies (Burkle 16)

David Brakke chair 

Prof. Dr. Caroline Schroeder, Coptic SCRIPTORIUM: A Digital Platform for Research in Coptic Language and Literature

Dr. Christine Luckritz Marquis, Reimagining the Apopthegmata Patrum in a Digital Culture

Prof. Amir Zeldes, A Quantitative Approach to Syntactic Alternations in Sahidic

Dr. Rebecca Krawiec, Charting Rhetorical Choices in Shenoute: Abraham our Father and I See Your Eagerness as case-studies

Thursday 4:30-6:30 Coptic Digital Humanities (Burkle 16)

Caroline T. Schroeder, Chair

Dr. Paul Dilley, Coptic Scriptorium beyond the Manuscript: Towards a Distant Reading of Coptic Texts

Mr. So Miyagawa and Dr. Marco Büchler, Computational Analysis of Text Reuse in Shenoute and Besa

Mr. Uwe Sikora, Text Encoding – Opportunities and Challenges

Ms. Eliese-Sophia Lincke, Optical Character Recogition (OCR) for Coptic. Testing Automated Digitization of Texts with OCRopy

 

Friday 11-12:30 Workshop on Coptic Fonts & Coptic Bible (AA)

Christian Askeland, Frank Feder

Friday 4:30-6 Digital Tools for Beginners (Workshop on Coptic SCRIPTORIUM)

Caroline T. Schroeder, Amir Zeldes, Rebecca S. Krawiec

2015 Society of Biblical Literature (SBL) Annual Meeting

The 2015 SBL conference is the largest gathering of scholars interested in the study of religion in the world. The latest research and the fostering of collegian contacts is present at each showcase. The 2015 meeting is taking place in Atlanta , GA and members of the Coptic SCRIPTORIUM  team will be presiding at the conference.

Digital Humanities in Biblical, Early Jewish, and Christian Studies; Papyrology and Early Christian Backgrounds will have Caroline T. Schroeder chairing , Taking place in International A (International Level) – Marriot on 11/23/2015 at 9:00 AM to 11:30 AM.

Also,

“A Wild Patience Has Taken this Far”: Future Avenues of Feminist Scholarship will also have Caroline T. Schroeder presenting, Taking place in Room A602 (Atrium  Level) – Marriot at 4:00 PM to 6:30 PM.

and

Art and Religions of Antiquity; Violence and Representations of Violence among Jews and Christians will have Christine Luckritz Marquis chairing, Taking place in Room 209 (Level 2) – Hilton on 11/23/2015 at 9:00 AM to 11:30 AM.

We are looking forward to their presentations and hope others are as well!