Tag: KELLIA

Coptic SCRIPTORIUM at the Coptic Congress

Much of the Coptic SCRIPTORIUM team is in Claremont this week for the Congress of the International Association of Coptic Studies.

We started out with a pre-conference, 2-day workshop with our KELLIA partners from Germany, where we worked on sharing data and technologies across digital Coptic projects.  Look here soon for an announcement about a really cool fruit of our labors.

Thursday there are two panels, and Friday there are two workshops.

Thursday 2-4 pm Coptic Digital Studies (Burkle 16)

David Brakke chair 

Prof. Dr. Caroline Schroeder, Coptic SCRIPTORIUM: A Digital Platform for Research in Coptic Language and Literature

Dr. Christine Luckritz Marquis, Reimagining the Apopthegmata Patrum in a Digital Culture

Prof. Amir Zeldes, A Quantitative Approach to Syntactic Alternations in Sahidic

Dr. Rebecca Krawiec, Charting Rhetorical Choices in Shenoute: Abraham our Father and I See Your Eagerness as case-studies

Thursday 4:30-6:30 Coptic Digital Humanities (Burkle 16)

Caroline T. Schroeder, Chair

Dr. Paul Dilley, Coptic Scriptorium beyond the Manuscript: Towards a Distant Reading of Coptic Texts

Mr. So Miyagawa and Dr. Marco Büchler, Computational Analysis of Text Reuse in Shenoute and Besa

Mr. Uwe Sikora, Text Encoding – Opportunities and Challenges

Ms. Eliese-Sophia Lincke, Optical Character Recogition (OCR) for Coptic. Testing Automated Digitization of Texts with OCRopy

 

Friday 11-12:30 Workshop on Coptic Fonts & Coptic Bible (AA)

Christian Askeland, Frank Feder

Friday 4:30-6 Digital Tools for Beginners (Workshop on Coptic SCRIPTORIUM)

Caroline T. Schroeder, Amir Zeldes, Rebecca S. Krawiec

Full, machine-annotated New Testament Corpus updated

We’ve updated and re-released our fully machine-annotated New Testament corpus.  sahidica.nt V2.1.0 contains the Sahidica NT text from Warren Wells Sahidica online NT, with the following features:

  • Annotated with our latest NLP tools (part of speech tagger 1.9, tokenizer 4.1.0, language tagger and lemmatizer include lexical entries from the Database and Dictionary of Greek Loanwords in Coptic (DDGLC))
  • Now contains the morph layer (annotating compound words and Coptic morphs such ⲣⲉϥ- ⲙⲛⲧ- ⲁⲧ-)
  • Visualizations for linguistic analysis

Please keep in mind that this fully machine-annotated corpus is more accurate than previous versions but will nonetheless contain more errors than a corpus manually corrected by a human.

Search and queries

For searches and queries using our ANNIS database to find specific terms, for this corpus we recommend searching the normalized words using regular expressions (to capture instances of the desired word that may still be embedded in a Coptic bound group, instances that our tokenizer may have missed):

Lemma searches are now also possible.  You may wish to search for the lemma using regular expressions, as well, in order to find lemmas of some compound words.  For example, the following search will find entries containing ⲥⲱⲧⲙ in the lemma:

The results include various forms of ⲥⲱⲧⲙ (including ⲥⲟⲧⲙ) lemmatized the lexical entry “ⲥⲱⲧⲙ“, compound words lemmatized to ⲥⲱⲧⲙ or to a lexical entry containing ⲥⲱⲧⲙ, and some bound groups containing the word form ⲥⲱⲧⲙ, which our tokenizer did not catch:

Frequency table of normalized words lemmatized to swtm or a lemma form containing swtm (May 2016 Sahidica corpus)

Frequency table of normalized words lemmatized to ⲥⲱⲧⲙ or a lemma form containing ⲥⲱⲧⲙ (May 2016 Sahidica corpus)

As you can see, most of the hits are accurate (e.g., ⲥⲟⲧⲙ, ⲁⲧⲥⲱⲧⲙ, ⲣⲁⲧⲥⲱⲧⲙ, ⲣⲉϥⲥⲱⲧⲙ); some of the Coptic bound groups did not tokenize properly (e.g., ⲉⲡⲥⲱⲧⲙ, ⲙⲁⲣⲟⲩⲥⲱⲧⲙ).  We expect accuracy to increase as we incorporate more texts into our corpora that have been machine annotated and then manually edited.

Reading by individual chapter

You can also read these documents and see the linguistic analysis visualizations at data.copticscriptorium.org/urn:cts:copticLit:nt.  The first documents you will see (Gospel of Mark, 1 Corinthians) are manually annotated.  Scroll down for “New Testament,” which is the full, machine-annotated Sahidica New Testament.  Click on “Chapter” to read each chapter as normalized Coptic (with English translation as a pop-up when you hover your cursor).  Click on “Analytic” for the normalized Coptic, part of speech analysis, and English translation for each chapter.  Please keep in mind the English translation provided is a free, open-access New Testament translation from the World English Bible; it is not a direct translation from the Coptic.

Note:  we know that our server is slow generating the documents for this corpus.  It may take several minutes to load; please be patient.  For faster access, use ANNIS.  Visualizations to read the chapters are available by clicking on the corpus and the icon for visualizations.

Accessing document visualizations of the Sahidica corpus via ANNIS

Accessing document visualizations of the Sahidica corpus via ANNIS

We hope this corpus is useful to researchers.

Hiring: Digital Humanities Specialist for KELLIA and U Pacific Library

Digital Humanities Specialist at the University of the Pacific

The University of the Pacific seeks to hire a creative and collaborative Digital Humanities Specialist (DHS) to develop and manage strategies and infrastructure for curating digital and pre-digital content and data; provide computer programming support for projects; and author and/or co-author new digital humanities resources or scholarship.  This is a full-time 20-24 month pilot staff position. The DHS will work half-time contributing to the University Library’s archival and digital initiatives and half-time on an interdisciplinary NEH-funded Digital Humanities research project, KELLIA.  The DHS will report to Prof. Caroline T. Schroeder in the Department of Religious Studies and Michael Wurtz, the Head of Special Collections.

[Apply for this position at the University of the Pacific website]

KELLIA (Koptische/Coptic Electronic Language and Literature International Alliance) is an international DH project funded by the NEH and the DFG (Germany) to develop international standards and promote digital scholarship in the language and literature of ancient Egypt.  Researchers at the University of the Pacific, Georgetown University, Goettingen University, and Muenster University will be collaborating on digital methods in textual studies, linguistics, history, and manuscript studies.

The William Knox Holt Memorial Library on the Stockton campus serves a diverse community of liberal arts and professional faculty.  The Holt-Atherton Special Collections is home to several important American cultural heritage collections:  the multimedia archives of jazz legend Dave Brubeck; primary source documents from World War II Japanese-American Internment Camps; the papers of renowned naturalist and conservationist John Muir; and the papers and video archive of former San Francisco Mayor George R. Moscone.

Duties

The Digital Humanities Specialist may perform some but not all of the following duities and/or may be assigned additional duties:

  1. Develops and manages strategies and infrastructure for curating digital humanities content and data.
  2. Authors/co-authors new digital humanities resources or scholarship.
  3. Provides web development and programming for humanities research.
  4. Contributes to original research in digital humanities.
  5. Contributes to planning and decision-making about KELLIA’s technological development and long-term sustainability.
  6. Identifies, recommends, and implements linked open data technologies for humanities research.
  7. Identifies, recommends, and implements digital asset management and digital archiving in the Library.
  8. Participates in archival processing and reference duties in a special collections environment.  
  9. Designs forward-facing, interactive digital initiatives, websites, and/or exhibits.
  10. Provides library and special collections instruction.

 

QUALIFICATIONS:

Education/Work Experience/Certifications:

  • 1) MA in Digital Humanities OR 2) MLIS from an accredited ALA program or MA in Archival Studies with demonstrated digital/technological training/certification OR 3) MA in a Humanities discipline or related field with demonstrated digital/technological training or certification
  • Documented research and/or teaching experience in digital scholarship or pedagogy in a humanities discipline or related field
  • Demonstrated experience in web development and programming for research and/or teaching in the humanities or a related field (including archival studies and library and information science)

Skills/Knowledge and Expertise:

Required skills/knowledge and expertise

  • Excellent interpersonal, presentation, and communication skills
  • Demonstrated expertise in digital humanities technologies of web development (HTML, CSS, PHP, JavaScript), text encoding (XML), and programming (Python, Java)
  • Commitment to open access technologies and data for the humanities or a related field
  • Proven ability to work collaboratively in team-based initiatives
  • Proven ability to contribute to original scholarship in the humanities or a related field
  • Enthusiasm to build international and interdisciplinary research partnerships
  • Proven ability to work successfully with diverse populations and demonstrated commitment to promote and enhance diversity and inclusion
  • Knowledge of ancient languages, while welcome, is not a requirement for this position.

Preferred skills/knowledge and expertise

  • Demonstrated expertise with data curation techniques for a variety of digitized and born-digital media (text, code, images, music, etc.) and tools (e.g., DSpace, EPrints, Fedora, contentDM, etc.)
  • Demonstrated experience with linked data technologies and methodologies (e.g., JSON, RDF)
  • Experience managing CMS and LMS systems
  • Command of archival theory and best practices, especially as they relate to the particular issues posed by born-digital content.  

APPLICATION:

To apply for this position visit https://pacific.peopleadmin.com/postings/5822 and submit:

  • Letter of interest
  • CV
  • Names and contact information for 3 references

Review of applications will begin on September 1.

Questions about the position may be directed to cschroeder@pacific.edu and mwurtz@pacific.edu.  For questions about the online application process, please consult the online help system.

This position is funded by the University of the Pacific Library and the National Endowment for the Humanities (through the joint NEH-DFG bilateral Digital Humanities grant program).

New American-German DH Collaboration: KELLIA

Koptische/Coptic Electronic Language and Literature International Alliance is a collaboration between Coptic SCRIPTORIUM, the Göttingen Coptic Old Testament Project , and other partners. KELLIA has been awarded a joint NEH-DFG bilateral grant for sharing data and technologies and for developing common standards in Coptic DH.

 

Kellia photo

From Les Kellia. Ermitages coptes en Basse Égypte. Genève: Musèe d’art et d’histoire de Genève, 1989

© 2017

Theme by Anders NorenUp ↑