Discover TEI-encoded documents in public GitHub repositories.
| Last indexed | Repository | Description | Languages | Matching files |
|---|---|---|---|---|
| 23 May 2026 00:03 UTC | dracor-org/poldracor | PolDraCor (Polish Drama Corpus) | - | 1 |
| 23 May 2026 00:03 UTC | distantreading/WG1 | Discussion documents and working papers from WG1 | ita, eng, deu, spa, lat, srp, tur, fra, bul | 58 |
| 23 May 2026 00:03 UTC | dracor-org/fredracor | French Drama Corpus | fra | 2190 |
| 23 May 2026 00:03 UTC | aso2101/satavahana-inscriptions-data | Data for Sātavāhana Inscriptions project | ara, cop, chu, deu, eng, spa, fra, gez, grc, hye, ita, kat, lat, nld, por, rus, sog, syr, pra, san, mar, tel, kan, und | 714 |
| 23 May 2026 00:03 UTC | HuygensING/textmod | Topic Modeling | fra, nld | 1 |
| 23 May 2026 00:03 UTC | sarit/SARIT-corpus | The e-texts of the SARIT project | san, eng, pra | 82 |
| 23 May 2026 00:03 UTC | dracor-org/shakedracor | Folger Shakespeare Library. Shakespeare's Plays from Folger Digital Texts. Transformed for the Dracor pipeline to prepare network analysis. | eng, fra, ita, lat, spa | 38 |
| 20 May 2026 06:56 UTC | yighu/gsword_grails_3 | gsword_grails_3 | - | 1 |
| 18 May 2026 13:13 UTC | UHH-Tamilex/TamilneriVilakkam | - | tam | 1 |
| 18 May 2026 13:13 UTC | blakearchive/data | - | - | 43 |
| 18 May 2026 13:13 UTC | thanneken/tei2edition | Create a scholarly edition with multiple visualizations from a single TEI file | lat | 1 |
| 15 May 2026 01:48 UTC | pulibrary/BlueMountain | Project to digitize avant-garde periodicals | - | 10 |
| 13 May 2026 11:32 UTC | krisgrint/james-mill | XML-encoded transcriptions of James Mill's commonplace books | - | 1 |
| 13 May 2026 11:32 UTC | ouvroir/chronology | A digital museology chronology | - | 2 |
| 13 May 2026 11:32 UTC | UHH-Tamilex/lexicon | The dictionary in progress | eng | 2 |
| 12 May 2026 10:27 UTC | nakagawanatuko/TEI_genji | - | - | 1 |
| 12 May 2026 10:27 UTC | bartnich/Thesis_2024 | Thesis 2024 | - | 2 |
| 12 May 2026 10:27 UTC | slavdict/slavdict_corpus | Corpus of Church Slavonic texts in HIP format, intended for building a dictionary of the Church Slavonic language (all texts taken from www.orthlib.ru) | - | 2 |
| 12 May 2026 10:27 UTC | icaruseu/mom-ca | Monasterium.net (http://www.monasterium.net/mom) - repository and collaborative archive | - | 2 |
| 12 May 2026 10:27 UTC | yitzhtal/digital-humanities-course-assignments | - | - | 1 |
| 12 May 2026 10:27 UTC | lb42/theCellar | archival storage of TEI memorabilia | - | 97 |
| 12 May 2026 10:27 UTC | evaaaaaaa894/fleabagTS.xslt | - | - | 1 |
| 12 May 2026 10:27 UTC | gupett/DD2430 | - | - | 1 |
| 12 May 2026 10:27 UTC | mandellc320/PoetessArchive | All the documents in the Poetess Archive | ara, cop, chu, deu, eng, spa, fra, gez, grc, hye, ita, kat, lat, nld, por, rus, sog, syr | 164 |
| 10 May 2026 15:53 UTC | aurimasv/zotero-import-export-formats | Import/Export Formats supported by Zotero | - | 1 |
| 10 May 2026 15:53 UTC | nebulagazer9/Remediating-Saducismus-triumphatus | Group project | - | 1 |
| 10 May 2026 12:34 UTC | ADHO/dh2016 | Abstracts from the DH2016 conference in Kraków. | - | 431 |
| 10 May 2026 12:34 UTC | ghukill/wsudor-rebus-abc | An annotated digital edition created with Readux | - | 1 |
| 10 May 2026 12:34 UTC | Beth-Mardutho/hugoye-data | Data repository for Hugoye TEI records. | ara, cop, chu, deu, eng, spa, fra, gez, grc, hye, ita, kat, lat, nld, por, rus, sog, syr | 649 |
| 10 May 2026 12:34 UTC | Anterotesis/historical-texts | Collections of English historical texts and data relating to them | lat, eng, sco, fra, cym, frm, roa, deu, ita, mul, zxx, grc, fro, nld, spa, pau | 32851 |
| 08 May 2026 11:16 UTC | aspd-2501/aspd-2501.github.io | - | eng, spa | 1 |
| 08 May 2026 11:16 UTC | daedstudios/read-papers-fast | - | eng | 1 |
| 08 May 2026 11:16 UTC | patchinplace/Barnes | Corpus visualisation of Barnes's unpublished poetry | - | 1 |
| 08 May 2026 11:16 UTC | aap2299/aap2299.github.io | - | - | 1 |
| 08 May 2026 11:16 UTC | CLKCrompton/lglc-practice | - | - | 9 |
| 04 May 2026 10:55 UTC | HannaVarilek/The-Journalist | ENGL-478-878 Digital Archives & Editions Final Project | - | 9 |
| 03 May 2026 18:52 UTC | UHH-Tamilex/Manimekalai | - | tam | 3 |
| 03 May 2026 18:52 UTC | technologies-of-history/dyngley-data | A repository containing the transcriptions and metadata for a digital edition of TCC MS O.8.35. | - | 1 |
| 01 May 2026 01:46 UTC | tthub-repo/ejercicios | Exercises in TTHUB | - | 1 |
| 01 May 2026 01:46 UTC | lb42/tei-fr | Automatically exported from code.google.com/p/tei-fr | frm, lat, ita, spa, deu, sco, mul, ell, oci, eus, nld, dan, heb, jpa, ara, oar, pcd, fra, por, bre, tup, eng, cat, grc, bul, lit, bel, pol, srp, sqi, zho, rus, und, isl, kat, all, cel, non, jpn, xml | 925 |
| 24 Apr 2026 15:16 UTC | jmclawson/DHQ_TM | Digital Humanities Quarterly Topic Modeling | eng, fra, lat, ita | 201 |
| 24 Apr 2026 15:16 UTC | wolfleozhang/Verovio-DrumNotation | Verovio for DrumNotation | eng | 1 |
| 23 Apr 2026 21:46 UTC | mv96/mm_extraction | This repository contains the code and pointer to the trained models to extract proofs and theorems from scientific articles | eng, fra, deu, spa, ita, pol | 3 |
| 17 Apr 2026 00:28 UTC | dracor-org/neolatdracor | Neo-Latin Drama Corpus | - | 1 |
| 02 Mar 2026 10:36 UTC | burchards-dekret-digital/website | - | lat | 2 |
| 12 Dec 2025 00:02 UTC | Anais-Leucht/Frankenstein_template-1- | - | - | 1 |
| 20 Nov 2025 09:10 UTC | megz4/Frankensteins-page | final project for Text as Data course 2024 | - | 10 |
| 25 Aug 2025 19:57 UTC | SusanBrown/LEAF-Writer-demo-texts | Texts for use with the XML and RDF editor of the Linked Editing Academic Framework Virtual Research Environment (LEAF-VRE) found at https://leaf-writer.leaf-vre.org/ | deu, fra, eng | 5 |
| 11 Aug 2025 21:59 UTC | nerd-bible/tanach.us | Unofficial version control for Leningrad Codex transcription. | heb | 2 |
| 01 Aug 2025 17:12 UTC | openbible-io/tanach.us | Version control for https://www.tanach.us/Books/Tanach.xml.zip | heb | 7 |