Discover TEI-encoded documents from GitHub public repositories.
| Last indexed | Repository | Description | Languages | Matching files |
|---|---|---|---|---|
| 25 Mar 2021 20:41 UTC | reading-in-the-alps/rita2-data | data processing/cleaning repo | - | 97 |
| 10 Feb 2023 19:45 UTC | gabays/RiseAndFall | RiseAndFall: genres and centroids | fra | 97 |
| 09 May 2022 07:42 UTC | slovotolstogo/TEI | - | - | 97 |
| 09 Jan 2023 06:47 UTC | cschuwey/labruyere_ubs | - | fra, lat | 96 |
| 09 Apr 2022 18:47 UTC | roboioritz/EmotionRecognitionFlaskServer | A Flask Server with multiple emotion related stuff | pol | 96 |
| 19 Feb 2023 13:44 UTC | ldsad7/visual_novels | Visual novels from rusdracor: https://dracor.org/rus | rus | 96 |
| 09 Nov 2022 04:34 UTC | kermitt2/grobid-quantities | GROBID extension for identifying and normalizing physical quantities. | eng | 96 |
| 08 Nov 2022 23:53 UTC | pruizf/rhyme-within-reason | - | spa | 96 |
| 21 Dec 2022 15:46 UTC | HistoryAtState/frus-history | Source data for History of the Foreign Relations Series | - | 96 |
| 30 May 2020 05:30 UTC | sarahfbeller/finalProject | - | lat, eng, deu | 96 |
| 30 May 2020 05:30 UTC | ebalzac/FC | Furne corrigé | fra | 95 |
| 17 Feb 2021 12:48 UTC | bodleian/georgian-mss | Georgian Collection TEI Catalogue | kat, eng, rus, fra, xcl | 95 |
| 02 Mar 2023 10:49 UTC | ucsdltcs180/exercises-week-1-Sumayyatron | exercises-week-1-Sumayyatron created by GitHub Classroom | eng, lat | 95 |
| 13 May 2022 17:07 UTC | CERES-Sorbonne/ImagesDataViz | Several experiments to try to picture the diffusion of images on social networks | fra | 95 |
| 30 May 2020 05:30 UTC | ChiaraPalladino/Digital-Agathemerus | Digital Agathemerus Project: data | grc, lat, ell, eng, fra, deu, nld | 95 |
| 14 Sep 2022 08:58 UTC | stephemi/rc-tei_FIX | - | - | 95 |
| 28 May 2022 21:44 UTC | Migabaj/yarkho | Digital Adaptation of Boris Yarkho's Research on the Works of Pierre Corneille | fra | 95 |
| 30 May 2020 05:30 UTC | oriflamms/Psautiers | - | - | 95 |
| 20 Jan 2023 04:48 UTC | jdmenke/prediabetes_doc_classifier | This relates to my prediabetes classification work on biomedical manuscripts using annotations from prior meta-analyses. | eng | 94 |
| 26 Jun 2022 18:47 UTC | SigiDoc/SigiDoc_Latest | Latest SigiDoc Stylesheet AS - MF | eng, fra, grc, ita, lat, rus, ell, sqi, hye, deu, chu, ara, heb | 94 |
| 26 May 2021 17:58 UTC | spinfo/AvHChrono | Language processing tools to enrich to chronology of Alexander von Humboldt (Data from BBAW) | - | 94 |
| 30 May 2020 05:30 UTC | letiziaricci/tirocinio | - | ita, fra, des | 94 |
| 19 Jan 2021 13:26 UTC | rstarlin/text-processor | A text processor that searches pdf or xml files for a set of known target words. | eng | 93 |
| 30 May 2020 05:30 UTC | EAGLE-BPN/epidocupconversion | XSLT to convert string epigraphic texts in marked up TEI-EPIDOC XML | ara, eng, fra, deu, grc, ell, heb, ita, lat | 93 |
| 30 May 2020 05:30 UTC | mikolajserafin/serafin | Correspondence of Mikołaj Serafin | lat, pol, ita | 93 |
| 04 Apr 2023 16:50 UTC | nevenjovanovic/latty-cts | A CTS version of Tyrolean Neo-Latin texts (Latinitas Tyrolensis) | lat | 92 |
| 27 Jan 2022 01:39 UTC | saurabhsen24/Final_Year_ML_Project | - | - | 92 |
| 30 May 2020 05:30 UTC | araborn/pessoa | - | por, eng | 92 |
| 09 May 2021 20:39 UTC | DHd-Verband/DHd-Abstracts-2017 | - | - | 92 |
| 14 Oct 2021 20:37 UTC | erc-dharma/tfb-daksinakosala-epigraphy | DHARMA project Task Force B, Dakṣiṇa Kosala epigraphic corpus being prepared by Natasja Bosma. | eng, san | 92 |
| 29 Mar 2023 11:45 UTC | TEIC/atop | Another TEI ODD Processor | fra, eng, por | 92 |
| 30 May 2020 05:30 UTC | severinsimmler/figur-dev | WORK IN PROGRESS | - | 90 |
| 30 May 2020 05:30 UTC | IraPS/rusdracor_topic_modeling | Topic Modeling 200 Years of Russian Drama | rus | 90 |
| 08 Jan 2024 11:47 UTC | DigitalLatin/DLL-Stylesheets | Contains a fork of the TEI Stylesheets for transforming TEI XML into various formats. This fork is a customization for the Digital Latin Library's Library of Digital Latin Texts. | eng, lat, ell, srp, isl, cym, dan, lit, fro, heb, sqi, non, slv, ava, fra, deu, spa, ita, kor, zho, x-lap, jpn | 90 |
| 04 Feb 2021 08:42 UTC | phoenix-mossimo/Comprehensive-Coptic-Lexicon-Research-Data | Comprehensive Coptic Lexicon - Research Data | deu, eng, fra, grc, cop | 90 |
| 11 Feb 2023 04:49 UTC | COST-ELTeC/ELTeC-spa | Spanish novels for the European Literary Text Collection (ELTeC) | spa, lat, eng, fra, deu, gig, ita, jpn, ara, glg, nld, cat, eus, grc, nor | 90 |
| 18 Mar 2023 16:50 UTC | galenus-verbatim/galenus_exports | Temporary repository to work Galen XML/TEI | grc, eng, lat | 90 |
| 30 Jul 2022 21:42 UTC | OpenArabicPE/journal_lughat-al-arab | Digital edition (TEI XML) of Anastās Mārī al-Karmalī's monthly journal *Lughat al-ʿArab* (لغة العرب), published in Baghdad, 1911--. | ara | 90 |
| 17 Feb 2023 05:45 UTC | scta-texts/ol96b1 | repo for Peter Auriol Scriptum | lat | 90 |
| 05 Dec 2022 19:44 UTC | clause-bielefeld/keywordscape | KeywordScape - Visual Document Exploration using Contextualized Keyword Embeddings | eng | 89 |
| 25 Sep 2020 12:46 UTC | Vitaliy-1/citParser | - | - | 89 |
| 30 May 2020 05:30 UTC | adunning/bedes-bible | Bede's Bible: An Edition of the Latin Vulgate from the Codex Amiatinus | lat | 89 |
| 30 May 2020 05:30 UTC | DCLP/text-incorporation | working repos for programmatic insertion of texts into xml files | eng, grc | 89 |
| 10 Jun 2022 04:04 UTC | UTKcataloging/civilwar_remediation | Remediation of the American Civil War Collection following migration from TEI to MODS. | - | 89 |
| 30 May 2020 05:30 UTC | seretan/seretan.github.io | - | - | 88 |
| 30 May 2020 05:30 UTC | bpwilcox/bw-projects | A collection of robotics, control, and machine learning relevant projects over the years | - | 88 |
| 29 Dec 2020 17:27 UTC | chsteiner/cantus | - | - | 88 |
| 19 May 2021 04:55 UTC | agile-humanities/ddhi-repository | A development repository for the DDHI Oral History Project. | eng | 88 |
| 30 May 2020 05:30 UTC | ucdh/Roy-Bruce-TEI | - | eng, fra, ara | 87 |
| 19 Apr 2021 01:44 UTC | nishkalavallabhi/LING410X-Spring18 | "Language as Data" course materials | - | 86 |