Discover TEI-encoded documents from GitHub public repositories.
| Last indexed | Repository | Description | Languages | Matching files |
|---|---|---|---|---|
| 25 Nov 2022 16:57 UTC | BetaMasaheft/Authority-Files | Places, People and Taxonomies for Manuscripts and Works | eng, amh, gez, ara, ita | 602 |
| 30 May 2020 05:30 UTC | Zeta-and-Company/ELTeC-Sets | Data-Sets for Zeta-Project (100 romans for each of 6 languages) | deu, eng, fra, hun, por, slv | 596 |
| 22 Sep 2020 20:32 UTC | envomp/2020-Text-Mining | - | - | 596 |
| 03 Feb 2022 20:36 UTC | OpenArabicPE/newspaper_al-ittihad-al-uthmani | Bibliographic metadata for the Arabic newspaper *al-Ittiḥād al-ʿUthmānī* (الاتحاد العثماني), published by Aḥmad Ḥasan Ṭabbāra in Beirut, 1908--10 | ara | 595 |
| 30 Mar 2023 19:46 UTC | dracor-org/gerdracor | German Drama Corpus | deu | 593 |
| 09 Mar 2023 22:47 UTC | erc-dharma/tfa-pallava-epigraphy | DHARMA Task Force A Tamil Nadu, South India, Pallava corpus | san, tam, fra, eng | 588 |
| 21 Jul 2020 08:31 UTC | arjanski/gregorovius-test | - | - | 587 |
| 11 Dec 2022 03:47 UTC | scta-texts/bHY6yh | Geremia da Montagnone Compendium moralium notabilium | lat | 583 |
| 26 Mar 2026 06:18 UTC | srophe/syriac-corpus | This is the development repository for The Oxford-BYU Syriac Corpus project. | ara, cop, chu, deu, eng, spa, fra, gez, grc, hye, ita, kat, lat, nld, por, rus, sog, syr | 575 |
| 30 May 2020 05:30 UTC | jcboyd/pykelet | Master thesis 2015 | eng, fra, deu, spa, ita, pol | 574 |
| 30 May 2020 05:30 UTC | LOGARANDES/logardata | - | - | 570 |
| 30 May 2020 05:30 UTC | aviamble/TestAutomation | - | eng | 565 |
| 30 May 2020 05:30 UTC | solirom/dlr-data-old | - | ron, lat, fra, srp, ukr, hun, tur, ell, spa, eng, deu, ita, sla, rus, rom, bul, sqi, ces, grc | 563 |
| 10 Mar 2021 04:40 UTC | lguariento/Curious_Travellers | - | eng, ita, fra, lat, cym, gla, grc | 558 |
| 07 Feb 2023 16:54 UTC | emt-project/emt-transkribus-export | Repo for exporting data from Transkribus | deu | 552 |
| 30 May 2020 05:30 UTC | SIstory/Verlustliste | - | slv, ita, deu, hrv, slk | 552 |
| 27 Mar 2023 01:57 UTC | HistoryAtState/frus | Foreign Relations of the United States - TEI XML source files | - | 547 |
| 23 Sep 2021 08:41 UTC | OpenArabicPE/journal_al-manar | Digital edition (TEI XML) of Rashīd Riḍā's journal al-Manār (المنار) | ara | 544 |
| 30 May 2020 05:30 UTC | demery/csvify_tei | - | eng | 544 |
| 29 Aug 2025 17:58 UTC | VandyVRC/tcadrt | - | ara, cop, chu, deu, eng, spa, fra, gez, grc, hye, ita, kat, lat, nld, por, rus, sog, syr, zho | 541 |
| 27 May 2022 14:44 UTC | scta-texts/FrMS88 | Francis de Meyronnes Sentences Commentary | lat | 540 |
| 15 Sep 2021 08:40 UTC | lg14/DH-Projekt-Kessler | - | eng | 540 |
| 25 Jan 2022 01:44 UTC | Tamarae/ecg-efes | Epigraphic Corpus of Georgian in EFES | kat, grc, hye | 537 |
| 21 Oct 2022 10:57 UTC | PatristicTextArchive/pta_manuscripts | Database of manuscript descriptions | - | 537 |
| 25 May 2021 05:16 UTC | nevenjovanovic/croatiae-auctores-latini-textus | XML texts of Croatian Latin authors (published as CroALa digital collection) | lat | 536 |
| 19 Jan 2024 21:47 UTC | ParthenosWP4/SSK | Development of the Standardization Survival Kit | eng, fra, lat, ell, srp, isl, cym, dan, lit, fro, heb, sqi, non, slv, ava, deu, spa, ita, kor, zho, x-lap, jpn | 536 |
| 05 Apr 2023 07:46 UTC | performant-software/mel-website | Melville Electronic Library Website | - | 531 |
| 30 May 2020 05:30 UTC | pingtzuchu/ConfucainClassics | Confucain Classics Project | - | 526 |
| 14 Sep 2020 16:32 UTC | quadrama/Corpus | The main quadrama corpus | deu | 526 |
| 03 May 2021 13:00 UTC | Tamarae/Corpus | საქართველოს ეპიგრაფიკული კორპუსი | kat, eng, grc, heb, arc, hye, lat, rus, chu, ara, fra, deu, ell, ita | 525 |
| 30 May 2020 05:30 UTC | croqueGrec09/KarlstadtTicketMachine | This is a test for -whenever I have time- setting up a Jenkins for GentzApp/KtM-deploy | - | 523 |
| 30 May 2020 05:30 UTC | nevenjovanovic/cts-croala | Convert Neo-Latin XML editions from CroALa to CTS / CITE Architecture | lat | 523 |
| 30 May 2020 05:30 UTC | anuvivn/wd-2 | - | bul, ces, eng, est, hrv, hun, mkd, pol, ron, rus, sh, slk, slv, srp, ukr, fas | 523 |
| 28 Mar 2023 17:48 UTC | OBVIL/mercure-galant | OBVIL, mercure-galant, édition complète | fra | 523 |
| 15 Dec 2020 04:53 UTC | MoizAhmedd/ntlk_data | downloaded ntlk data | bul, ces, eng, est, hrv, hun, mkd, pol, ron, rus, sh, slk, slv, srp, ukr, fas | 523 |
| 20 Oct 2020 08:37 UTC | katabase/reconciliation | - | fra, eng | 518 |
| 11 Nov 2022 13:28 UTC | ArchivesNationalesFR/editionTestamentsDePoilus | The TEI files that form the 'Testaments de Poilus' digital edition | fra | 512 |
| 30 May 2020 05:30 UTC | JonathanReeve/corpus-SHC-SimpleSHCStandard | A submodule of the Shakespeare His Contemporaries corpus with standardized spelling. | eng, unk | 509 |
| 30 May 2020 05:30 UTC | ccl0326/nltk_data | [py] nltk.download() | pol, eng | 508 |
| 30 May 2020 05:30 UTC | oriflamms/PsautierIMS | Data for study of space between words in Psalm 101 | - | 507 |
| 04 Feb 2023 09:45 UTC | KfNGOe/ferdinand-I-data | - | deu, eng | 506 |
| 30 Sep 2022 21:51 UTC | srophe/srophe-xQueries | xQuery scripts written for use with Syriaca.org data (not bundled with the eXist app) | grc, lat, syr, eng, ara, fra, deu | 505 |
| 30 May 2020 05:30 UTC | Lizfeng/Content-Analysis-2020 | Assignments for Computational Content Analysis 2020 | bul, ces, eng, est, hrv, hun, mkd, pol, ron, rus, sh, slk, slv, srp, ukr | 505 |
| 30 Mar 2023 21:46 UTC | CDRH/data_teaa | Data Repository for To Enter Africa from America | fra, eng | 504 |
| 08 Jul 2021 20:36 UTC | fflah/reksis | - | - | 502 |
| 04 Jul 2020 08:31 UTC | OSH-2020/GDBFS | x-code-nowww created by GitHub Classroom | - | 502 |
| 18 Oct 2021 08:41 UTC | lin380/tadr | Text as Data Resources | - | 501 |
| 30 May 2020 05:30 UTC | bxie/ai2_analysis | Data Analysis for App Inventor | - | 501 |
| 30 May 2020 05:29 UTC | jorfsson/chatbot | Chatbot practice | - | 501 |
| 30 May 2020 05:30 UTC | vikramraodp/virginia | - | - | 501 |