TEIhub
Discover TEI-encoded documents from GitHub public repositories.

| Last indexed | Repository | Description | Languages | Matching files |
| --- | --- | --- | --- | --- |
| 25 Nov 2022 16:57 UTC | BetaMasaheft/Authority-Files | Places, People and Taxonomies for Manuscripts and Works | eng, amh, gez, ara, ita | 602 |
| 30 May 2020 05:30 UTC | Zeta-and-Company/ELTeC-Sets | Data-Sets for Zeta-Project (100 romans for each of 6 languages) | deu, eng, fra, hun, por, slv | 596 |
| 22 Sep 2020 20:32 UTC | envomp/2020-Text-Mining | - | - | 596 |
| 03 Feb 2022 20:36 UTC | OpenArabicPE/newspaper_al-ittihad-al-uthmani | Bibliographic metadata for the Arabic newspaper *al-Ittiḥād al-ʿUthmānī* (الاتحاد العثماني), published by Aḥmad Ḥasan Ṭabbāra in Beirut, 1908--10 | ara | 595 |
| 30 Mar 2023 19:46 UTC | dracor-org/gerdracor | German Drama Corpus | deu | 593 |
| 09 Mar 2023 22:47 UTC | erc-dharma/tfa-pallava-epigraphy | DHARMA Task Force A Tamil Nadu, South India, Pallava corpus | san, tam, fra, eng | 588 |
| 21 Jul 2020 08:31 UTC | arjanski/gregorovius-test | - | - | 587 |
| 11 Dec 2022 03:47 UTC | scta-texts/bHY6yh | Geremia da Montagnone Compendium moralium notabilium | lat | 583 |
| 26 Mar 2026 06:18 UTC | srophe/syriac-corpus | This is the development repository for The Oxford-BYU Syriac Corpus project. | ara, cop, chu, deu, eng, spa, fra, gez, grc, hye, ita, kat, lat, nld, por, rus, sog, syr | 575 |
| 30 May 2020 05:30 UTC | jcboyd/pykelet | Master thesis 2015 | eng, fra, deu, spa, ita, pol | 574 |
| 30 May 2020 05:30 UTC | LOGARANDES/logardata | - | - | 570 |
| 30 May 2020 05:30 UTC | aviamble/TestAutomation | - | eng | 565 |
| 30 May 2020 05:30 UTC | solirom/dlr-data-old | - | ron, lat, fra, srp, ukr, hun, tur, ell, spa, eng, deu, ita, sla, rus, rom, bul, sqi, ces, grc | 563 |
| 10 Mar 2021 04:40 UTC | lguariento/Curious_Travellers | - | eng, ita, fra, lat, cym, gla, grc | 558 |
| 07 Feb 2023 16:54 UTC | emt-project/emt-transkribus-export | Repo for exporting data from Transkribus | deu | 552 |
| 30 May 2020 05:30 UTC | SIstory/Verlustliste | - | slv, ita, deu, hrv, slk | 552 |
| 27 Mar 2023 01:57 UTC | HistoryAtState/frus | Foreign Relations of the United States - TEI XML source files | - | 547 |
| 23 Sep 2021 08:41 UTC | OpenArabicPE/journal_al-manar | Digital edition (TEI XML) of Rashīd Riḍā's journal al-Manār (المنار) | ara | 544 |
| 30 May 2020 05:30 UTC | demery/csvify_tei | - | eng | 544 |
| 29 Aug 2025 17:58 UTC | VandyVRC/tcadrt | - | ara, cop, chu, deu, eng, spa, fra, gez, grc, hye, ita, kat, lat, nld, por, rus, sog, syr, zho | 541 |
| 27 May 2022 14:44 UTC | scta-texts/FrMS88 | Francis de Meyronnes Sentences Commentary | lat | 540 |
| 15 Sep 2021 08:40 UTC | lg14/DH-Projekt-Kessler | - | eng | 540 |
| 25 Jan 2022 01:44 UTC | Tamarae/ecg-efes | Epigraphic Corpus of Georgian in EFES | kat, grc, hye | 537 |
| 21 Oct 2022 10:57 UTC | PatristicTextArchive/pta_manuscripts | Database of manuscript descriptions | - | 537 |
| 25 May 2021 05:16 UTC | nevenjovanovic/croatiae-auctores-latini-textus | XML texts of Croatian Latin authors (published as CroALa digital collection) | lat | 536 |
| 19 Jan 2024 21:47 UTC | ParthenosWP4/SSK | Development of the Standardization Survival Kit | eng, fra, lat, ell, srp, isl, cym, dan, lit, fro, heb, sqi, non, slv, ava, deu, spa, ita, kor, zho, x-lap, jpn | 536 |
| 05 Apr 2023 07:46 UTC | performant-software/mel-website | Melville Electronic Library Website | - | 531 |
| 30 May 2020 05:30 UTC | pingtzuchu/ConfucainClassics | Confucain Classics Project | - | 526 |
| 14 Sep 2020 16:32 UTC | quadrama/Corpus | The main quadrama corpus | deu | 526 |
| 03 May 2021 13:00 UTC | Tamarae/Corpus | საქართველოს ეპიგრაფიკული კორპუსი | kat, eng, grc, heb, arc, hye, lat, rus, chu, ara, fra, deu, ell, ita | 525 |
| 30 May 2020 05:30 UTC | croqueGrec09/KarlstadtTicketMachine | This is a test for -whenever I have time- setting up a Jenkins for GentzApp/KtM-deploy | - | 523 |
| 30 May 2020 05:30 UTC | nevenjovanovic/cts-croala | Convert Neo-Latin XML editions from CroALa to CTS / CITE Architecture | lat | 523 |
| 30 May 2020 05:30 UTC | anuvivn/wd-2 | - | bul, ces, eng, est, hrv, hun, mkd, pol, ron, rus, sh, slk, slv, srp, ukr, fas | 523 |
| 28 Mar 2023 17:48 UTC | OBVIL/mercure-galant | OBVIL, mercure-galant, édition complète | fra | 523 |
| 15 Dec 2020 04:53 UTC | MoizAhmedd/ntlk_data | downloaded ntlk data | bul, ces, eng, est, hrv, hun, mkd, pol, ron, rus, sh, slk, slv, srp, ukr, fas | 523 |
| 20 Oct 2020 08:37 UTC | katabase/reconciliation | - | fra, eng | 518 |
| 11 Nov 2022 13:28 UTC | ArchivesNationalesFR/editionTestamentsDePoilus | The TEI files that form the 'Testaments de Poilus' digital edition | fra | 512 |
| 30 May 2020 05:30 UTC | JonathanReeve/corpus-SHC-SimpleSHCStandard | A submodule of the Shakespeare His Contemporaries corpus with standardized spelling. | eng, unk | 509 |
| 30 May 2020 05:30 UTC | ccl0326/nltk_data | [py] nltk.download() | pol, eng | 508 |
| 30 May 2020 05:30 UTC | oriflamms/PsautierIMS | Data for study of space between words in Psalm 101 | - | 507 |
| 04 Feb 2023 09:45 UTC | KfNGOe/ferdinand-I-data | - | deu, eng | 506 |
| 30 Sep 2022 21:51 UTC | srophe/srophe-xQueries | xQuery scripts written for use with Syriaca.org data (not bundled with the eXist app) | grc, lat, syr, eng, ara, fra, deu | 505 |
| 30 May 2020 05:30 UTC | Lizfeng/Content-Analysis-2020 | Assignments for Computational Content Analysis 2020 | bul, ces, eng, est, hrv, hun, mkd, pol, ron, rus, sh, slk, slv, srp, ukr | 505 |
| 30 Mar 2023 21:46 UTC | CDRH/data_teaa | Data Repository for To Enter Africa from America | fra, eng | 504 |
| 08 Jul 2021 20:36 UTC | fflah/reksis | - | - | 502 |
| 04 Jul 2020 08:31 UTC | OSH-2020/GDBFS | x-code-nowww created by GitHub Classroom | - | 502 |
| 18 Oct 2021 08:41 UTC | lin380/tadr | Text as Data Resources | - | 501 |
| 30 May 2020 05:30 UTC | bxie/ai2_analysis | Data Analysis for App Inventor | - | 501 |
| 30 May 2020 05:29 UTC | jorfsson/chatbot | Chatbot practice | - | 501 |
| 30 May 2020 05:30 UTC | vikramraodp/virginia | - | - | 501 |