Discover TEI-encoded documents from GitHub public repositories.
| Last indexed | Repository | Description | Languages | Matching files |
|---|---|---|---|---|
| 30 May 2020 05:30 UTC | Lizfeng/Content-Analysis-2020 | Assignments for Computational Content Analysis 2020 | bul, ces, eng, est, hrv, hun, mkd, pol, ron, rus, sh, slk, slv, srp, ukr | 505 |
| 04 Feb 2023 09:45 UTC | KfNGOe/ferdinand-I-data | - | deu, eng | 506 |
| 30 May 2020 05:30 UTC | oriflamms/PsautierIMS | Data for study of space between words in Psalm 101 | - | 507 |
| 30 May 2020 05:30 UTC | ccl0326/nltk_data | [py] nltk.download() | pol, eng | 508 |
| 30 May 2020 05:30 UTC | JonathanReeve/corpus-SHC-SimpleSHCStandard | A submodule of the Shakespeare His Contemporaries corpus with standardized spelling. | eng, unk | 509 |
| 11 Nov 2022 13:28 UTC | ArchivesNationalesFR/editionTestamentsDePoilus | The TEI files that form the 'Testaments de Poilus' digital edition | fra | 512 |
| 20 Oct 2020 08:37 UTC | katabase/reconciliation | - | fra, eng | 518 |
| 28 Mar 2023 17:48 UTC | OBVIL/mercure-galant | OBVIL, mercure-galant, édition complète | fra | 523 |
| 30 May 2020 05:30 UTC | anuvivn/wd-2 | - | bul, ces, eng, est, hrv, hun, mkd, pol, ron, rus, sh, slk, slv, srp, ukr, fas | 523 |
| 30 May 2020 05:30 UTC | croqueGrec09/KarlstadtTicketMachine | This is a test for -whenever I have time- setting up a Jenkins for GentzApp/KtM-deploy | - | 523 |
| 30 May 2020 05:30 UTC | nevenjovanovic/cts-croala | Convert Neo-Latin XML editions from CroALa to CTS / CITE Architecture | lat | 523 |
| 15 Dec 2020 04:53 UTC | MoizAhmedd/ntlk_data | downloaded ntlk data | bul, ces, eng, est, hrv, hun, mkd, pol, ron, rus, sh, slk, slv, srp, ukr, fas | 523 |
| 03 May 2021 13:00 UTC | Tamarae/Corpus | საქართველოს ეპიგრაფიკული კორპუსი | kat, eng, grc, heb, arc, hye, lat, rus, chu, ara, fra, deu, ell, ita | 525 |
| 14 Sep 2020 16:32 UTC | quadrama/Corpus | The main quadrama corpus | deu | 526 |
| 30 May 2020 05:30 UTC | pingtzuchu/ConfucainClassics | Confucain Classics Project | - | 526 |
| 05 Apr 2023 07:46 UTC | performant-software/mel-website | Melville Electronic Library Website | - | 531 |
| 19 Jan 2024 21:47 UTC | ParthenosWP4/SSK | Development of the Standardization Survival Kit | eng, fra, lat, ell, srp, isl, cym, dan, lit, fro, heb, sqi, non, slv, ava, deu, spa, ita, kor, zho, x-lap, jpn | 536 |
| 25 May 2021 05:16 UTC | nevenjovanovic/croatiae-auctores-latini-textus | XML texts of Croatian Latin authors (published as CroALa digital collection) | lat | 536 |
| 25 Jan 2022 01:44 UTC | Tamarae/ecg-efes | Epigraphic Corpus of Georgian in EFES | kat, grc, hye | 537 |
| 21 Oct 2022 10:57 UTC | PatristicTextArchive/pta_manuscripts | Database of manuscript descriptions | - | 537 |
| 27 May 2022 14:44 UTC | scta-texts/FrMS88 | Francis de Meyronnes Sentences Commentary | lat | 540 |
| 15 Sep 2021 08:40 UTC | lg14/DH-Projekt-Kessler | - | eng | 540 |
| 17 Apr 2026 00:28 UTC | VandyVRC/tcadrt | - | ara, cop, chu, deu, eng, spa, fra, gez, grc, hye, ita, kat, lat, nld, por, rus, sog, syr, zho | 541 |
| 23 Sep 2021 08:41 UTC | OpenArabicPE/journal_al-manar | Digital edition (TEI XML) of Rashīd Riḍā's journal al-Manār (المنار) | ara | 544 |
| 30 May 2020 05:30 UTC | demery/csvify_tei | - | eng | 544 |
| 27 Mar 2023 01:57 UTC | HistoryAtState/frus | Foreign Relations of the United States - TEI XML source files | - | 547 |
| 07 Feb 2023 16:54 UTC | emt-project/emt-transkribus-export | Repo for exporting data from Transkribus | deu | 552 |
| 30 May 2020 05:30 UTC | SIstory/Verlustliste | - | slv, ita, deu, hrv, slk | 552 |
| 10 Mar 2021 04:40 UTC | lguariento/Curious_Travellers | - | eng, ita, fra, lat, cym, gla, grc | 558 |
| 30 May 2020 05:30 UTC | solirom/dlr-data-old | - | ron, lat, fra, srp, ukr, hun, tur, ell, spa, eng, deu, ita, sla, rus, rom, bul, sqi, ces, grc | 563 |
| 30 May 2020 05:30 UTC | aviamble/TestAutomation | - | eng | 565 |
| 30 May 2020 05:30 UTC | LOGARANDES/logardata | - | - | 570 |
| 30 May 2020 05:30 UTC | jcboyd/pykelet | Master thesis 2015 | eng, fra, deu, spa, ita, pol | 574 |
| 26 Mar 2026 06:18 UTC | srophe/syriac-corpus | This is the development repository for The Oxford-BYU Syriac Corpus project. | ara, cop, chu, deu, eng, spa, fra, gez, grc, hye, ita, kat, lat, nld, por, rus, sog, syr | 575 |
| 11 Dec 2022 03:47 UTC | scta-texts/bHY6yh | Geremia da Montagnone Compendium moralium notabilium | lat | 583 |
| 21 Jul 2020 08:31 UTC | arjanski/gregorovius-test | - | - | 587 |
| 09 Mar 2023 22:47 UTC | erc-dharma/tfa-pallava-epigraphy | DHARMA Task Force A Tamil Nadu, South India, Pallava corpus | san, tam, fra, eng | 588 |
| 17 Apr 2026 00:28 UTC | dracor-org/gerdracor | German Drama Corpus | deu | 593 |
| 03 Feb 2022 20:36 UTC | OpenArabicPE/newspaper_al-ittihad-al-uthmani | Bibliographic metadata for the Arabic newspaper *al-Ittiḥād al-ʿUthmānī* (الاتحاد العثماني), published by Aḥmad Ḥasan Ṭabbāra in Beirut, 1908--10 | ara | 595 |
| 22 Sep 2020 20:32 UTC | envomp/2020-Text-Mining | - | - | 596 |
| 30 May 2020 05:30 UTC | Zeta-and-Company/ELTeC-Sets | Data-Sets for Zeta-Project (100 romans for each of 6 languages) | deu, eng, fra, hun, por, slv | 596 |
| 25 Nov 2022 16:57 UTC | BetaMasaheft/Authority-Files | Places, People and Taxonomies for Manuscripts and Works | eng, amh, gez, ara, ita | 602 |
| 07 Dec 2020 17:07 UTC | katabase/DTS | - | fra, eng | 606 |
| 30 May 2020 05:30 UTC | Data-Science-for-Linguists/Native_and_Non-native_English | Katherine Kairis LING 1340 Term Project | eng, ara, bul, zho, ces, dan, nld, est, fin, fra, deu, hun, ita, jpn, kor, lat, lav, lit, mlt, nor, pol, por, ron, rus, slk, slv, spa, swe, tur, und | 609 |
| 30 May 2020 05:30 UTC | sintakticniSladkorcek/slovenski_parlament | Interesting facts about the sessions of the Slovenian Parliament between the years 1990 and 1992 collected in one place. | slv | 609 |
| 30 May 2020 05:30 UTC | grmek/oo-projekt-1 | - | slv, eng | 610 |
| 30 May 2020 05:30 UTC | Gucekpuhar/parlament | - | slv, eng | 610 |
| 30 May 2020 05:30 UTC | evelyne96/PresentationGen | - | eng, ita | 611 |
| 30 May 2020 05:30 UTC | csae8092/dhd-boas-data | dhd-boas-data stands for DHd Book of Abstracts Data and is a first attempt to collectivetly collect the data of the book of abstracts of the past and current yearly DHd conferences. | eng, deu, fra, ita, spa, por, afr, pol, cat, ind, nld, slv, dan, nor, ron, swe, ces, lit, som, est, swa, vie | 615 |
| 12 Apr 2021 08:48 UTC | calzada/PARLAMINT-ES-MC | - | spa, eng | 619 |