TEIhub
Discover TEI-encoded documents from GitHub public repositories.

Last indexed Repository Description Languages Matching files
11 Mar 2023 13:02 UTC christopheparisse/​evalang Données partagées pour le projet Evalang fra, eng 6403
09 Feb 2023 08:49 UTC telota/​jean_​paul_​briefe Daten der digitalen Edition "Jean Paul – Sämtliche Briefe digital" - 6437
30 May 2020 05:30 UTC centre-​for-​humanities-​computing/​grundtvig-​data Data repository for all data related to the grundtvig center dan 6551
25 Feb 2023 09:45 UTC pruizf/​disco Diachronic Spanish Sonnet Corpus. Canonical and minor authors in Spanish (Europe, America and Asia): 15th to 19th century spa 6616
30 May 2020 05:30 UTC uvalib/​ead-​utils Tools used to process and ingest EAD xml finding aids into the repository and solr. eng, spa, fra 6687
30 May 2020 05:30 UTC srophe/​persons Public Respository for Syriaca persons projects, including authorities, hagiography, and prosopography syr, eng, ara, fra, deu, lat, rus, ita, ell, grc 6711
30 May 2020 05:30 UTC giladax/​digi-​proj-​GUI - heb 6854
30 May 2020 05:30 UTC providedh/​ACDH_​Salzburg_​recipes Parser for the XML Recipes deu 7038
30 May 2020 05:30 UTC ananana/​scientific_​authorship_​data - eng 7195
30 May 2020 05:30 UTC MorielV/​Digital-​Humanities-​-​-​Ass2 parsing song lyrics in python. heb 7202
30 May 2020 05:30 UTC Tech-​Leaderboard/​nips_​scraper Scrape from https://papers.nips.cc/ eng, som, spa, sqi, dan, por 7234
12 Aug 2021 12:58 UTC opentorah/​alter-​rebbe.​org Digital archive of the early history of Chabad; 19 Kislev Archive. rus, heb, deu, pol, fra, lit, lat, yid 7267
28 Dec 2022 10:45 UTC dsldk/​diplomatarium-​danicum Data sources for Diplomatarium Danicum lat, gmh, xda, eng, deu, nld, xno, fra, gml, nil, gda, sme, dan, reg, swe, dum 7474
08 Feb 2023 06:49 UTC dsldk/​herman-​bang Data til projektet Herman Bangs breve. dan, fra 7789
30 May 2020 05:30 UTC swift-​poems-​project/​swift-​transcripts Transcripts for the Swift Poems Project eng 7835
30 May 2020 05:30 UTC OpenGreekAndLatin/​Teubner3-​grc-​dev - - 7946
30 May 2020 05:30 UTC grtkachenko/​SimpleSearchEngine - - 7972
03 Oct 2022 09:03 UTC hlapin/​eRabbinica - eng, heb, cop, ara 8033
05 Apr 2023 13:11 UTC Handrit/​Manuscripts Icelandic Manuscript descriptions using TEI P5 isl, lat, nor, dan, eng, deu, non, swe, nds, ita, dum, nld 8166
30 May 2020 05:30 UTC hinrikur/​MLT201F-​stylometry Stylometric analysis of short anonymous texts in Icelandic. - 8254
30 May 2020 05:30 UTC swift-​poems-​project/​tei-​transcripts - eng 8669
28 Aug 2020 08:32 UTC srophe/​draft-​data Repository for TEI records in development. eng, zho, ara, syr, fra, lat, deu, rus, cop, chu, spa, gez, grc, hye, ita, kat, nld, por, sog, ell 8684
30 May 2020 05:30 UTC christiancasey/​iip-​word-​lists Python utility for creating word lists from epidoc files arc, grc, heb, lat 8729
30 May 2020 05:30 UTC cltk/​chinese_​text_​cbeta_​01 Chinese Buddhist scriptures from CBETA san, zho, eng 8749
30 May 2020 05:30 UTC IraPS/​Tolstoy_​letters_​and_​diaries - - 8811
30 May 2020 05:30 UTC cltk/​chinese_​text_​cbeta_​02 Chinese Buddhist scriptures from CBETA pli, san, eng, zho, x-unknown 8982
30 May 2020 05:30 UTC BD2K-​Aztec/​Aztec-​TextSummarizing - eng, nor 9929
30 May 2020 05:30 UTC uvalib/​valleyshadow Migration of valleyshadow.lib from Tomcat/Coccoon to Go eng 10117
05 Apr 2023 11:45 UTC bodleian/​medieval-​mss Medieval Manuscripts in Oxford Libraries: TEI catalogue descriptions lat, eng, ita, deu, fra, nld, grc, spa, zxx, gle, cym, ell, cor, chu, cat, ces, heb, pro, por, cop, egy, ara, rus, isl, fry, und, cai, gmh, goh, ang, enm, xno 10595
30 May 2020 05:30 UTC jawalsh/​tei_​text - eng, fra, deu, lat, grc, spa, afr, ita, nld, por, heb, hin, ara, gai, haw, ota, nai, sco, tur, zho, nor, pol, rus, ell, msa, ton, arc, ang, tam, tah, jpn, gae, rom, sve, alg, urd, dan, fij, isl, pli, cym 10915
02 Apr 2023 19:45 UTC Brown-​University-​Library/​iip-​texts IIP inscriptions encoded in Epidoc XML and supporting files arc, grc, heb, lat, phn, kat, syc, xcl 11011
31 Dec 2020 08:59 UTC welfare-​state-​analytics/​riksdagen-​corpus-​old Preprocess the proceedings of the Swedish parliament eng, slv 11208
14 May 2022 17:45 UTC srophe/​srophe-​app-​data Repository for Syriaca.org TEI data, used by srophe-eXist-app. ara, cop, chu, deu, eng, spa, fra, gez, grc, hye, ita, kat, lat, nld, por, rus, sog, syr, ell 12026
20 Jan 2023 13:45 UTC welfare-​state-​analytics/​riksdagen-​corpus - eng, slv 12030
15 Jan 2023 05:43 UTC tolstoydigital/​TEI All of Tolstoy in TEI/XML rus 12119
30 May 2020 05:30 UTC lascivaroma/​pompei-​inscriptions Corpus of Inscriptions from Pompei, including graffitis. lat 12209
20 Apr 2021 12:59 UTC rism-​ch/​onstage-​tei TEI files fed into onstage - 12548
30 May 2020 05:30 UTC CivilWarGovernorsOfKentucky/​TestDocuments Repository for testing automatic Github integration - 12776
30 Mar 2023 19:46 UTC BetaMasaheft/​Persons Authority files for each person eng, gez, fra, ara, heb, grc, cop, por, ita, rus, amh, deu, tir, lat, syr 13765
05 Apr 2023 16:55 UTC fihristorg/​fihrist-​mss Fihrist TEI Catalogue ara, fas, jpr, pus, und, tur, urd, eng, pan, heb, ota, hin, kur, deu, lat, chg, fra, gez, syr, syc, cop, grc, map, snd, san, mal, jrb, uig, prs, ave, ita, swa, msa, por, hau, inc, amh, zxx 13967
30 May 2020 05:30 UTC TEI-​EAJ/​auto_​aozora_​tei 青空文庫テキストのTEI自動化プロジェクト - 15156
30 May 2020 05:30 UTC obdurodon/​CollateOS Machine-assisted collation and alignment of diplomatic transcriptions of medieval Slavic manuscripts - 15872
05 Apr 2023 10:47 UTC BetaMasaheft/​Manuscripts Manuscripts descriptions eng, gez, amh, ara, tir, lat, ita, ces, deu, fra, heb, cop, grc, rus, syr, spa, hye 16260
21 Jun 2022 15:49 UTC acdh-​oeaw/​schnitzler-​tagebuch-​data Quelldaten zum Tagebuch (1879–1931) von Arthur Schnitzler - 16426
11 Oct 2021 20:37 UTC cbeta-​git/​CBR2X-​XML CBETA CBReader 2X XML eng, zho, pli, san, x-unknown 20392
30 May 2020 05:30 UTC BetaMasaheft/​coordinates Records with coordinates from previous gazetteer. Spelling and transcription rules not usable eng, gez 21846
23 Jun 2021 20:36 UTC sul-​dlss/​dlme-​metadata Harvested metadata for the Digital Library of the Middle East project ara, heb, jrb, syc, arc, jpr, lad, grc, lat, fas, yej, yid 23744
30 May 2020 05:30 UTC pruizf/​disco-​ms Scripts to reproduce results of our DSH paper about the DISCO corpus spa 24605
06 Apr 2023 05:46 UTC CivilWarGovernorsOfKentucky/​Documents CWGK Documents in TEI-XML Format - 24956
05 Dec 2022 20:45 UTC DARIAH-​SI/​siParl Slovenian parliamentary corpus slv, eng 25471