TEIhub
Discover TEI-encoded documents from GitHub public repositories.

Last indexed Repository Description Languages Matching files
30 May 2020 05:30 UTC Formulae-​Litterae-​Chartae/​formulae-​open Public-domain texts from the Formulae - Litterae - Chartae Project deu, lat 2691
30 May 2020 05:30 UTC wvbe/​shakespeare-​to-​the-​max - eng, ang, ces, lat, fra, ell, deu, grc, ara, nld, grk, heb, ita, spa, swe, tur, enm, gmh, cym 2723
30 May 2020 05:30 UTC conail/​wordtree The Word Tree (Coventry) - 2761
15 Jan 2021 05:19 UTC lehkost/​ToolXtractor Extract tools from TEI-encoded abstracts against a matching list. eng, ita, fra, spa, som, lat 2791
30 May 2020 05:30 UTC CDRH/​cocoon_​encyclopedia Encyclopedia of the Great Plains - 2796
30 May 2020 05:30 UTC horace-​qiao/​usep-​data - lat, grc 2820
30 May 2020 05:30 UTC spapando/​usep-​dataxx - lat, grc 2841
02 Feb 2023 22:44 UTC whitmanarchive/​whitman-​scribal Data Repo | Whitman Scribal TEI - 2947
30 May 2020 05:30 UTC lascivaroma/​cil004-​03000 Set of inscriptions from CIL and Pompei. CIL 4 3000 to 5999 lat 2968
11 Feb 2021 01:15 UTC Princeton-​CDH/​bluemountain-​transcriptions TEI-encoded transcriptions of Blue Mountain materials. fra, dan, ita, deu, rus, ces, eng, pol, swe 2977
19 Dec 2022 14:44 UTC faustedition/​faust-​xml XML and other source data of the Faustedition - 2997
30 May 2020 05:30 UTC sologebre/​Works - eng, ara, ita, gez, grc, amh, kat, pal, syr, cop, lat 2998
30 May 2020 05:30 UTC Mitch-​C/​Colenso-​Project - - 3006
30 May 2020 05:30 UTC MisterTJB/​Colenso-​Project A web application providing access to documents forming the Colenso Project – a repository of New Zealand letters encoded with TEI - 3006
30 May 2020 05:30 UTC JEMHcorpus/​corpora Repository for releases of corpora in JEMH - 3055
30 May 2020 05:30 UTC Dans-​labs/​annotation-​paradigm Save queries as annotations. Demo for the WIVU database of the Hebrew Bible - 3090
30 May 2020 05:30 UTC cwulfman/​bluemountain-​transcriptions TEI-encoded transcriptions of Blue Mountain materials. fra, dan, ita, deu, rus, ces, eng, pol, swe 3094
11 Apr 2023 10:46 UTC abaevdict/​abaevdict-​tei The TEI version of the Abaev dictionary rus, eng, oss, x-oldirn, xpr, kat, ira, ine, trk, fas, tgk, mon, sva, kur, che, abk, abq, grc, kjj, bbl, ava, kbd, lez, lbe, xln, hun, ukr, non, ofs, nor, swe, dan, lat, sqj, deu, inc, yai, sla, x-balochi, ang, gmh, lit, got, x-pamir, goh, wbl, kho, fin, iir, san, ave, pal, sog, xcl, xsc, chu, peo, pus, zza, ita, fra, agx, ara, krc, nog, kum, dar, aqc, tkr, inh, lzz, bre, cor, cym, zkh, smy, oru, prc, mnj, sgh, srh, yah, sgy, xto, txb, sga, hit, qwm, tur, uzb, khw, isk, gml, nld, spa, oci, lav, orv, prg, gle, kom, ady, rom, gem, x-tchr, osx, xmf, ces, mis, ydg, udm, kdr, ccs, uby, pli, x-mordvin, x-rushani, tly, ell, kca, mns, tab, xtq, arc, rut, oos, cau, gbz, alt, ysc, wne, smi, zho, jpn, kap, ani, chv, slv, x-vaynakh, ohu, ckb, x-nuristan, mga, aze, bsk, bul, hbo, udi, hac, xme, pol, xbc, xbo, x-sarm, bat, ddo, bel, est, fro, brh, sh, hin, krl, elx, vep, tat, sah, kaz, akk, ron, txh, ldd, chm, tam, kir, cel, bua, uig, ett, zkz, xld, urd, ben, nep, paq, pan, hau, aer, x-dardic 3098
26 Jan 2023 11:46 UTC nakamura196/​saji - - 3114
30 May 2020 05:30 UTC DHSinology/​Kanripo-​data Kanripo data in XML format - 3131
30 May 2020 05:30 UTC pingtzuchu/​KanripoXML an XML version of Kanripo with eXist-db - 3131
02 Dec 2021 04:49 UTC chartes/​encpos Sources XML/TEI des positions des thèses de l’École des chartes fra 3281
24 Mar 2023 02:00 UTC Brown-​University-​Library/​usep-​data inscriptions and related data files for 'http://library.brown.edu/projects/usep/' lat, grc, phn, und 3323
30 May 2020 05:30 UTC lascivaroma/​cil004-​00000 Set of inscriptions from CIL and Pompei. CIL 4 0 to 2999 lat 3360
30 May 2020 05:29 UTC lascivaroma/​cil004-​06000 Set of inscriptions from CIL and Pompei. CIL 4 6000 to 8999 lat 3381
13 Oct 2021 08:41 UTC redewiedergabe/​corpus a corpus annotated for speech, thought and writing representation - 3599
30 May 2020 05:30 UTC OpenGreekAndLatin/​patrologia_​latina-​dev Machine-corrected versions of selections of the Patrologia Latina. grc, lat, heb 3620
08 Jan 2025 21:52 UTC KislakCenter/​VisColl Modeling and visualizing physical manuscript collation eng 3623
30 May 2020 05:30 UTC olivierjacquot/​patrologia_​latina-​dev - grc, lat, heb, ell 3646
14 Aug 2020 12:32 UTC ebeshero/​newtFire-​webDev for development of the newtFire website at http://newtfire.org eng, ita, fra, lat 3699
30 May 2020 05:30 UTC wsalesky/​srophe-​eXist-​app eXist code for Syriaca.org: The Syriac Reference Portal eng, ara, syr, fra, lat, deu 3763
30 Aug 2021 04:49 UTC CDRH/​data_​civilwardc Data Repository for Civil War Washington eng 3778
01 Aug 2022 18:56 UTC CDRH/​data_​oscys Data Repository for OSCYS project files (TEI, CSV, RDF, etc). Follows CDRH Solr API standards. - 3786
30 May 2020 05:30 UTC heinp/​germarxcor German corpus of marxist texts. deu 3797
30 May 2020 05:30 UTC XQueryInstitute/​Course-​Materials Course Materials for the XQuery Institute eng, fra, ita, lat, spa, ara, syr, deu 3804
30 May 2020 05:30 UTC TEIC/​Hackathon Scripts, data and results for TEI Hackathon - 3807
30 May 2020 05:30 UTC jcowey/​apd.​data Setting up repo for arabic data fra, eng, deu, ita, spa, lat, ell 3846
07 Jun 2024 15:49 UTC kermitt2/​grobid A machine learning software for extracting information from scholarly documents eng, fra, deu, spa, ita, pol 3860
20 Mar 2021 17:13 UTC CDRH/​data_​neihardt Data Repository for Neihardt - 3943
30 May 2020 05:30 UTC TitasNandi/​Summer_​Project This repository has the Answer Selection Challenge Project eng, fra, deu, spa, ita, pol 3947
30 May 2020 05:30 UTC ddbc/​CBETA_​TAFxml 以「量化語言統計」為目的之簡化版漢文佛典 XML,由 CBETA XML P5 (https://github.com/cbeta-org/xml-p5.git) 經程式轉出。 - 3986
30 May 2020 05:30 UTC lmcnulty/​iip-​word-​lists A word list constructor and viewer for the IIP project arc, grc, heb, lat 4022
24 Nov 2021 01:39 UTC antonkarl/​iceErrorCorpus An Icelandic Error corpus, annotated for mistakes related to spelling, grammar, and other issues. - 4046
30 May 2020 05:30 UTC WesScivetti/​Phonesthemes-​Project - eng 4052
30 May 2020 05:30 UTC martinmueller39/​TCP2ESTC Experimental relabeling of TCP texts by decade with aligment to ESTC, including four decades at 40year intervals eng, unk 4139
30 May 2020 05:30 UTC grasshoff/​vorlesung2019 - eng, fra, dan, ita, spa, ces 4202
20 Sep 2020 16:32 UTC pminhtam/​entity-​fishing-​custom - eng, fra, deu, spa, ita, pol 4239
30 May 2020 05:30 UTC JamesWolfe753/​Patrologia-​Latina-​Corrected - grc, lat, heb 4241
30 May 2020 05:30 UTC cbeta-​git/​xml_​p4 CBETA XML P4 pli, san, eng, zho 4399
30 May 2020 05:30 UTC utkdigitalinitiatives/​tdh-​migration TEI migration from P2 SGML/XML to P5. - 4403