TEIhub
Discover TEI-encoded documents from GitHub public repositories.

Last indexed Repository Description Languages Matching files
06 Dec 2021 04:50 UTC csae8092/​bahr2arche - deu 985
23 Apr 2022 23:43 UTC claraimc/​ANVERSO - - 987
30 Mar 2023 23:45 UTC scta-​texts/​lombardsententia - lat, eng 994
08 Dec 2021 20:40 UTC acdh-​oeaw/​Hermann-​Bahr_​Arthur-​Schnitzler - deu 995
30 Sep 2021 08:40 UTC Getinet309/​Manuscripts - eng, gez 1000
20 Sep 2021 08:40 UTC himmeproject/​practices Practices data for the Historical Index of the Medieval Middle East - 1000
26 Jun 2021 04:48 UTC FrederikeNeuber/​jeanpaulanalytics Playground! Data and scripts for applying methods of Social Media Analytics to a corpus of letters from Jean Paul's family, friends and colleagues - 1000
10 Nov 2021 08:42 UTC lichaozhu/​projet_​encyclopedie_​tourisme - fra 1000
30 May 2020 05:30 UTC utkdigitalinitiatives/​spc-​tei TEI XML and corresponding MODS records - 1008
08 Jan 2025 11:53 UTC livingstoneonline/​LEAP-​TEI All the TEI files for Livingstone Online eng, ajw, ara, fra, hin, lat, swh, und, por, gla, grk, grc, mlg, nym, fas, sco, arb, heb, lea, tur, afr, nld, ita, tsn, ell, deu, ota, bnt, loz, lun, mck, sot, toi, swa 1020
30 May 2020 05:30 UTC koper921/​TC3-​Classification-​Oeuvres-​Theatres - fra 1027
30 May 2020 05:30 UTC issahammoud/​French-​plays-​classification - fra 1029
30 May 2020 05:30 UTC sandark95/​https-​github.​com-​issahammoud-​French-​plays-​classification - fra 1029
14 Nov 2020 08:32 UTC mchesterkadwell/​named-​entity-​recognition Notebooks for teaching Named Entity Recognition at the Cultural Heritage Data School, run by Cambridge Digital Humanities, June-July 2020 - 1032
11 Feb 2021 01:15 UTC cmroueche/​OLD_​IRT_​EFES - grc, eng, lat, rus, ara, fra, deu, ell, heb, ita, phn, ber, chu 1044
30 May 2020 05:30 UTC wolf257/​m1s2-​documents-​structures - fra 1052
30 May 2020 05:30 UTC neozhangthe1/​canonicity_​old A light weight library for natural language understanding and knowledge base canonicalization. - 1060
26 May 2022 21:44 UTC oeaw-​ministerratsprotokolle/​mp-​edition-​data Repo for TEI XML data from the `Ministerratsprotokolle` edition project that has been published on the web already - 1070
30 May 2020 05:29 UTC lpschaub/​projets_​M1 projets en RI et EI fra 1073
30 May 2020 05:30 UTC OBVIL/​sainte-​beuve-​philologic - fra 1087
10 Feb 2023 10:49 UTC dsldk/​tycho-​brahe - lat, dan, deu 1091
11 Apr 2022 08:47 UTC vikhil0609/​quickcompanyGrobid - eng 1092
30 May 2020 05:30 UTC TEI-​examples/​tei-​examples Examples of TEI documents dealing with different use-cases. fra, eng, x-oldcam, san, ara, deu, grc, ell, heb, ita, lat, phn, ber, spa, cop 1106
15 May 2021 04:56 UTC LeReMed/​XML - - 1116
19 Feb 2021 12:48 UTC MAPerformance/​MAP_​XML TEI Data for the MAP project - 1116
30 May 2020 05:30 UTC telpirion/​ClassicsFormattingTools Tools for formatting open source classical works eng, lat, deu, fra, ita, tur, spa, pie 1119
30 May 2020 05:30 UTC uvalib/​text-​collections Restriction-free TEI texts eng, fra, lat, ell, deu, grk, grc, heb, ita, rus, san, moh 1125
30 May 2020 05:30 UTC ParkerWeb/​descMD Descriptive Metadata for Parker on the Web - 1125
29 Nov 2024 15:55 UTC LukeMurphey/​perseus-​greek-​and-​roman-​texts Contains various Greek and Roman works of antiquity that were originally provided as part of the Perseus project (http://www.perseus.tufts.edu/). eng, lat, deu, fra, ita, tur, spa, pie 1127
30 May 2020 05:30 UTC telotahelpdesk/​backup_​home - fra, deu 1127
30 May 2020 05:30 UTC nkallen/​perseus-​greco-​roman Perseus Greek & Roman texts eng, lat, deu, fra, ita, tur, spa, pie 1127
26 Nov 2020 12:39 UTC holmesr19/​VILLA_​data Raw data from Perseus & the cleaning and prep work to make the VILLA database eng, lat, deu, fra, ita, tur, spa, pie 1127
30 May 2020 05:30 UTC kabojnk/​perseus-​parsers - eng, lat, deu, fra, ita, tur, spa, pie 1128
29 Oct 2020 12:47 UTC Princeton-​CDH/​mapping-​expatriate-​paris Encoding library cards from Sylvia Beach's Shakespeare and Company - 1157
30 May 2020 05:30 UTC FrankensteinVariorum/​fv-​postCollation a repository for post-processing finalized collation files to prepare the Variorum edition. - 1166
30 May 2020 05:30 UTC OpenPhilology/​canonical OPP work iterating PerseusDL eng, lat, deu, fra, ita, tur 1169
30 May 2020 05:30 UTC rillian/​cdli-​cts Canonical Text Services export of the Cuneiform Digital Library Initiative corpus. eng, sux, akk, tso 1178
04 Apr 2023 20:45 UTC OpenGreekAndLatin/​First1KGreek XML files for the works in the First Thousand Years of Greek Project. Please see our Wiki on how to contribute. grc, lat, deu, eng, cop, nld, fra, mul, ita, ell 1182
07 Apr 2023 08:45 UTC erc-​dharma/​tfc-​khmer-​epigraphy This repository assembles data produced by the project Corpus des inscriptions khmères (before and during the DHARMA project). san, eng, x-oldkhmer, lat, khm, pra, fra, pli 1191
01 Feb 2021 17:22 UTC dhhse/​Mandelstam_​digital_​archive - - 1195
30 May 2020 05:30 UTC pervosled/​Mandelstam_​TEI - - 1195
30 May 2020 05:30 UTC mbwolff/​Classique-​inconnu Reinvent classic French plays fra 1209
30 May 2020 05:30 UTC cligs/​toolbox Collection of small tools for text processing. ita, fra 1215
30 May 2020 05:30 UTC bodleian/​senmai-​mss - shn 1217
01 Jul 2021 17:02 UTC OBVIL/​apollinaire - fra 1218
30 May 2020 05:30 UTC DARIAH-​SI/​CLARIN.​SI - slv, eng 1219
15 Nov 2020 16:40 UTC howisonlab/​softcite-​dataset An annotated dataset of software mentions in scholarly articles. eng, fra, deu, por, spa, ara 1248
26 Dec 2021 20:39 UTC ONCOJ/​data Periodic release of the Oxford NINJAL Corpus of Old Japanese (ONCOJ) - 1268
30 May 2020 05:30 UTC PolMine/​GermaParlTEI GermaParl: Corpus of Plenary Protocols of the German Bundestag (TEI Format) - 1294
11 Nov 2022 05:08 UTC whanley/​ilcorpus Various resources for International Law corpus project - 1300