TEIhub
Discover TEI-encoded documents from GitHub public repositories.

Last indexed Repository Description Languages Matching files
30 May 2020 05:30 UTC ColeDCrawford/​test-​xml - eng 11
15 Sep 2021 08:40 UTC lg14/​DH-​Projekt-​Kessler - eng 540
30 May 2020 05:30 UTC cmohge1/​riga-​text-​analysis Repo for a two-week intro to text analysis course at Riga Technical University (16-26 Sep 2019). eng 17
25 Sep 2021 16:57 UTC Lemoneezy/​EmilyHobhouse - eng 1
30 May 2020 05:29 UTC stevaras2/​Poster-​Generation-​Demo - eng 16
16 May 2021 08:47 UTC apache/​tika The Apache Tika toolkit detects and extracts metadata and text from over a thousand different file types (such as PPT, XLS, and PDF). eng 2
10 Oct 2020 12:38 UTC cmosher01/​Tei-​To-​Xhtml5 Generate XHTML5 web pages from TEI formatted documents eng 4
30 May 2020 05:30 UTC cmosher01/​teish Simple TEI to HTML converter eng 8
11 Oct 2021 01:42 UTC jeddobson/​ENGL64.​05-​21F Repository for Fall term Cultural Analytics course eng 234
08 Jan 2024 11:47 UTC ericleasemorgan/​text-​analysis-​eebo Prototype text analysis against the EEBO collection eng 97
30 May 2020 05:29 UTC Sebastian1984/​tika - eng 1
18 Mar 2022 08:41 UTC lamps-​lab/​TableAndFigureMentionsExtractor - eng 1
30 May 2020 05:30 UTC CDRH/​diachronic Diachronic Markup Project eng 17
16 Dec 2020 20:38 UTC conbainbridge/​COMM220_​USE_​nlp_​project Natural language processing of essays in the USE corpus, for COMM 220 final project. eng 1
03 Apr 2023 02:51 UTC cmm2209/​Problemata - eng 6
30 May 2020 05:29 UTC zentrum-​lexikographie/​e-​Lexicography-​2019 Course materials for the compact course in digital lexicography held at the University of Potsdam eng 2
26 May 2022 17:49 UTC lidija-​jovanovska/​ner-​dm-​algorithms A project for testing Named Entity Recognition (NER) models on data mining algorithms data. eng 5
26 Jan 2022 04:52 UTC jbg5721/​forTestingAndSchool https://jbg5721.github.io/forTestingAndSchool/ eng 6
30 May 2020 05:29 UTC ContentMine/​cm-​ucl A repository to openly track progress on table extraction. eng 2
30 May 2020 05:29 UTC curationexperts/​tufts_​models Hydra models for Tufts eng 4
20 Mar 2022 20:37 UTC JonathanReeve/​jonreeve.​com-​ema My personal website, jonreeve.com, rewritten using Ema. eng 1
06 Apr 2021 20:41 UTC inspirehep/​inspire-​next The INSPIRE repo. eng 4
23 May 2022 08:51 UTC hawc2/​gatsby_​bo - eng 6
23 Aug 2022 06:11 UTC jdmartin/​eltec-​text-​splitter Chunk English Novels Into Chapters eng 35
30 May 2020 05:30 UTC charlietaylor98/​vangogh-​gang - eng 703
30 May 2020 05:30 UTC SteveNewman1970/​TEIBeggar-​sOperaDrafts - eng 2
30 May 2020 05:30 UTC taxrolls/​taxrolls.​github.​io The Tax Rolls of Medieval Paris Digital Edition Project eng 9
30 May 2020 05:29 UTC sarthfrey/​slurp A library of spelling correction algorithms. eng 1
30 May 2020 05:29 UTC strangeloop/​lambdajam2013 Lambda Jam 2013 eng 1
30 May 2020 05:29 UTC superneo/​NLP_​toy_​spell_​checker An implementation of the English spell checker of Peter Norvig. eng 1
30 Aug 2021 04:49 UTC CDRH/​data_​civilwardc Data Repository for Civil War Washington eng 3778
11 Sep 2020 08:32 UTC walshbr/​humanists-​nlp-​cookbook Contains materials for a work in progress - "A Humanist's Cookbook for Natural Language Processing in Python." eng 1
30 May 2020 05:29 UTC stefano-​bragaglia/​Corrector Familiarising with Norvig code for spell correction eng 1
30 May 2020 05:29 UTC suzuki-​akira3/​tokenizer - eng 2
30 May 2020 05:29 UTC TuftsUniversity/​MIRA Tufts Admin interface eng 4
30 May 2020 05:29 UTC cmohge1/​lrbs-​scholarly-​editing Central repository for the 2018 Digital Scholarly Editing module at the Institute of English Studies eng 5
23 Aug 2022 17:47 UTC jeddobson/​ENGL64.​05-​22F Repository for ENGL 64.05/QSS 30.16 Cultural Analytics (Fall 2022) at Dartmouth College eng 82
30 May 2020 05:29 UTC sebastianrahtz/​TEIXSL-​v1 Family of TEI stylesheets written in XSLT 1.0. eng 8
23 Aug 2021 16:59 UTC ag-​gipp/​parallelXmlHighlighting - eng 5
30 May 2020 05:29 UTC spenteco/​big_​file Testing the LFS thing. eng 1
19 Jan 2021 13:26 UTC wenamun/​notes-​from-​egypt eXist-db webapplication/publication of letters from 19th century Egypt eng 147
30 May 2020 05:30 UTC social-​energy-​atlas/​georgia-​municipal-​codes Georgia Municipal Code Dataset eng 283
08 May 2022 18:49 UTC GutenbergSource/​35557-​Metelerkamp-​Outa-​Karels-​Stories TEI master file of Sanni Metelerkamp (1867–1945): Outa Karel’s Stories. eng, afr 1
30 May 2020 05:29 UTC GutenbergSource/​60794-​Herkimer-​The-​Story-​of-​the-​Typewriter TEI master file of The Story of the Typewriter by the Herkimer County Historical Society eng, afr, ara, bul, bik, bre, cat, ceb, cym, ces, dan, dak, deu, ell, epo, spa, esx, eus, fas, fin, fra, fry, gle, gla, grc, glv, haw, heb, hin, hrv, hun, hye, ilo, ido, isl, ita, jpn, kar, lat, lad, lit, lav, mag, mlg, mri, mar, msa, mlt, mwr, mya, nah, nld, nor, oci, pag, pam, pol, por, roh, ron, rus, rue, san, slk, slv, sqi, srp, sot, swe, tgl, tur, tat, urd, vie, win, xho, yid, yua, zul 1
11 Jan 2021 05:14 UTC livingstoneonline/​LEAP-​MT - eng, afr, grc, nld, fra, gla, lat, ota, por, tsn, und, ara, bnt, hin, loz, lun, mck, sot, fas, toi, swh, arb, mlg, nym, lea, tur, heb, grk 108
08 Jan 2025 11:53 UTC livingstoneonline/​LEAP-​TEI All the TEI files for Livingstone Online eng, ajw, ara, fra, hin, lat, swh, und, por, gla, grk, grc, mlg, nym, fas, sco, arb, heb, lea, tur, afr, nld, ita, tsn, ell, deu, ota, bnt, loz, lun, mck, sot, toi, swa 1020
14 Dec 2020 01:33 UTC BetaMasaheft/​makepdf make pdf repo eng, amh, gez 2
28 Oct 2021 08:42 UTC BetaMasaheft/​BetMas Exist-db application of the Beta Masaheft project eng, amh, gez 18
25 Nov 2022 16:57 UTC BetaMasaheft/​Authority-​Files Places, People and Taxonomies for Manuscripts and Works eng, amh, gez, ara, ita 602
30 May 2020 05:30 UTC wvbe/​shakespeare-​to-​the-​max - eng, ang, ces, lat, fra, ell, deu, grc, ara, nld, grk, heb, ita, spa, swe, tur, enm, gmh, cym 2723