Discover TEI-encoded documents from GitHub public repositories.
| Last indexed | Repository | Description | Languages | Matching files |
|---|---|---|---|---|
| 30 May 2020 05:30 UTC | ColeDCrawford/test-xml | - | eng | 11 |
| 15 Sep 2021 08:40 UTC | lg14/DH-Projekt-Kessler | - | eng | 540 |
| 30 May 2020 05:30 UTC | cmohge1/riga-text-analysis | Repo for a two-week intro to text analysis course at Riga Technical University (16-26 Sep 2019). | eng | 17 |
| 25 Sep 2021 16:57 UTC | Lemoneezy/EmilyHobhouse | - | eng | 1 |
| 30 May 2020 05:29 UTC | stevaras2/Poster-Generation-Demo | - | eng | 16 |
| 16 May 2021 08:47 UTC | apache/tika | The Apache Tika toolkit detects and extracts metadata and text from over a thousand different file types (such as PPT, XLS, and PDF). | eng | 2 |
| 10 Oct 2020 12:38 UTC | cmosher01/Tei-To-Xhtml5 | Generate XHTML5 web pages from TEI formatted documents | eng | 4 |
| 30 May 2020 05:30 UTC | cmosher01/teish | Simple TEI to HTML converter | eng | 8 |
| 11 Oct 2021 01:42 UTC | jeddobson/ENGL64.05-21F | Repository for Fall term Cultural Analytics course | eng | 234 |
| 08 Jan 2024 11:47 UTC | ericleasemorgan/text-analysis-eebo | Prototype text analysis against the EEBO collection | eng | 97 |
| 30 May 2020 05:29 UTC | Sebastian1984/tika | - | eng | 1 |
| 18 Mar 2022 08:41 UTC | lamps-lab/TableAndFigureMentionsExtractor | - | eng | 1 |
| 30 May 2020 05:30 UTC | CDRH/diachronic | Diachronic Markup Project | eng | 17 |
| 16 Dec 2020 20:38 UTC | conbainbridge/COMM220_USE_nlp_project | Natural language processing of essays in the USE corpus, for COMM 220 final project. | eng | 1 |
| 03 Apr 2023 02:51 UTC | cmm2209/Problemata | - | eng | 6 |
| 30 May 2020 05:29 UTC | zentrum-lexikographie/e-Lexicography-2019 | Course materials for the compact course in digital lexicography held at the University of Potsdam | eng | 2 |
| 26 May 2022 17:49 UTC | lidija-jovanovska/ner-dm-algorithms | A project for testing Named Entity Recognition (NER) models on data mining algorithms data. | eng | 5 |
| 26 Jan 2022 04:52 UTC | jbg5721/forTestingAndSchool | https://jbg5721.github.io/forTestingAndSchool/ | eng | 6 |
| 30 May 2020 05:29 UTC | ContentMine/cm-ucl | A repository to openly track progress on table extraction. | eng | 2 |
| 30 May 2020 05:29 UTC | curationexperts/tufts_models | Hydra models for Tufts | eng | 4 |
| 20 Mar 2022 20:37 UTC | JonathanReeve/jonreeve.com-ema | My personal website, jonreeve.com, rewritten using Ema. | eng | 1 |
| 06 Apr 2021 20:41 UTC | inspirehep/inspire-next | The INSPIRE repo. | eng | 4 |
| 23 May 2022 08:51 UTC | hawc2/gatsby_bo | - | eng | 6 |
| 23 Aug 2022 06:11 UTC | jdmartin/eltec-text-splitter | Chunk English Novels Into Chapters | eng | 35 |
| 30 May 2020 05:30 UTC | charlietaylor98/vangogh-gang | - | eng | 703 |
| 30 May 2020 05:30 UTC | SteveNewman1970/TEIBeggar-sOperaDrafts | - | eng | 2 |
| 30 May 2020 05:30 UTC | taxrolls/taxrolls.github.io | The Tax Rolls of Medieval Paris Digital Edition Project | eng | 9 |
| 30 May 2020 05:29 UTC | sarthfrey/slurp | A library of spelling correction algorithms. | eng | 1 |
| 30 May 2020 05:29 UTC | strangeloop/lambdajam2013 | Lambda Jam 2013 | eng | 1 |
| 30 May 2020 05:29 UTC | superneo/NLP_toy_spell_checker | An implementation of the English spell checker of Peter Norvig. | eng | 1 |
| 30 Aug 2021 04:49 UTC | CDRH/data_civilwardc | Data Repository for Civil War Washington | eng | 3778 |
| 11 Sep 2020 08:32 UTC | walshbr/humanists-nlp-cookbook | Contains materials for a work in progress - "A Humanist's Cookbook for Natural Language Processing in Python." | eng | 1 |
| 30 May 2020 05:29 UTC | stefano-bragaglia/Corrector | Familiarising with Norvig code for spell correction | eng | 1 |
| 30 May 2020 05:29 UTC | suzuki-akira3/tokenizer | - | eng | 2 |
| 30 May 2020 05:29 UTC | TuftsUniversity/MIRA | Tufts Admin interface | eng | 4 |
| 30 May 2020 05:29 UTC | cmohge1/lrbs-scholarly-editing | Central repository for the 2018 Digital Scholarly Editing module at the Institute of English Studies | eng | 5 |
| 23 Aug 2022 17:47 UTC | jeddobson/ENGL64.05-22F | Repository for ENGL 64.05/QSS 30.16 Cultural Analytics (Fall 2022) at Dartmouth College | eng | 82 |
| 30 May 2020 05:29 UTC | sebastianrahtz/TEIXSL-v1 | Family of TEI stylesheets written in XSLT 1.0. | eng | 8 |
| 23 Aug 2021 16:59 UTC | ag-gipp/parallelXmlHighlighting | - | eng | 5 |
| 30 May 2020 05:29 UTC | spenteco/big_file | Testing the LFS thing. | eng | 1 |
| 19 Jan 2021 13:26 UTC | wenamun/notes-from-egypt | eXist-db webapplication/publication of letters from 19th century Egypt | eng | 147 |
| 30 May 2020 05:30 UTC | social-energy-atlas/georgia-municipal-codes | Georgia Municipal Code Dataset | eng | 283 |
| 08 May 2022 18:49 UTC | GutenbergSource/35557-Metelerkamp-Outa-Karels-Stories | TEI master file of Sanni Metelerkamp (1867–1945): Outa Karel’s Stories. | eng, afr | 1 |
| 30 May 2020 05:29 UTC | GutenbergSource/60794-Herkimer-The-Story-of-the-Typewriter | TEI master file of The Story of the Typewriter by the Herkimer County Historical Society | eng, afr, ara, bul, bik, bre, cat, ceb, cym, ces, dan, dak, deu, ell, epo, spa, esx, eus, fas, fin, fra, fry, gle, gla, grc, glv, haw, heb, hin, hrv, hun, hye, ilo, ido, isl, ita, jpn, kar, lat, lad, lit, lav, mag, mlg, mri, mar, msa, mlt, mwr, mya, nah, nld, nor, oci, pag, pam, pol, por, roh, ron, rus, rue, san, slk, slv, sqi, srp, sot, swe, tgl, tur, tat, urd, vie, win, xho, yid, yua, zul | 1 |
| 11 Jan 2021 05:14 UTC | livingstoneonline/LEAP-MT | - | eng, afr, grc, nld, fra, gla, lat, ota, por, tsn, und, ara, bnt, hin, loz, lun, mck, sot, fas, toi, swh, arb, mlg, nym, lea, tur, heb, grk | 108 |
| 08 Jan 2025 11:53 UTC | livingstoneonline/LEAP-TEI | All the TEI files for Livingstone Online | eng, ajw, ara, fra, hin, lat, swh, und, por, gla, grk, grc, mlg, nym, fas, sco, arb, heb, lea, tur, afr, nld, ita, tsn, ell, deu, ota, bnt, loz, lun, mck, sot, toi, swa | 1020 |
| 14 Dec 2020 01:33 UTC | BetaMasaheft/makepdf | make pdf repo | eng, amh, gez | 2 |
| 28 Oct 2021 08:42 UTC | BetaMasaheft/BetMas | Exist-db application of the Beta Masaheft project | eng, amh, gez | 18 |
| 25 Nov 2022 16:57 UTC | BetaMasaheft/Authority-Files | Places, People and Taxonomies for Manuscripts and Works | eng, amh, gez, ara, ita | 602 |
| 30 May 2020 05:30 UTC | wvbe/shakespeare-to-the-max | - | eng, ang, ces, lat, fra, ell, deu, grc, ara, nld, grk, heb, ita, spa, swe, tur, enm, gmh, cym | 2723 |