Discover TEI-encoded documents in public GitHub repositories.
| Last indexed | Repository | Description | Languages | Matching files |
|---|---|---|---|---|
| 19 Jan 2021 13:26 UTC | rstarlin/text-processor | A text processor that searches pdf or xml files for a set of known target words. | eng | 93 |
| 30 May 2020 05:30 UTC | kshawkin/Best-Practices-for-TEI-in-Libraries | Best Practices for TEI in Libraries: A guide for mass digitization, automated workflows, and promotion of interoperability with XML using the TEI | eng | 2 |
| 15 Jun 2021 04:53 UTC | Denubis/epidoc-xlst-tester | - | eng | 1 |
| 16 Dec 2020 20:38 UTC | conbainbridge/COMM220_USE_nlp_project | Natural language processing of essays in the USE corpus, for COMM 220 final project. | eng | 1 |
| 03 Apr 2023 02:51 UTC | cmm2209/Problemata | - | eng | 6 |
| 14 Dec 2022 11:43 UTC | jd-coderepos/nfdi4ds-sota-shared-task | - | eng | 125 |
| 30 May 2020 05:29 UTC | jeschollaert/Diamondback-Encoding | - | eng | 2 |
| 30 May 2020 05:29 UTC | RuiMao1988/Sequential-Metaphor-Identification | Code for the paper "End-to-End Sequential Metaphor Identification Inspired by Linguistic Theories" | eng | 2 |
| 15 Feb 2023 23:46 UTC | scta/scta-people | - | eng | 1 |
| 30 May 2020 05:29 UTC | Sebastian1984/tika | - | eng | 1 |
| 20 Jan 2023 04:48 UTC | jdmenke/prediabetes_doc_classifier | This relates to my prediabetes classification work on biomedical manuscripts using annotations from prior meta-analyses. | eng | 94 |
| 13 Aug 2021 08:40 UTC | IntheGrass/citeomatic_learning | learn citeomatic code. Add comment/note in source code | eng | 2 |
| 07 Mar 2023 09:47 UTC | WillHolbrook/FYPCodeRepo | The codebase for my final year project | eng | 1 |
| 28 Mar 2021 12:58 UTC | davidxc/warc-pdf-extract | This is a fork of https://git.archive.org/bnewbold/pdf-extract | eng | 3 |
| 08 Jan 2025 21:52 UTC | KislakCenter/VisColl | Modeling and visualizing physical manuscript collation | eng | 3623 |
| 30 Aug 2022 06:41 UTC | ieg-dhr/DigitaleEditorikDMGK | Data and teaching materials from the module "Digitale Editorik Historischer Quellen" (Digital Editing of Historical Sources) in the DMGK degree program, Mainz | eng | 7 |
| 30 May 2020 05:29 UTC | Ximenaflores/Ximena-tei-test | - | eng | 1 |
| 30 May 2020 05:29 UTC | Rinijain7/CSCI599HW2 | - | eng | 3 |
| 30 May 2020 05:29 UTC | rdmpage/javascript-jats-xml | Generating JATS XML article markup using Javascript | eng | 1 |
| 23 Aug 2021 16:59 UTC | kermitt2/grobid_client_python | Python client for GROBID Web services | eng | 5 |
| 13 Oct 2020 08:35 UTC | kanripox/Laozi | - | eng | 13 |
| 30 May 2020 05:29 UTC | deepakpunjabi/Quora-Followee-Recommendation | A system to predict & recommend person to follow on Quora based on their personality traits and topic interest similarity. | eng | 1 |
| 30 May 2020 05:29 UTC | sheldonresearch/AdaWalk | Xiangguo Sun, Bo Liu, Qing Meng, Jiuxin Cao, Junzhou Luo, Hongzhi Yin. Group-level Personality Detection based on Text Generated Networks. World Wide Web. 2019. (Accepted, CCF-B, SCI) | eng | 1 |
| 26 Jan 2023 10:46 UTC | JoanGi/Dataset-Reverse-Engineering | Reverse Engineering tool for datasets using NLP techniques | eng | 2 |
| 30 May 2020 05:29 UTC | Redhouane/surya | A Python package that uploads a set of research articles, then parses and summarizes the entire corpus or selected sections (method, results, etc.). | eng | 1 |
| 30 May 2020 05:28 UTC | DominuttiElisa/esercizi-codifica | Exercises | eng | 9 |
| 30 Mar 2023 10:46 UTC | rbgvictoria/vmcp-tei | Von Mueller Correspondence TEI files | eng | 14 |
| 02 Apr 2023 04:47 UTC | iMouth/NLP-Project | - | eng | 4 |
| 10 Dec 2021 08:43 UTC | ReneDorsch/document_extraction_service | - | eng | 7 |
| 06 Feb 2026 18:28 UTC | iangow/personality-1 | - | eng | 1 |
| 10 Dec 2021 08:43 UTC | ReneDorsch/document_annotation_service | - | eng | 3 |
| 10 Sep 2020 08:32 UTC | agile-humanities/ddhi-oht-schema | Dartmouth Digital History Initiative TEI schema customization | eng | 2 |
| 27 Jan 2022 01:39 UTC | rossellaverroca/banksylast20 | Project for the DH exam | eng | 1 |
| 05 May 2022 07:43 UTC | rstachurski/xml-parsing- | - | eng | 1 |
| 30 May 2020 05:29 UTC | demery/tdw | A set of miscellaneous scripts for working with data from the Digital Walters website. | eng | 1 |
| 09 Mar 2023 07:46 UTC | JCabeza99/UPM-AI-GROBID | Simple grobid client for report generation | eng | 1 |
| 27 Nov 2022 04:49 UTC | radardenker/digilit-sai | Digital Literacy for South Asianists at SAI Heidelberg, materials for the session on GRETIL, TEI for critical editing, and XSLT. | eng | 1 |
| 07 Jun 2024 15:49 UTC | rism-digital/verovio | 🎵 Music notation engraving library for MEI with MusicXML and Humdrum support and various toolkits (JavaScript, Python) | eng | 2 |
| 05 Mar 2023 21:46 UTC | rubenixter/UPM_IA-OS | UPM stuff | eng | 3 |
| 17 Feb 2023 07:45 UTC | zentrum-lexikographie/elexicography-WiSe2023 | Course materials for the compact course in digital lexicography held at the University of Potsdam | eng | 5 |
| 08 May 2022 18:49 UTC | GutenbergSource/35557-Metelerkamp-Outa-Karels-Stories | TEI master file of Sanni Metelerkamp (1867–1945): Outa Karel’s Stories. | eng, afr | 1 |
| 30 May 2020 05:29 UTC | GutenbergSource/60794-Herkimer-The-Story-of-the-Typewriter | TEI master file of The Story of the Typewriter by the Herkimer County Historical Society | eng, afr, ara, bul, bik, bre, cat, ceb, cym, ces, dan, dak, deu, ell, epo, spa, esx, eus, fas, fin, fra, fry, gle, gla, grc, glv, haw, heb, hin, hrv, hun, hye, ilo, ido, isl, ita, jpn, kar, lat, lad, lit, lav, mag, mlg, mri, mar, msa, mlt, mwr, mya, nah, nld, nor, oci, pag, pam, pol, por, roh, ron, rus, rue, san, slk, slv, sqi, srp, sot, swe, tgl, tur, tat, urd, vie, win, xho, yid, yua, zul | 1 |
| 11 Jan 2021 05:14 UTC | livingstoneonline/LEAP-MT | - | eng, afr, grc, nld, fra, gla, lat, ota, por, tsn, und, ara, bnt, hin, loz, lun, mck, sot, fas, toi, swh, arb, mlg, nym, lea, tur, heb, grk | 108 |
| 08 Jan 2025 11:53 UTC | livingstoneonline/LEAP-TEI | All the TEI files for Livingstone Online | eng, ajw, ara, fra, hin, lat, swh, und, por, gla, grk, grc, mlg, nym, fas, sco, arb, heb, lea, tur, afr, nld, ita, tsn, ell, deu, ota, bnt, loz, lun, mck, sot, toi, swa | 1020 |
| 28 Oct 2021 08:42 UTC | BetaMasaheft/BetMas | Exist-db application of the Beta Masaheft project | eng, amh, gez | 18 |
| 14 Dec 2020 01:33 UTC | BetaMasaheft/makepdf | make pdf repo | eng, amh, gez | 2 |
| 25 Nov 2022 16:57 UTC | BetaMasaheft/Authority-Files | Places, People and Taxonomies for Manuscripts and Works | eng, amh, gez, ara, ita | 602 |
| 30 May 2020 05:30 UTC | wvbe/shakespeare-to-the-max | - | eng, ang, ces, lat, fra, ell, deu, grc, ara, nld, grk, heb, ita, spa, swe, tur, enm, gmh, cym | 2723 |
| 30 May 2020 05:29 UTC | anasfkhan81/LMFEty | Fahad Khan and Jack Bowers LMF Etymological Materials | eng, ang, lat, deu, enm, fra, nld, ita, frm, mix, srd, por, jpn, urd, spa | 1 |
| 22 Sep 2021 08:40 UTC | OpenArabicPE/journal_al-jinan | Bibliographic metadata as TEI and MODS xml for the al-Bustānīs's fortnightly journal al-Jinān (الجنان) from Beirut, 1870-1885 | eng, ara | 385 |