Discover TEI-encoded documents from GitHub public repositories.
| Last indexed | Repository | Description | Languages | Matching files |
|---|---|---|---|---|
| 07 Feb 2021 20:36 UTC | ocularminds/flask-analytics | - | - | 501 |
| 30 May 2020 05:30 UTC | SashiniHansika/fyp | - | - | 501 |
| 30 May 2020 05:30 UTC | casics/nostril | Nostril: Nonsense String Evaluator | - | 501 |
| 30 May 2020 05:30 UTC | stephenysh/translate-sunil | - | - | 501 |
| 30 May 2020 05:30 UTC | francojc/recipes-curate_data | Repository to accompany the 'Curate Language Data' posts (1-2) for the Recipes series | - | 501 |
| 14 Sep 2020 16:32 UTC | markzuck24/NLP_SVM_POS | svm code | - | 501 |
| 30 May 2020 05:29 UTC | ericbarnhill/flask_app | Flask By Example app | - | 501 |
| 30 May 2020 05:29 UTC | giuseppecascavilla/topic_modelling | topic modelling on a dataset | - | 501 |
| 30 May 2020 05:30 UTC | steinhkl/tsgce | Tiny Statistical Grammar Checking Engine | - | 501 |
| 30 May 2020 05:29 UTC | jeffthemaximum/word-pair-frequency-calculator | A Flask app that calculates word-frequency pairs based on the text from a given URL | - | 501 |
| 08 Apr 2022 06:51 UTC | balajidileepkumar/Python_MachineLearning | From Basics Python to DataMining in Machine Learning | - | 501 |
| 30 May 2020 05:30 UTC | Horsmann/DkProTcIntegration | Integration tests for DKPro TC with larger data sets | - | 500 |
| 30 May 2020 05:29 UTC | freethenation/HMM | Playing Around with Hidden Markov Models | - | 500 |
| 30 May 2020 05:30 UTC | kb-dk/public-adl-text-sources | The texts used for building Archive for Danish Literature | - | 498 |
| 17 Sep 2020 04:32 UTC | shae128/xml-pdf.js | JavaScript/Node.js library to convert XML to PDF | lat | 497 |
| 13 Jul 2021 08:39 UTC | Cantavestrella/tei-ausiasmarch | Conversion from TEX format into TEI-XML of the synoptic diplomatic edition of 15-c. Ausiàs March's poems according to all witnesses. | cat | 489 |
| 15 Nov 2021 01:36 UTC | deutschestextarchiv/DiBiLit-Korpus | - | deu | 487 |
| 23 Feb 2021 08:43 UTC | piahh/Graphentheorie | Universitätskurs: Graphentheorie. | deu | 481 |
| 30 Dec 2020 17:28 UTC | tnhaider/antikoerperchen-german-annotated-poetry | German Canon Poetry Corpus with Annotation | deu | 477 |
| 30 May 2020 05:30 UTC | bncolorado/CorpusGeneralPoesiaLiricaCastellanaDelSigloDeOro | Corpus piloto para un corpus de referencia general de la poesía lírica castellana del Siglo de Oro. | - | 475 |
| 03 Sep 2021 12:56 UTC | Amleth/SHERLOCK | Social sciences & Humanities corpora Exploration and active Reading with Linked, Open & Contributive Knowledge organisation systems | fra | 473 |
| 30 May 2020 05:30 UTC | acdh-oeaw/glaser-tei | A eXist-db based web-app to process Glaser-Abklatsche | eng, inm | 472 |
| 15 Mar 2023 13:49 UTC | rh1967/rh1967.github.io | - | deu, eng | 471 |
| 30 May 2020 05:30 UTC | marianiku/gottlund | Metsäsuomalaiset > Gottlund | fin, sme | 463 |
| 13 Nov 2022 20:47 UTC | REEDLondon/inns-court | Inns of Court materials | eng, lat, fra | 463 |
| 08 Nov 2021 01:35 UTC | IRT2021/Merge-O-Bu-Njem | XML and stylesheets to merge O. Bu Njem data, text and translation from Papyri.info into a single EpiDoc file for IRT 2021 | fra, eng, deu, ita, spa, lat, ell | 462 |
| 14 Oct 2021 20:37 UTC | scta-texts/vn58an | - | lat | 461 |
| 27 Mar 2022 04:50 UTC | himmeproject/persons | Person data for the Historical Index of the Medieval Middle East | - | 461 |
| 15 Nov 2022 21:45 UTC | ANRChapitres/2000romans19e20e | Corpus de 2000 romans français du 19e et 20e siècles libres de droit en xml-tei | - | 460 |
| 30 May 2020 05:30 UTC | rsmccc/topic-model-ldavis | - | - | 451 |
| 29 Mar 2023 11:45 UTC | MiMoText/roman18 | Collection de romans français du dix-huitième siècle (1751-1800) / Collection of Eighteenth-Century French Novels (1751-1800) | fra, ita, eng, deu | 448 |
| 30 May 2020 05:30 UTC | uvalib/dlps_scripts-webdocs | archive of the dlps workflow scripts (and documentation) from pogo.lib [before decommissioning] | eng, zho, fra, nld, rus, deu, spa, fil, ita, por, lat, ind, rom, ell, apa | 444 |
| 05 Mar 2021 08:45 UTC | lknelson/measuring_intersectionality | Code to reproduce the models and analysis in the paper "Leveraging the Alignment between Machine Learning and Intersectionality: Using Word Embeddings to Measure Intersectional Experiences of the Nineteenth Century U.S. South", by Laura K. Nelson | eng, fra, lat, ita, spa, gle, deu, ell, nld, por, cym, nai | 444 |
| 30 May 2020 05:30 UTC | mhbeals/scissorsandpaste | A collection of transcriptions from British newspapers (1789-1850) alongside originals from colonial and American newspapers, where relevant. | - | 443 |
| 18 Mar 2021 12:52 UTC | pablogalvezprojectosdaw/scissorsandpaste-master | - | - | 443 |
| 30 May 2020 05:30 UTC | cligs/projects | The CLiGS group's repository for code and data related to specific talks or publications. | fra | 441 |
| 01 Oct 2021 16:59 UTC | dig-eg-gaz/advertisements | images and xml text of ads used in Egyptian Gazette | fra | 439 |
| 30 Jun 2021 12:58 UTC | Alex-bzh/corpus-kaamelott | Corpus of screenplays from TV show Kaamelott | - | 433 |
| 15 Mar 2026 20:16 UTC | ADHO/dh2016 | Abstracts from the DH2016 conference in Kraków. | - | 431 |
| 30 May 2020 05:30 UTC | fbkarsdorp/story-network-data | Data accompanying the paper on story networks | - | 427 |
| 21 May 2021 13:05 UTC | lascivaroma/digiliblt | Capitains version of DigilibLT data | lat | 426 |
| 26 Jun 2022 15:45 UTC | scta-texts/n3av8a | - | lat | 424 |
| 06 Aug 2021 04:49 UTC | tnhaider/metrical-tagging-in-the-wild | - | eng | 419 |
| 30 May 2020 05:30 UTC | jhu-digital-manuscripts/rosademo | Backend services for annotation interop demo | - | 415 |
| 28 Jun 2022 13:23 UTC | WoPoss-project/source_texts | Works being curated prior to corpus creation | lat, grc, eng, deu, fra, ita | 415 |
| 26 Mar 2023 17:45 UTC | livingstoneonline/onemorevoice | This is the repository for One More Voice. One More Voice is a digital humanities recovery project that identifies, documents, and critically engages with the voices of racialized creators in British imperial and colonial archives. The voices take multiple forms and appear in multiple genres. Our project seeks to introduce these rich and diverse materials to broad academic and public audiences. Recourse to the voices promises to transform our understanding of imperial and colonial history and literature while foregrounding perspectives that scholarship in majority has hitherto overlooked or silenced. | eng, und, grc, tsn, swh, lat | 415 |
| 03 Apr 2023 23:46 UTC | whitmanarchive/whitman-manuscripts | Data Repo | Whitman Manuscripts TEI | - | 412 |
| 30 May 2020 05:30 UTC | Clara-Kloster/Guldkorpus | - | - | 411 |
| 30 May 2020 05:30 UTC | chriswolfram/ComputationalDiaries | Computational Editions of the Astronomical Diaries | akk | 408 |
| 09 Jun 2025 11:58 UTC | leoba/TEI-2-IIIF | XSLT for converting TEI MsDescription to IIIF manifests | lat | 402 |