TEIhub
Discover TEI-encoded documents from GitHub public repositories.

Last indexed Repository Description Languages Matching files
07 Feb 2021 20:36 UTC ocularminds/​flask-​analytics - - 501
30 May 2020 05:30 UTC SashiniHansika/​fyp - - 501
30 May 2020 05:30 UTC casics/​nostril Nostril: Nonsense String Evaluator - 501
30 May 2020 05:30 UTC stephenysh/​translate-​sunil - - 501
30 May 2020 05:30 UTC francojc/​recipes-​curate_​data Repository to accompany the 'Curate Language Data' posts (1-2) for the Recipes series - 501
14 Sep 2020 16:32 UTC markzuck24/​NLP_​SVM_​POS svm code - 501
30 May 2020 05:29 UTC ericbarnhill/​flask_​app Flask By Example app - 501
30 May 2020 05:29 UTC giuseppecascavilla/​topic_​modelling topic modelling on a dataset - 501
30 May 2020 05:30 UTC steinhkl/​tsgce Tiny Statistical Grammar Checking Engine - 501
30 May 2020 05:29 UTC jeffthemaximum/​word-​pair-​frequency-​calculator A Flask app that calculates word-frequency pairs based on the text from a given URL - 501
08 Apr 2022 06:51 UTC balajidileepkumar/​Python_​MachineLearning From Basics Python to DataMining in Machine Learning - 501
30 May 2020 05:30 UTC Horsmann/​DkProTcIntegration Integration tests for DKPro TC with larger data sets - 500
30 May 2020 05:29 UTC freethenation/​HMM Playing Around with Hidden Markov Models - 500
30 May 2020 05:30 UTC kb-​dk/​public-​adl-​text-​sources The texts used for building Archive for Danish Literature - 498
17 Sep 2020 04:32 UTC shae128/​xml-​pdf.​js JavaScript/Node.js library to convert XML to PDF lat 497
13 Jul 2021 08:39 UTC Cantavestrella/​tei-​ausiasmarch Conversion from TEX format into TEI-XML of the synoptic diplomatic edition of 15-c. Ausiàs March's poems according to all witnesses. cat 489
15 Nov 2021 01:36 UTC deutschestextarchiv/​DiBiLit-​Korpus - deu 487
23 Feb 2021 08:43 UTC piahh/​Graphentheorie Universitätskurs: Graphentheorie. deu 481
30 Dec 2020 17:28 UTC tnhaider/​antikoerperchen-​german-​annotated-​poetry German Canon Poetry Corpus with Annotation deu 477
30 May 2020 05:30 UTC bncolorado/​CorpusGeneralPoesiaLiricaCastellanaDelSigloDeOro Corpus piloto para un corpus de referencia general de la poesía lírica castellana del Siglo de Oro. - 475
03 Sep 2021 12:56 UTC Amleth/​SHERLOCK Social sciences & Humanities corpora Exploration and active Reading with Linked, Open & Contributive Knowledge organisation systems fra 473
30 May 2020 05:30 UTC acdh-​oeaw/​glaser-​tei A eXist-db based web-app to process Glaser-Abklatsche eng, inm 472
15 Mar 2023 13:49 UTC rh1967/​rh1967.​github.​io - deu, eng 471
30 May 2020 05:30 UTC marianiku/​gottlund Metsäsuomalaiset > Gottlund fin, sme 463
13 Nov 2022 20:47 UTC REEDLondon/​inns-​court Inns of Court materials eng, lat, fra 463
08 Nov 2021 01:35 UTC IRT2021/​Merge-​O-​Bu-​Njem XML and stylesheets to merge O. Bu Njem data, text and translation from Papyri.info into a single EpiDoc file for IRT 2021 fra, eng, deu, ita, spa, lat, ell 462
14 Oct 2021 20:37 UTC scta-​texts/​vn58an - lat 461
27 Mar 2022 04:50 UTC himmeproject/​persons Person data for the Historical Index of the Medieval Middle East - 461
15 Nov 2022 21:45 UTC ANRChapitres/​2000romans19e20e Corpus de 2000 romans français du 19e et 20e siècles libres de droit en xml-tei - 460
30 May 2020 05:30 UTC rsmccc/​topic-​model-​ldavis - - 451
29 Mar 2023 11:45 UTC MiMoText/​roman18 Collection de romans français du dix-huitième siècle (1751-1800) / Collection of Eighteenth-Century French Novels (1751-1800) fra, ita, eng, deu 448
30 May 2020 05:30 UTC uvalib/​dlps_​scripts-​webdocs archive of the dlps workflow scripts (and documentation) from pogo.lib [before decommissioning] eng, zho, fra, nld, rus, deu, spa, fil, ita, por, lat, ind, rom, ell, apa 444
05 Mar 2021 08:45 UTC lknelson/​measuring_​intersectionality Code to reproduce the models and analysis in the paper "Leveraging the Alignment between Machine Learning and Intersectionality: Using Word Embeddings to Measure Intersectional Experiences of the Nineteenth Century U.S. South", by Laura K. Nelson eng, fra, lat, ita, spa, gle, deu, ell, nld, por, cym, nai 444
30 May 2020 05:30 UTC mhbeals/​scissorsandpaste A collection of transcriptions from British newspapers (1789-1850) alongside originals from colonial and American newspapers, where relevant. - 443
18 Mar 2021 12:52 UTC pablogalvezprojectosdaw/​scissorsandpaste-​master - - 443
30 May 2020 05:30 UTC cligs/​projects The CLiGS group's repository for code and data related to specific talks or publications. fra 441
01 Oct 2021 16:59 UTC dig-​eg-​gaz/​advertisements images and xml text of ads used in Egyptian Gazette fra 439
30 Jun 2021 12:58 UTC Alex-​bzh/​corpus-​kaamelott Corpus of screenplays from TV show Kaamelott - 433
15 Mar 2026 20:16 UTC ADHO/​dh2016 Abstracts from the DH2016 conference in Kraków. - 431
30 May 2020 05:30 UTC fbkarsdorp/​story-​network-​data Data accompanying the paper on story networks - 427
21 May 2021 13:05 UTC lascivaroma/​digiliblt Capitains version of DigilibLT data lat 426
26 Jun 2022 15:45 UTC scta-​texts/​n3av8a - lat 424
06 Aug 2021 04:49 UTC tnhaider/​metrical-​tagging-​in-​the-​wild - eng 419
30 May 2020 05:30 UTC jhu-​digital-​manuscripts/​rosademo Backend services for annotation interop demo - 415
28 Jun 2022 13:23 UTC WoPoss-​project/​source_​texts Works being curated prior to corpus creation lat, grc, eng, deu, fra, ita 415
26 Mar 2023 17:45 UTC livingstoneonline/​onemorevoice This is the repository for One More Voice. One More Voice is a digital humanities recovery project that identifies, documents, and critically engages with the voices of racialized creators in British imperial and colonial archives. The voices take multiple forms and appear in multiple genres. Our project seeks to introduce these rich and diverse materials to broad academic and public audiences. Recourse to the voices promises to transform our understanding of imperial and colonial history and literature while foregrounding perspectives that scholarship in majority has hitherto overlooked or silenced. eng, und, grc, tsn, swh, lat 415
03 Apr 2023 23:46 UTC whitmanarchive/​whitman-​manuscripts Data Repo | Whitman Manuscripts TEI - 412
30 May 2020 05:30 UTC Clara-​Kloster/​Guldkorpus - - 411
30 May 2020 05:30 UTC chriswolfram/​ComputationalDiaries Computational Editions of the Astronomical Diaries akk 408
09 Jun 2025 11:58 UTC leoba/​TEI-​2-​IIIF XSLT for converting TEI MsDescription to IIIF manifests lat 402