TEIhub
Discover TEI-encoded documents from GitHub public repositories.

Last indexed Repository Description Languages Matching files
26 Mar 2026 06:18 UTC daedstudios/​read-​papers-​fast - eng 1
30 May 2020 05:29 UTC PonteIneptique/​tei-​conversion-​tools - eng 1
30 May 2020 05:29 UTC PerseusDL/​tei-​conversion-​tools Tools for TEI Conversions eng 1
12 Jun 2022 08:45 UTC GutenbergSource/​68201-​Rossiter-​Indian-​legends TEI source file of Harriet Rossiter: Indian legends from the land of Al-ay-ek-sa eng 1
28 Mar 2021 12:58 UTC sul-​dlss-​labs/​spoc Species Occurrences (SpOc), documentation available at https://sul-dlss-labs.github.io/spoc/ eng 1
05 Oct 2020 12:37 UTC ejmbrauchler/​BullingerWebApp - eng 3
30 May 2020 05:29 UTC pbexe/​syncref A tool to collaboratively synchronise research eng 1
30 May 2020 05:29 UTC waingram/​bamboo Fedora ingester for TCP content eng 2
30 May 2020 05:29 UTC puthurr/​tika Enhanced Tika version the handling embedded pictures better in PDF and Office documents eng 1
19 Sep 2022 08:03 UTC junemu/​QV Queens' Vernacular eng 2
30 May 2020 05:29 UTC wrt2dc/​fourteen Fourteen-line poems in TEI markup eng 3
30 May 2020 05:29 UTC kermitt2/​grobid-​astro A machine learning software for extracting astronomical entities from scholarly documents eng 1
30 May 2020 05:30 UTC petermr/​normami Merger of Norma and Ami eng 9
23 Aug 2022 17:47 UTC jeddobson/​ENGL64.​05-​22F Repository for ENGL 64.05/QSS 30.16 Cultural Analytics (Fall 2022) at Dartmouth College eng 82
30 May 2020 05:29 UTC philipakash/​lucius-​c-​smith-​diaries Automatically exported from code.google.com/p/lucius-c-smith-diaries eng 1
30 May 2020 05:30 UTC peterverhaar/​bdms bdms files eng 6
27 Jul 2020 16:31 UTC saarku/​fig-​explorer FigExplorer: A System for Retrieval and Exploration of Figures from Collections of Research Articles eng 2
30 May 2020 05:30 UTC aviamble/​TestAutomation - eng 565
30 Aug 2021 04:49 UTC CDRH/​data_​civilwardc Data Repository for Civil War Washington eng 3778
18 Sep 2023 13:50 UTC BalasubramanyamEvani/​anlp-​p2 anlp p2 Scientific NER eng 7
30 May 2020 05:30 UTC internetarchive/​sandcrawler Backend, IA-specific tools for crawling and processing the scholarly web. Content ends up in https://fatcat.wiki eng 4
23 Aug 2022 06:11 UTC jdmartin/​eltec-​text-​splitter Chunk English Novels Into Chapters eng 35
30 May 2020 05:29 UTC kkalouli/​CoUSBi The Corpus of US Bills eng 1
30 May 2020 05:30 UTC demery/​kalendar - eng 27
30 May 2020 05:30 UTC ahunker/​Hamilton-​Project Visit our project site here: http://hamilton.newtfire.org eng 237
01 Sep 2020 08:32 UTC Saranoja/​AdvancedProgramming Java Labs work for the Advanced Programming course of the 2nd year (CS) 2019-2020 eng 1
30 May 2020 05:30 UTC ericleasemorgan/​tei-​toolbox A set of scripts used to first create TEI files (with BBEdit), parse TEI files, and finally do simple analysis against the result eng 55
24 Mar 2023 16:53 UTC BeggarsOpera/​test test site for beggars opera gatsby site eng 4
30 May 2020 05:29 UTC Edirom/​Bargheer-​EdiromOnline Edirom Online application for the edition of the Bargheer Fiedellieder eng 5
16 Mar 2021 12:52 UTC kishkash555/​NLP-​for-​TA - eng 106
14 Sep 2020 16:32 UTC JonathanReeve/​jonreeve.​com My Personal Website eng 2
21 Oct 2021 08:41 UTC agile-​humanities/​ddhi-​aggregator - eng 6
30 May 2020 05:30 UTC IATH-​UVA/​uva-​lsi - eng 69
30 May 2020 05:30 UTC wolfgangmm/​tei-​simple-​pm An implementation of the TEI Simple ODD extensions for processing models in XQuery. eng 10
30 May 2020 05:30 UTC petermr/​climate OpenAccess papers mined for Climate Change eng 9
17 May 2021 08:48 UTC drevicko/​MeandreComponentFoundry Components for Meandre, a data-driven workflow tool by SEASR eng 1
30 May 2020 05:30 UTC charlietaylor98/​vangogh-​gang - eng 703
30 May 2020 05:30 UTC digicavendish/​xml-​transcripts-​EEBO-​TCP-​WilliamCavendish XML of William Cavendish's works created by the Text Creation Partnership and Early English Books Online (EEBO-TCP) eng 8
30 May 2020 05:29 UTC YU-​NLPLab/​DeepMet - eng 3
16 Mar 2026 00:15 UTC porchedduf/​Alchemy A digital edition of "De consideratione quinta essentia" in the Denison University manuscript. eng 1
30 May 2020 05:30 UTC IBM/​science-​result-​extractor - eng 346
30 May 2020 05:30 UTC elifesciences/​sciencebeam-​judge XML Conversion Evaluation eng 10
11 Apr 2022 22:43 UTC GutenbergSource/​67803-​Schneider-​Philippine-​Baptismal-​Names TEI source file of E. E. Schneider: A List of Philippine Baptismal Names. eng 1
30 May 2020 05:30 UTC KieranMigaku/​english-​sentence-​bank Create an english sentence bank from the british corpus eng 183
01 Oct 2020 20:32 UTC internetarchive/​fatcat-​scholar search interface for scholarly works eng 1
29 Jun 2021 20:36 UTC Kabongosalomon/​task-​dataset-​metric-​nli-​extraction This program produces the test data for classification over a set of predefined task#dataset#metrics#software labels. Given input a pdf file, it scrapes the text from the file using the Grobid parser, subsequently generating the test data file for input to the neural network classifier. eng 959
30 May 2020 05:30 UTC waynegraham/​cbw - eng 13
30 May 2020 05:29 UTC yoonlee95/​pdf_​extraction_​framework_​test - eng 12
05 Aug 2021 08:40 UTC ebeshero/​pacific repository for the Digital Archives and Pacific Cultures project eng 29
16 May 2021 08:47 UTC apache/​tika The Apache Tika toolkit detects and extracts metadata and text from over a thousand different file types (such as PPT, XLS, and PDF). eng 2