Discover TEI-encoded documents in public GitHub repositories.

| Last indexed | Repository | Description | Languages | Matching files |
|---|---|---|---|---|
| 26 Mar 2026 06:18 UTC | daedstudios/read-papers-fast | - | eng | 1 |
| 30 May 2020 05:29 UTC | PonteIneptique/tei-conversion-tools | - | eng | 1 |
| 30 May 2020 05:29 UTC | PerseusDL/tei-conversion-tools | Tools for TEI Conversions | eng | 1 |
| 12 Jun 2022 08:45 UTC | GutenbergSource/68201-Rossiter-Indian-legends | TEI source file of Harriet Rossiter: Indian legends from the land of Al-ay-ek-sa | eng | 1 |
| 28 Mar 2021 12:58 UTC | sul-dlss-labs/spoc | Species Occurrences (SpOc), documentation available at https://sul-dlss-labs.github.io/spoc/ | eng | 1 |
| 05 Oct 2020 12:37 UTC | ejmbrauchler/BullingerWebApp | - | eng | 3 |
| 30 May 2020 05:29 UTC | pbexe/syncref | A tool to collaboratively synchronise research | eng | 1 |
| 30 May 2020 05:29 UTC | waingram/bamboo | Fedora ingester for TCP content | eng | 2 |
| 30 May 2020 05:29 UTC | puthurr/tika | Enhanced Tika version the handling embedded pictures better in PDF and Office documents | eng | 1 |
| 19 Sep 2022 08:03 UTC | junemu/QV | Queens' Vernacular | eng | 2 |
| 30 May 2020 05:29 UTC | wrt2dc/fourteen | Fourteen-line poems in TEI markup | eng | 3 |
| 30 May 2020 05:29 UTC | kermitt2/grobid-astro | A machine learning software for extracting astronomical entities from scholarly documents | eng | 1 |
| 30 May 2020 05:30 UTC | petermr/normami | Merger of Norma and Ami | eng | 9 |
| 23 Aug 2022 17:47 UTC | jeddobson/ENGL64.05-22F | Repository for ENGL 64.05/QSS 30.16 Cultural Analytics (Fall 2022) at Dartmouth College | eng | 82 |
| 30 May 2020 05:29 UTC | philipakash/lucius-c-smith-diaries | Automatically exported from code.google.com/p/lucius-c-smith-diaries | eng | 1 |
| 30 May 2020 05:30 UTC | peterverhaar/bdms | bdms files | eng | 6 |
| 27 Jul 2020 16:31 UTC | saarku/fig-explorer | FigExplorer: A System for Retrieval and Exploration of Figures from Collections of Research Articles | eng | 2 |
| 30 May 2020 05:30 UTC | aviamble/TestAutomation | - | eng | 565 |
| 30 Aug 2021 04:49 UTC | CDRH/data_civilwardc | Data Repository for Civil War Washington | eng | 3778 |
| 18 Sep 2023 13:50 UTC | BalasubramanyamEvani/anlp-p2 | anlp p2 Scientific NER | eng | 7 |
| 30 May 2020 05:30 UTC | internetarchive/sandcrawler | Backend, IA-specific tools for crawling and processing the scholarly web. Content ends up in https://fatcat.wiki | eng | 4 |
| 23 Aug 2022 06:11 UTC | jdmartin/eltec-text-splitter | Chunk English Novels Into Chapters | eng | 35 |
| 30 May 2020 05:29 UTC | kkalouli/CoUSBi | The Corpus of US Bills | eng | 1 |
| 30 May 2020 05:30 UTC | demery/kalendar | - | eng | 27 |
| 30 May 2020 05:30 UTC | ahunker/Hamilton-Project | Visit our project site here: http://hamilton.newtfire.org | eng | 237 |
| 01 Sep 2020 08:32 UTC | Saranoja/AdvancedProgramming | Java Labs work for the Advanced Programming course of the 2nd year (CS) 2019-2020 | eng | 1 |
| 30 May 2020 05:30 UTC | ericleasemorgan/tei-toolbox | A set of scripts used to first create TEI files (with BBEdit), parse TEI files, and finally do simple analysis against the result | eng | 55 |
| 24 Mar 2023 16:53 UTC | BeggarsOpera/test | test site for beggars opera gatsby site | eng | 4 |
| 30 May 2020 05:29 UTC | Edirom/Bargheer-EdiromOnline | Edirom Online application for the edition of the Bargheer Fiedellieder | eng | 5 |
| 16 Mar 2021 12:52 UTC | kishkash555/NLP-for-TA | - | eng | 106 |
| 14 Sep 2020 16:32 UTC | JonathanReeve/jonreeve.com | My Personal Website | eng | 2 |
| 21 Oct 2021 08:41 UTC | agile-humanities/ddhi-aggregator | - | eng | 6 |
| 30 May 2020 05:30 UTC | IATH-UVA/uva-lsi | - | eng | 69 |
| 30 May 2020 05:30 UTC | wolfgangmm/tei-simple-pm | An implementation of the TEI Simple ODD extensions for processing models in XQuery. | eng | 10 |
| 30 May 2020 05:30 UTC | petermr/climate | OpenAccess papers mined for Climate Change | eng | 9 |
| 17 May 2021 08:48 UTC | drevicko/MeandreComponentFoundry | Components for Meandre, a data-driven workflow tool by SEASR | eng | 1 |
| 30 May 2020 05:30 UTC | charlietaylor98/vangogh-gang | - | eng | 703 |
| 30 May 2020 05:30 UTC | digicavendish/xml-transcripts-EEBO-TCP-WilliamCavendish | XML of William Cavendish's works created by the Text Creation Partnership and Early English Books Online (EEBO-TCP) | eng | 8 |
| 30 May 2020 05:29 UTC | YU-NLPLab/DeepMet | - | eng | 3 |
| 16 Mar 2026 00:15 UTC | porchedduf/Alchemy | A digital edition of "De consideratione quinta essentia" in the Denison University manuscript. | eng | 1 |
| 30 May 2020 05:30 UTC | IBM/science-result-extractor | - | eng | 346 |
| 30 May 2020 05:30 UTC | elifesciences/sciencebeam-judge | XML Conversion Evaluation | eng | 10 |
| 11 Apr 2022 22:43 UTC | GutenbergSource/67803-Schneider-Philippine-Baptismal-Names | TEI source file of E. E. Schneider: A List of Philippine Baptismal Names. | eng | 1 |
| 30 May 2020 05:30 UTC | KieranMigaku/english-sentence-bank | Create an english sentence bank from the british corpus | eng | 183 |
| 01 Oct 2020 20:32 UTC | internetarchive/fatcat-scholar | search interface for scholarly works | eng | 1 |
| 29 Jun 2021 20:36 UTC | Kabongosalomon/task-dataset-metric-nli-extraction | This program produces the test data for classification over a set of predefined task#dataset#metrics#software labels. Given input a pdf file, it scrapes the text from the file using the Grobid parser, subsequently generating the test data file for input to the neural network classifier. | eng | 959 |
| 30 May 2020 05:30 UTC | waynegraham/cbw | - | eng | 13 |
| 30 May 2020 05:29 UTC | yoonlee95/pdf_extraction_framework_test | - | eng | 12 |
| 05 Aug 2021 08:40 UTC | ebeshero/pacific | repository for the Digital Archives and Pacific Cultures project | eng | 29 |
| 16 May 2021 08:47 UTC | apache/tika | The Apache Tika toolkit detects and extracts metadata and text from over a thousand different file types (such as PPT, XLS, and PDF). | eng | 2 |