TEIhub
Discover TEI-encoded documents from GitHub public repositories.

| Last indexed | Repository | Description | Languages | Matching files |
| --- | --- | --- | --- | --- |
| 23 May 2022 08:51 UTC | hawc2/gatsby_bo | - | eng | 6 |
| 30 May 2020 05:29 UTC | cindywu/GROBID-verification | manually verifying GROBID's accuracy | eng | 1 |
| 30 May 2020 05:30 UTC | swift-poems-project/swift-transcripts | Transcripts for the Swift Poems Project | eng | 7835 |
| 30 May 2020 05:29 UTC | GutenbergSource/55948-London-The-Abysmal-Brute | TEI master file of Jack London (1876–1916): The Abysmal Brute. | eng | 1 |
| 30 May 2020 05:30 UTC | demery/csvify_tei | - | eng | 544 |
| 29 Dec 2022 14:44 UTC | Atypon-OpenSource/manuscripts-manuscript-transform | - | eng | 1 |
| 29 Dec 2022 13:43 UTC | Atypon-OpenSource/pressroom-js | - | eng | 1 |
| 30 May 2020 05:29 UTC | Sebastian1984/tika | - | eng | 1 |
| 30 May 2020 05:29 UTC | GutenbergSource/48908-Pratt-Chadwick-Legends-of-Norseland | TEI master file of Mara Louise Pratt-Chadwick: Legends of Norseland. | eng | 1 |
| 01 Feb 2021 17:22 UTC | lfoppiano/SuperMat | Superconductors material dataset | eng | 129 |
| 13 Oct 2020 08:35 UTC | kanripox/Laozi | - | eng | 13 |
| 23 Aug 2021 16:59 UTC | kermitt2/grobid_client_python | Python client for GROBID Web services | eng | 5 |
| 30 May 2020 05:29 UTC | Ximenaflores/Ximena-tei-test | - | eng | 1 |
| 30 May 2020 05:29 UTC | jojo2234/LinComp | Computational linguistics project for university. The file named readme is written in Italian. | eng | 2 |
| 28 Feb 2023 18:50 UTC | Ang3licaValdo/AIandOpenScienceInResearchSoftwareEngineering | Repository for Artificial Intelligence and Open Science In Research Software Engineering deliverables. | eng | 52 |
| 06 Dec 2022 15:46 UTC | VaCoArg/Grupo1 | Actividad de Minimal Computing | eng | 4 |
| 01 Dec 2022 23:45 UTC | vharvay/Busa-s-Enthusiasts | - | eng | 16 |
| 30 Aug 2022 06:41 UTC | ieg-dhr/DigitaleEditorikDMGK | Daten und Lehrmaterial aus dem Modul "Digitale Editorik Historischer Quellen" im DMGK Studiengang Mainz | eng | 7 |
| 01 Mar 2023 03:19 UTC | adsabs/pdfie-training-data | ADS PDF information extraction training data | eng | 8 |
| 30 May 2020 05:30 UTC | amitgayar/bert_hr | Pdfs are parsed by Grobid java utility using python wrapper and results are fed to the custom trained BERT model for predictions. | eng | 1 |
| 22 Aug 2022 15:51 UTC | vikhil0609/vikhil_grobid_main | - | eng | 199 |
| 18 Apr 2023 21:46 UTC | Atlas1225/OpenData | - | eng | 10 |
| 30 May 2020 05:30 UTC | tnhaider/english-gutenberg-poetry | English Poetry Corpus mined with GutenTag | eng | 1426 |
| 13 Aug 2021 08:40 UTC | lfoppiano/grobid-superconductors | Grobid module for superconductor material and properties extraction | eng | 25 |
| 07 Aug 2022 23:44 UTC | vikhil0609/grobid_test | - | eng | 48 |
| 24 Jul 2022 22:45 UTC | marcoparker/Data-Science | - | eng | 1 |
| 20 Jan 2023 09:45 UTC | lulman/anderson-letters | Source files for anderson letters | eng | 1 |
| 29 Aug 2022 06:36 UTC | vikhil0609/grobidTesting | - | eng | 293 |
| 30 May 2020 05:29 UTC | ContentMine/cm-ucl | A repository to openly track progress on table extraction. | eng | 2 |
| 14 Jan 2023 17:43 UTC | LiteratureInContext/LiC-data | XML data storage site for Literature in Context. Staging on development branch (http://anthologydev.lib.virginia.edu) and production on master branch (http://anthology.lib.virginia.edu). Images are hosted on AWS. | eng | 144 |
| 25 Feb 2021 08:46 UTC | textcreationpartnership (all repos) | (textcreationpartnership uses one repository per text. To make this table smaller they have been aggregated into one entry) | eng | 39344 |
| 30 May 2020 05:29 UTC | awisnicki/islandora_drupal_subsite_livingstone | - | eng | 1 |
| 30 May 2020 05:30 UTC | leoba/mesa | MESA RDF generation files | eng | 297 |
| 15 Dec 2020 01:31 UTC | tnhaider/epg64-english-poetry-annotated | - | eng | 22 |
| 30 May 2020 05:29 UTC | GutenbergSource/10928-Devi-Bengal-Dacoits-and-Tigers | TEI master file of Sunity Devi (1864–1932): Bengal Dacoits and Tigers. | eng | 1 |
| 30 May 2020 05:29 UTC | jeschollaert/Diamondback-Encoding | - | eng | 2 |
| 30 May 2020 05:29 UTC | curationexperts/tufts_models | Hydra models for Tufts | eng | 4 |
| 30 May 2020 05:30 UTC | lb42/BVH | Bibliotheques Virtuels des Humanistes, CESR, Tours | eng | 38 |
| 11 Feb 2021 01:15 UTC | gethsun1/ethiopia_data | - | eng | 1 |
| 29 Apr 2021 08:46 UTC | jieyanzhu/hacking-the-archive | - | eng | 1 |
| 13 Mar 2021 20:39 UTC | lb42/guyMemorial | Sources for guy Memorabilia | eng | 3 |
| 30 May 2020 05:29 UTC | lulman/stephens-letters | Automatically exported from code.google.com/p/stephens-letters | eng | 1 |
| 30 May 2020 05:29 UTC | ajithlal1992/vitalopensource | Automatically exported from code.google.com/p/vitalopensource | eng | 2 |
| 15 Dec 2022 10:48 UTC | Machine-Learning-Pipelines/repro-screener | - | eng | 99 |
| 30 May 2020 05:29 UTC | Ashish74/vitalopensource | Automatically exported from code.google.com/p/vitalopensource | eng | 2 |
| 08 Jan 2025 21:52 UTC | KislakCenter/VisColl | Modeling and visualizing physical manuscript collation | eng | 3623 |
| 07 Oct 2022 23:56 UTC | giladghgh/Zipfs-Law | A layman's introduction to Zipf's Law through computational linguistics. | eng | 183 |
| 03 Apr 2023 02:51 UTC | marinettevolte/projet-hn4 | - | eng | 8 |
| 30 May 2020 05:29 UTC | scta/simple-tei-edition | - | eng | 2 |
| 15 Apr 2021 12:58 UTC | joeytakeda/xml-validate-action | RNG Validation action | eng | 7 |