Discover TEI-encoded documents from GitHub public repositories.
| Last indexed | Repository | Description | Languages | Matching files |
|---|---|---|---|---|
| 23 May 2022 08:51 UTC | hawc2/gatsby_bo | - | eng | 6 |
| 30 May 2020 05:29 UTC | cindywu/GROBID-verification | manually verifying GROBID's accuracy | eng | 1 |
| 30 May 2020 05:30 UTC | swift-poems-project/swift-transcripts | Transcripts for the Swift Poems Project | eng | 7835 |
| 30 May 2020 05:29 UTC | GutenbergSource/55948-London-The-Abysmal-Brute | TEI master file of Jack London (1876–1916): The Abysmal Brute. | eng | 1 |
| 30 May 2020 05:30 UTC | demery/csvify_tei | - | eng | 544 |
| 29 Dec 2022 14:44 UTC | Atypon-OpenSource/manuscripts-manuscript-transform | - | eng | 1 |
| 29 Dec 2022 13:43 UTC | Atypon-OpenSource/pressroom-js | - | eng | 1 |
| 30 May 2020 05:29 UTC | Sebastian1984/tika | - | eng | 1 |
| 30 May 2020 05:29 UTC | GutenbergSource/48908-Pratt-Chadwick-Legends-of-Norseland | TEI master file of Mara Louise Pratt-Chadwick: Legends of Norseland. | eng | 1 |
| 01 Feb 2021 17:22 UTC | lfoppiano/SuperMat | Superconductors material dataset | eng | 129 |
| 13 Oct 2020 08:35 UTC | kanripox/Laozi | - | eng | 13 |
| 23 Aug 2021 16:59 UTC | kermitt2/grobid_client_python | Python client for GROBID Web services | eng | 5 |
| 30 May 2020 05:29 UTC | Ximenaflores/Ximena-tei-test | - | eng | 1 |
| 30 May 2020 05:29 UTC | jojo2234/LinComp | Computational linguistics project for university. The file named readme is written in Italian. | eng | 2 |
| 28 Feb 2023 18:50 UTC | Ang3licaValdo/AIandOpenScienceInResearchSoftwareEngineering | Repository for Artificial Intelligence and Open Science In Research Software Engineering deliverables. | eng | 52 |
| 06 Dec 2022 15:46 UTC | VaCoArg/Grupo1 | Actividad de Minimal Computing | eng | 4 |
| 01 Dec 2022 23:45 UTC | vharvay/Busa-s-Enthusiasts | - | eng | 16 |
| 30 Aug 2022 06:41 UTC | ieg-dhr/DigitaleEditorikDMGK | Daten und Lehrmaterial aus dem Modul "Digitale Editorik Historischer Quellen" im DMGK Studiengang Mainz | eng | 7 |
| 01 Mar 2023 03:19 UTC | adsabs/pdfie-training-data | ADS PDF information extraction training data | eng | 8 |
| 30 May 2020 05:30 UTC | amitgayar/bert_hr | Pdfs are parsed by Grobid java utility using python wrapper and results are fed to the custom trained BERT model for predictions. | eng | 1 |
| 22 Aug 2022 15:51 UTC | vikhil0609/vikhil_grobid_main | - | eng | 199 |
| 18 Apr 2023 21:46 UTC | Atlas1225/OpenData | - | eng | 10 |
| 30 May 2020 05:30 UTC | tnhaider/english-gutenberg-poetry | English Poetry Corpus mined with GutenTag | eng | 1426 |
| 13 Aug 2021 08:40 UTC | lfoppiano/grobid-superconductors | Grobid module for superconductor material and properties extraction | eng | 25 |
| 07 Aug 2022 23:44 UTC | vikhil0609/grobid_test | - | eng | 48 |
| 24 Jul 2022 22:45 UTC | marcoparker/Data-Science | - | eng | 1 |
| 20 Jan 2023 09:45 UTC | lulman/anderson-letters | Source files for anderson letters | eng | 1 |
| 29 Aug 2022 06:36 UTC | vikhil0609/grobidTesting | - | eng | 293 |
| 30 May 2020 05:29 UTC | ContentMine/cm-ucl | A repository to openly track progress on table extraction. | eng | 2 |
| 14 Jan 2023 17:43 UTC | LiteratureInContext/LiC-data | XML data storage site for Literature in Context. Staging on development branch (http://anthologydev.lib.virginia.edu) and production on master branch (http://anthology.lib.virginia.edu). Images are hosted on AWS. | eng | 144 |
| 25 Feb 2021 08:46 UTC | textcreationpartnership (all repos) | (textcreationpartnership uses one repository per text. To make this table smaller they have been aggregated into one entry) | eng | 39344 |
| 30 May 2020 05:29 UTC | awisnicki/islandora_drupal_subsite_livingstone | - | eng | 1 |
| 30 May 2020 05:30 UTC | leoba/mesa | MESA RDF generation files | eng | 297 |
| 15 Dec 2020 01:31 UTC | tnhaider/epg64-english-poetry-annotated | - | eng | 22 |
| 30 May 2020 05:29 UTC | GutenbergSource/10928-Devi-Bengal-Dacoits-and-Tigers | TEI master file of Sunity Devi (1864–1932): Bengal Dacoits and Tigers. | eng | 1 |
| 30 May 2020 05:29 UTC | jeschollaert/Diamondback-Encoding | - | eng | 2 |
| 30 May 2020 05:29 UTC | curationexperts/tufts_models | Hydra models for Tufts | eng | 4 |
| 30 May 2020 05:30 UTC | lb42/BVH | Bibliotheques Virtuels des Humanistes, CESR, Tours | eng | 38 |
| 11 Feb 2021 01:15 UTC | gethsun1/ethiopia_data | - | eng | 1 |
| 29 Apr 2021 08:46 UTC | jieyanzhu/hacking-the-archive | - | eng | 1 |
| 13 Mar 2021 20:39 UTC | lb42/guyMemorial | Sources for guy Memorabilia | eng | 3 |
| 30 May 2020 05:29 UTC | lulman/stephens-letters | Automatically exported from code.google.com/p/stephens-letters | eng | 1 |
| 30 May 2020 05:29 UTC | ajithlal1992/vitalopensource | Automatically exported from code.google.com/p/vitalopensource | eng | 2 |
| 15 Dec 2022 10:48 UTC | Machine-Learning-Pipelines/repro-screener | - | eng | 99 |
| 30 May 2020 05:29 UTC | Ashish74/vitalopensource | Automatically exported from code.google.com/p/vitalopensource | eng | 2 |
| 08 Jan 2025 21:52 UTC | KislakCenter/VisColl | Modeling and visualizing physical manuscript collation | eng | 3623 |
| 07 Oct 2022 23:56 UTC | giladghgh/Zipfs-Law | A layman's introduction to Zipf's Law through computational linguistics. | eng | 183 |
| 03 Apr 2023 02:51 UTC | marinettevolte/projet-hn4 | - | eng | 8 |
| 30 May 2020 05:29 UTC | scta/simple-tei-edition | - | eng | 2 |
| 15 Apr 2021 12:58 UTC | joeytakeda/xml-validate-action | RNG Validation action | eng | 7 |