TEIhub
Discover TEI-encoded documents from GitHub public repositories.

Last indexed Repository Description Languages Matching files
08 Jun 2022 05:41 UTC giannetti/​tei-​exercise a classroom exercise that uses the Firefox XSLT processor for an HTML preview - 1
30 May 2020 05:30 UTC nilsreiter/​generic-​xml-​reader A class to read in arbitrary XML content (including TEI) into UIMA, translating some structural annotation to stand off - 2
30 May 2020 05:29 UTC allenai/​citeomatic A citation recommendation system that allows users to find relevant citations for their paper drafts. The tool is backed by Semantic Scholar's OpenCorpus dataset. eng 2
30 May 2020 05:29 UTC dh-​nuigalway/​Personae A Character-Visualisation Tool for Dramatic Texts eng, grc, ell, grk, lat, ita, spa, fra, deu, nld 2
12 Nov 2020 20:32 UTC helen189/​Laurence-​Sterne-​and-​Sterneana A Cambridge Digital Library Project fra, eng 113
11 Jun 2021 17:11 UTC Jean-​Baptiste-​Camps/​ALTEI a bunch of scripts to manipulate ALTO and XML/TEI xno 1
30 May 2020 05:29 UTC acdh-​oeaw/​howto A blog application which uses GitHub as data storage - 1
08 Dec 2022 15:45 UTC gipplab/​pdf-​benchmark A Benchmark of PDF Information Extraction Tools using a Multi-Task and Multi-Domain Evaluation Framework for Academic Documents eng 5
30 May 2020 05:30 UTC oelkapmis/​NLP-​-​-​Bigrams-​and-​Trigrams A basic python code to determine bigrams and trigrams from corpus via NLTK libraries - 1
30 May 2020 05:30 UTC markpbaggett/​base_​scout_​apps A base tei publisher app for all our SCOUT TEI migrations - 2
30 May 2020 05:29 UTC arjanski/​ceteicean-​test A bare-bones Vue 2.6 project with CETEIcean library for testing purposes - 1
30 May 2020 05:29 UTC arjanski/​ceteicean-​test-​nuxt A bare-bones Nuxt.js 2.11.0 project with CETEIcean.js library - 1
30 May 2020 05:30 UTC mcytron/​XSLT-​XQUERY A backup of in-progress work on generating a TEI placeography - 45
22 Jul 2022 15:49 UTC acdh-​oeaw/​apis-​tei-​dumper A async crawler to dump APIS-Entities as TEI - 1
30 May 2020 05:30 UTC 84000/​data 84000 XML data files bod, san, zho, pli, eng, lat, jpn 4442
28 Mar 2023 20:45 UTC 84000/​exist-​apps 84000 eXist-db apps - 1
30 May 2020 05:29 UTC AlexT224/​BDH-​Modeling 610 activity with diamondback eng 3
23 Aug 2021 08:40 UTC janhorstmannn/​goethe-​prose-​drama 60 fictional texts by Johann Wolfgang Goethe in TXT format, prose, dramas, as well as fragments (sources: http://www.deutschestextarchiv.de and http://goethe.chadwyck.com) as well as annotations on irony and renunciation in XML, CSV, JSON. - 2
03 Feb 2022 20:36 UTC fkettelhoit/​wittgenstein-​nachlass-​xml 5000 pages of Wittgenstein's Nachlass as XML deu, eng 19
28 Feb 2023 03:47 UTC youngmin30/​2023visionpythonpycharm 2023visionpythonpycharm kor 1
30 May 2020 05:29 UTC DHNamedEntities/​19thCenturyFrenchNovels 19th century French novels manually annotated on named entities (places and fictional characters) - 3
22 May 2022 23:44 UTC IMAGO-​Catalogues-​Jjanes/​TEIcatalogs 19th and 20th exhibition catalogs in TEI and csv fra 6
07 Sep 2021 08:40 UTC Juliettejns/​TEIcatalogs 19th and 20th exhibition catalogs in TEI and csv fra 6
30 May 2020 05:30 UTC tkhagan/​dig_​eg_​official 1905-01-16 to 1905-01-21 - 6
17 Apr 2023 07:45 UTC e-​ditiones/​CORPUS17 17th c. French texts corpus. fra 30
30 May 2020 05:30 UTC CDRH/​cocoon_​theology 14th Century Oxford Theology Online - 43
10 Sep 2020 08:32 UTC Joelpie/​my_​repo_​110 110 repo - 1
30 May 2020 05:30 UTC oriflamms/​Dated-​and-​Datable-​Manuscripts_​LIRIS 102 documents with text and image aligned with learning-free techniques - 293
30 May 2020 05:30 UTC oriflamms/​Dated-​and-​Datable-​Manuscripts_​AI2A 101 documents with text and image aligned with Hidden Markov Models - 292
30 Nov 2020 12:42 UTC studio-​arrenberg/​engels-​briefe 📜 Interface für die Engels Ausstellung - 2
20 Jun 2021 04:52 UTC paavomare/​Bruckner-​Study-​Book-​Viewer 🎼 Digital music analysis with MEI on the example of Anton Bruckner's compositional studies - 1
18 May 2024 23:50 UTC rism-​digital/​verovio 🎵 Music notation engraving library for MEI with MusicXML and Humdrum support and various toolkits (JavaScript, Python) eng 2
30 May 2020 05:30 UTC paddymcall/​SARIT *Old* repository of the SARIT corpus san, eng 45
19 Jan 2021 13:26 UTC pelagios/​peripleo ***deprecated*** A search engine for the Pelagios universe, with a comprehensive JSON API. - 2
09 Feb 2021 04:41 UTC KU-​ORCAS/​manyoshuTEI 『廣瀬本万葉集』翻刻&TEI化プロジェクトは、関西大学アジア・オープン・リサーチセンター(KU-ORCAS)の研究ユニット4「古典籍の情報資源化プロジェクト」が進めている研究成果の一部です。 jpn 1
30 May 2020 05:30 UTC ccl0326/​nltk_​data [py] nltk.download() pol, eng 508
09 Mar 2021 12:52 UTC MyCoRe-​Org/​documentation [Deprecated] Homepage of MyCoRe community deu 5
30 May 2020 05:30 UTC jmolina116/​latin-​author-​identifier [December 2016] Class Project: Latin Authorship Identification final project for a Machine Learning class. eng, lat, ita, fra, deu, spa 366
25 Feb 2021 08:46 UTC textcreationpartnership (all repos) (textcreationpartnership uses one repository per text. To make this table smaller they have been aggregated into one entry) eng 39344
30 May 2020 05:29 UTC fcrepo3/​fcrepo-​historical (Archived - No longer maintained) Historical archive of early fcrepo code (everything pre-3.3) - 1
30 May 2020 05:29 UTC fcrepo3/​fcrepo-​before33 (Archived - No longer maintained) Fedora Commons Repository Service (Historic; This repo is > 300MB) - 1
30 May 2020 05:29 UTC fcrepo3/​fcrepo (Archived - No longer maintained) Fedora Commons Repository Service - 1
30 May 2020 05:29 UTC eeditiones/​workshop «Stay Home Learn TEI Publisher From Scratch» Online Workshop deu, fra 10
24 Jul 2022 22:45 UTC clirdlf/​old.​diglib.​org "Old" diblib site that was not ported eng 39
19 Apr 2021 01:44 UTC nishkalavallabhi/​LING410X-​Spring18 "Language as Data" course materials - 86
30 May 2020 05:30 UTC cetceeve/​ExploreDH "DH is the Study of dead Dudes" - mood - 139
30 May 2020 05:30 UTC hartwork/​rnv :tropical_fish: Relax NG Compact Syntax validator by David Tolpin; official upstream maintenance repository - 3
21 Oct 2021 20:38 UTC wellcomecollection/​catalogue-​pipeline :oil_drum: The data pipeline services extracting & transforming data from our museum and collections. ara, bbc, msa, eng, grc, san 9
30 May 2020 05:29 UTC conditor-​project/​co-​formatter :factory: module d'extraction du chapeau conditor fra, eng 5
30 May 2020 05:29 UTC cran/​stemmatology :exclamation: This is a read-only mirror of the CRAN R package repository. stemmatology — Stemmatological Analysis of Textual Traditions. Homepage: https://github.com/Jean-Baptiste-Camps/stemmatology Report bugs for this package: https://github.com/Jean-Baptiste-Camps/stemmatology/issues - 2