Discover TEI-encoded documents from GitHub public repositories.
| Last indexed | Repository | Description | Languages | Matching files |
|---|---|---|---|---|
| 26 Jun 2022 15:45 UTC | scta-texts/n3av8a | - | lat | 424 |
| 21 May 2021 13:05 UTC | lascivaroma/digiliblt | Capitains version of DigilibLT data | lat | 426 |
| 30 May 2020 05:30 UTC | fbkarsdorp/story-network-data | Data accompanying the paper on story networks | - | 427 |
| 14 Apr 2026 12:00 UTC | ADHO/dh2016 | Abstracts from the DH2016 conference in Kraków. | - | 431 |
| 30 Jun 2021 12:58 UTC | Alex-bzh/corpus-kaamelott | Corpus of screenplays from TV show Kaamelott | - | 433 |
| 01 Oct 2021 16:59 UTC | dig-eg-gaz/advertisements | images and xml text of ads used in Egyptian Gazette | fra | 439 |
| 30 May 2020 05:30 UTC | cligs/projects | The CLiGS group's repository for code and data related to specific talks or publications. | fra | 441 |
| 30 May 2020 05:30 UTC | mhbeals/scissorsandpaste | A collection of transcriptions from British newspapers (1789-1850) alongside originals from colonial and American newspapers, where relevant. | - | 443 |
| 18 Mar 2021 12:52 UTC | pablogalvezprojectosdaw/scissorsandpaste-master | - | - | 443 |
| 05 Mar 2021 08:45 UTC | lknelson/measuring_intersectionality | Code to reproduce the models and analysis in the paper "Leveraging the Alignment between Machine Learning and Intersectionality: Using Word Embeddings to Measure Intersectional Experiences of the Nineteenth Century U.S. South", by Laura K. Nelson | eng, fra, lat, ita, spa, gle, deu, ell, nld, por, cym, nai | 444 |
| 30 May 2020 05:30 UTC | uvalib/dlps_scripts-webdocs | archive of the dlps workflow scripts (and documentation) from pogo.lib [before decommissioning] | eng, zho, fra, nld, rus, deu, spa, fil, ita, por, lat, ind, rom, ell, apa | 444 |
| 29 Mar 2023 11:45 UTC | MiMoText/roman18 | Collection de romans français du dix-huitième siècle (1751-1800) / Collection of Eighteenth-Century French Novels (1751-1800) | fra, ita, eng, deu | 448 |
| 30 May 2020 05:30 UTC | rsmccc/topic-model-ldavis | - | - | 451 |
| 15 Nov 2022 21:45 UTC | ANRChapitres/2000romans19e20e | Corpus de 2000 romans français du 19e et 20e siècles libres de droit en xml-tei | - | 460 |
| 27 Mar 2022 04:50 UTC | himmeproject/persons | Person data for the Historical Index of the Medieval Middle East | - | 461 |
| 14 Oct 2021 20:37 UTC | scta-texts/vn58an | - | lat | 461 |
| 08 Nov 2021 01:35 UTC | IRT2021/Merge-O-Bu-Njem | XML and stylesheets to merge O. Bu Njem data, text and translation from Papyri.info into a single EpiDoc file for IRT 2021 | fra, eng, deu, ita, spa, lat, ell | 462 |
| 30 May 2020 05:30 UTC | marianiku/gottlund | Metsäsuomalaiset > Gottlund | fin, sme | 463 |
| 13 Nov 2022 20:47 UTC | REEDLondon/inns-court | Inns of Court materials | eng, lat, fra | 463 |
| 15 Mar 2023 13:49 UTC | rh1967/rh1967.github.io | - | deu, eng | 471 |
| 30 May 2020 05:30 UTC | acdh-oeaw/glaser-tei | A eXist-db based web-app to process Glaser-Abklatsche | eng, inm | 472 |
| 03 Sep 2021 12:56 UTC | Amleth/SHERLOCK | Social sciences & Humanities corpora Exploration and active Reading with Linked, Open & Contributive Knowledge organisation systems | fra | 473 |
| 30 May 2020 05:30 UTC | bncolorado/CorpusGeneralPoesiaLiricaCastellanaDelSigloDeOro | Corpus piloto para un corpus de referencia general de la poesía lírica castellana del Siglo de Oro. | - | 475 |
| 30 Dec 2020 17:28 UTC | tnhaider/antikoerperchen-german-annotated-poetry | German Canon Poetry Corpus with Annotation | deu | 477 |
| 23 Feb 2021 08:43 UTC | piahh/Graphentheorie | Universitätskurs: Graphentheorie. | deu | 481 |
| 15 Nov 2021 01:36 UTC | deutschestextarchiv/DiBiLit-Korpus | - | deu | 487 |
| 13 Jul 2021 08:39 UTC | Cantavestrella/tei-ausiasmarch | Conversion from TEX format into TEI-XML of the synoptic diplomatic edition of 15-c. Ausiàs March's poems according to all witnesses. | cat | 489 |
| 17 Sep 2020 04:32 UTC | shae128/xml-pdf.js | JavaScript/Node.js library to convert XML to PDF | lat | 497 |
| 30 May 2020 05:30 UTC | kb-dk/public-adl-text-sources | The texts used for building Archive for Danish Literature | - | 498 |
| 30 May 2020 05:30 UTC | Horsmann/DkProTcIntegration | Integration tests for DKPro TC with larger data sets | - | 500 |
| 30 May 2020 05:29 UTC | freethenation/HMM | Playing Around with Hidden Markov Models | - | 500 |
| 14 Sep 2020 16:32 UTC | markzuck24/NLP_SVM_POS | svm code | - | 501 |
| 30 May 2020 05:29 UTC | ericbarnhill/flask_app | Flask By Example app | - | 501 |
| 30 May 2020 05:30 UTC | SashiniHansika/fyp | - | - | 501 |
| 30 May 2020 05:29 UTC | giuseppecascavilla/topic_modelling | topic modelling on a dataset | - | 501 |
| 18 Oct 2021 08:41 UTC | lin380/tadr | Text as Data Resources | - | 501 |
| 30 May 2020 05:29 UTC | jeffthemaximum/word-pair-frequency-calculator | A Flask app that calculates word-frequency pairs based on the text from a given URL | - | 501 |
| 08 Apr 2022 06:51 UTC | balajidileepkumar/Python_MachineLearning | From Basics Python to DataMining in Machine Learning | - | 501 |
| 30 May 2020 05:30 UTC | bxie/ai2_analysis | Data Analysis for App Inventor | - | 501 |
| 30 May 2020 05:30 UTC | francojc/recipes-curate_data | Repository to accompany the 'Curate Language Data' posts (1-2) for the Recipes series | - | 501 |
| 30 May 2020 05:30 UTC | casics/nostril | Nostril: Nonsense String Evaluator | - | 501 |
| 30 May 2020 05:30 UTC | vikramraodp/virginia | - | - | 501 |
| 30 May 2020 05:29 UTC | jorfsson/chatbot | Chatbot practice | - | 501 |
| 07 Feb 2021 20:36 UTC | ocularminds/flask-analytics | - | - | 501 |
| 30 May 2020 05:30 UTC | stephenysh/translate-sunil | - | - | 501 |
| 30 May 2020 05:30 UTC | steinhkl/tsgce | Tiny Statistical Grammar Checking Engine | - | 501 |
| 08 Jul 2021 20:36 UTC | fflah/reksis | - | - | 502 |
| 04 Jul 2020 08:31 UTC | OSH-2020/GDBFS | x-code-nowww created by GitHub Classroom | - | 502 |
| 30 Mar 2023 21:46 UTC | CDRH/data_teaa | Data Repository for To Enter Africa from America | fra, eng | 504 |
| 30 Sep 2022 21:51 UTC | srophe/srophe-xQueries | xQuery scripts written for use with Syriaca.org data (not bundled with the eXist app) | grc, lat, syr, eng, ara, fra, deu | 505 |