TEIhub
Discover TEI-encoded documents from GitHub public repositories.

Last indexed Repository Description Languages Matching files
30 May 2020 05:29 UTC arjanski/​ceteicean-​test A bare-bones Vue 2.6 project with CETEIcean library for testing purposes - 1
30 May 2020 05:30 UTC markpbaggett/​base_​scout_​apps A base tei publisher app for all our SCOUT TEI migrations - 2
30 May 2020 05:30 UTC oelkapmis/​NLP-​-​-​Bigrams-​and-​Trigrams A basic python code to determine bigrams and trigrams from corpus via NLTK libraries - 1
08 Dec 2022 15:45 UTC gipplab/​pdf-​benchmark A Benchmark of PDF Information Extraction Tools using a Multi-Task and Multi-Domain Evaluation Framework for Academic Documents eng 5
30 May 2020 05:29 UTC acdh-​oeaw/​howto A blog application which uses GitHub as data storage - 1
11 Jun 2021 17:11 UTC Jean-​Baptiste-​Camps/​ALTEI a bunch of scripts to manipulate ALTO and XML/TEI xno 1
12 Nov 2020 20:32 UTC helen189/​Laurence-​Sterne-​and-​Sterneana A Cambridge Digital Library Project fra, eng 113
30 May 2020 05:29 UTC dh-​nuigalway/​Personae A Character-Visualisation Tool for Dramatic Texts eng, grc, ell, grk, lat, ita, spa, fra, deu, nld 2
30 May 2020 05:29 UTC allenai/​citeomatic A citation recommendation system that allows users to find relevant citations for their paper drafts. The tool is backed by Semantic Scholar's OpenCorpus dataset. eng 2
30 May 2020 05:30 UTC nilsreiter/​generic-​xml-​reader A class to read in arbitrary XML content (including TEI) into UIMA, translating some structural annotation to stand off - 2
08 Jun 2022 05:41 UTC giannetti/​tei-​exercise a classroom exercise that uses the Firefox XSLT processor for an HTML preview - 1
18 Dec 2020 12:53 UTC acdh-​oeaw/​freud_​api_​crawler A client to interact with freud-net API - 2
19 Sep 2022 11:46 UTC freud-​digital/​freud_​api_​crawler A client to interact with the JSONAPI of https://www.freud-edition.net deu 7
30 Dec 2022 22:44 UTC joemac875/​poetexts A client-server system to text users poems that match desired tags. - 356
11 Sep 2020 08:32 UTC EleonoraPeruch/​lezioni-​americane A close reading of the first lecture of Italo Calvino's Lezioni americane, through the employment of XML technologies. ita, lat, fra, eng, deu 1
18 Dec 2022 13:43 UTC projectEndings/​staticSearch A codebase to support a pure JSON search engine requiring no backend for any XHTML5 document collection - 1
30 Jun 2021 12:58 UTC Furman-​Editions-​In-​Progress/​elijah-​furman-​itineraries A collaboration between the University of Haifa and the Elijah Lab and the Department of Classics at Furman University. - 6
06 Aug 2021 04:49 UTC ColmexBDCV/​dissertations_​as_​data A collaborative repository for Silvia Gutiérrez' and Rodrigo Cuéllar's current research on mining Electronic Thesis and Dissertations (ETD) as Data spa 4
30 May 2020 05:30 UTC performant-​software/​textlab A collaborative space for creating and publishing digital critical editions. - 6
30 May 2020 05:30 UTC ljo/​collatex-​tutorial A CollateX tutorial repo - 3
30 May 2020 05:30 UTC lfoppiano/​hedgehog A collection of applications and utilities of text extraction applied to several domains (history, geography, ...) fra, eng 1
30 May 2020 05:29 UTC AllynWaller/​ma-​thesis A collection of code and data from my Master's Thesis at Tufts University in the Digital Tools for Premodern Studies program eng, grc 9
30 May 2020 05:29 UTC charlottemueller/​hist3814o a collection of codes created and used by charlottemueller for hist3814o - 1
23 Aug 2022 08:52 UTC d-​flood/​criticus A collection of computer tools for aiding the text critical workflow from transcription to collation to analysis. grc 1
04 May 2021 12:57 UTC arojascastro/​fabulasmitologicas A collection of Golden Age poems in Spanish in TEI and plain text - 26
30 May 2020 05:30 UTC JamesWolfe753/​First1KGreek A collection of Greek works from Homer to 250CE that do not already appear in the Perseus Digital Library (Open Greek and Latin Project) grc, lat, deu, eng, cop, nld, fra, mul, ita, ell 1373
30 May 2020 05:30 UTC GCDigitalFellows/​workshop-​resources A collection of handouts, cheatsheets, and other resources from Digital Fellows workshops. - 2
09 May 2022 11:41 UTC nevenjovanovic/​laudationes-​urbium-​dalmaticarum A collection of Latin texts praising cities in Renaissance Dalmatia; lemmatized, annotated grc 11
30 May 2020 05:29 UTC nweiler/​LeeToWells A collection of letters from Vernon Lee to H.G. Wells encoded in TEI. deu, eng 16
30 Dec 2022 09:45 UTC amclark42/​a-​life-​in-​lists A collection of lists. Self-maintained metadata, largely for my own use. - 13
30 May 2020 05:30 UTC thsh77/​textbase A collection of markdown texts - 4801
30 May 2020 05:30 UTC thsh77/​xslt A collection of retired stylesheets. - 1
30 May 2020 05:30 UTC bpwilcox/​bw-​projects A collection of robotics, control, and machine learning relevant projects over the years - 88
30 May 2020 05:30 UTC cstahmer/​text_​mining_​with_​r A collection of scripts for teaching and learning basic text mining methods in R - 44
30 May 2020 05:29 UTC csae8092/​XML-​Tests a collection of some xml files to play with eXgit modul - 13
26 Oct 2022 17:51 UTC evt-​project/​evt-​sample-​documents A collection of TEI documents used as edition examples in EVT. slv, ags, ang, lat, ara, ita, eng, lng, fra, spa 37
30 May 2020 05:29 UTC dannguyen/​scrapespeare A collection of The Bard's text for basic programming exercises and data mining. eng, fra, ita, lat, spa 42
30 May 2020 05:30 UTC dorothealint/​William_​Combe_​Works A collection of the works of William Combe for literary analysis - 23
30 May 2020 05:30 UTC mhbeals/​scissorsandpaste A collection of transcriptions from British newspapers (1789-1850) alongside originals from colonial and American newspapers, where relevant. - 443
30 May 2020 05:30 UTC jensopetersen/​mopane A collection of XQuery scripts facilitating standoff markup of TEI documents - 2
30 May 2020 05:30 UTC ravenray/​Yogurt_​Corpus A collections of posts having to do with the word vulnerability pulled from StackOverflow. - 7
27 Jul 2020 16:31 UTC qmoya/​Bekker A command-line utility that reads an Aristotelian work’s XML file from Perseus, and dumps it to the standard output in a Roam-Research-friendly format. lat, eng, deu 1
30 May 2020 05:29 UTC ouranobasis/​GreekDictionary A Console app for the LSJ Greek Dictionary eng, lat, fra 1
13 Oct 2021 08:41 UTC redewiedergabe/​corpus a corpus annotated for speech, thought and writing representation - 3599
20 Dec 2021 20:40 UTC anaistack/​cefr-​asag-​corpus A corpus of short answers written by learners of English and graded with CEFR levels - 708
30 May 2020 05:29 UTC rkurdiov/​hb3d-​documents a corpus of TEI encoded text resources for the vienna hofburg - 38
15 Aug 2022 10:48 UTC INL/​BlackLab A corpus retrieval engine based on Apache Lucene - 7
30 May 2020 05:29 UTC cite-​architecture/​ohco2 A cross-platform library for working with collections of texts in the OHCO2 model grc 2
30 May 2020 05:29 UTC Eumaeus/​cts-​demo-​corpus A CTS corpus containing a variety of texts, editions, translations, and exemplars. eng, grc, lat 11
30 May 2020 05:29 UTC nevenjovanovic/​modruski-​riario-​cts A CTS edition of Nicolaus of Modruš Latin oration for Pietro Riario (Rome, 1474) lat 9