Discover TEI-encoded documents in public GitHub repositories.
Last indexed | Repository | Description | Languages | Matching files |
30 May 2020 05:29 UTC | arjanski/ceteicean-test-nuxt | A bare-bones Nuxt.js 2.11.0 project with CETEIcean.js library | - | 1 |
30 May 2020 05:29 UTC | arjanski/ceteicean-test | A bare-bones Vue 2.6 project with CETEIcean library for testing purposes | - | 1 |
30 May 2020 05:30 UTC | markpbaggett/base_scout_apps | A base tei publisher app for all our SCOUT TEI migrations | - | 2 |
30 May 2020 05:30 UTC | oelkapmis/NLP---Bigrams-and-Trigrams | A basic python code to determine bigrams and trigrams from corpus via NLTK libraries | - | 1 |
08 Dec 2022 15:45 UTC | gipplab/pdf-benchmark | A Benchmark of PDF Information Extraction Tools using a Multi-Task and Multi-Domain Evaluation Framework for Academic Documents | eng | 5 |
30 May 2020 05:29 UTC | acdh-oeaw/howto | A blog application which uses GitHub as data storage | - | 1 |
11 Jun 2021 17:11 UTC | Jean-Baptiste-Camps/ALTEI | a bunch of scripts to manipulate ALTO and XML/TEI | xno | 1 |
12 Nov 2020 20:32 UTC | helen189/Laurence-Sterne-and-Sterneana | A Cambridge Digital Library Project | fra, eng | 113 |
30 May 2020 05:29 UTC | dh-nuigalway/Personae | A Character-Visualisation Tool for Dramatic Texts | eng, grc, ell, grk, lat, ita, spa, fra, deu, nld | 2 |
30 May 2020 05:29 UTC | allenai/citeomatic | A citation recommendation system that allows users to find relevant citations for their paper drafts. The tool is backed by Semantic Scholar's OpenCorpus dataset. | eng | 2 |
30 May 2020 05:30 UTC | nilsreiter/generic-xml-reader | A class to read in arbitrary XML content (including TEI) into UIMA, translating some structural annotation to stand off | - | 2 |
08 Jun 2022 05:41 UTC | giannetti/tei-exercise | a classroom exercise that uses the Firefox XSLT processor for an HTML preview | - | 1 |
18 Dec 2020 12:53 UTC | acdh-oeaw/freud_api_crawler | A client to interact with freud-net API | - | 2 |
19 Sep 2022 11:46 UTC | freud-digital/freud_api_crawler | A client to interact with the JSONAPI of | deu | 7 |
30 Dec 2022 22:44 UTC | joemac875/poetexts | A client-server system to text users poems that match desired tags. | - | 356 |
11 Sep 2020 08:32 UTC | EleonoraPeruch/lezioni-americane | A close reading of the first lecture of Italo Calvino's Lezioni americane, through the employment of XML technologies. | ita, lat, fra, eng, deu | 1 |
18 Dec 2022 13:43 UTC | projectEndings/staticSearch | A codebase to support a pure JSON search engine requiring no backend for any XHTML5 document collection | - | 1 |
30 Jun 2021 12:58 UTC | Furman-Editions-In-Progress/elijah-furman-itineraries | A collaboration between the University of Haifa and the Elijah Lab and the Department of Classics at Furman University. | - | 6 |
06 Aug 2021 04:49 UTC | ColmexBDCV/dissertations_as_data | A collaborative repository for Silvia Gutiérrez' and Rodrigo Cuéllar's current research on mining Electronic Thesis and Dissertations (ETD) as Data | spa | 4 |
30 May 2020 05:30 UTC | performant-software/textlab | A collaborative space for creating and publishing digital critical editions. | - | 6 |
30 May 2020 05:30 UTC | ljo/collatex-tutorial | A CollateX tutorial repo | - | 3 |
30 May 2020 05:30 UTC | lfoppiano/hedgehog | A collection of applications and utilities of text extraction applied to several domains (history, geography, ...) | fra, eng | 1 |
30 May 2020 05:29 UTC | AllynWaller/ma-thesis | A collection of code and data from my Master's Thesis at Tufts University in the Digital Tools for Premodern Studies program | eng, grc | 9 |
30 May 2020 05:29 UTC | charlottemueller/hist3814o | a collection of codes created and used by charlottemueller for hist3814o | - | 1 |
23 Aug 2022 08:52 UTC | d-flood/criticus | A collection of computer tools for aiding the text critical workflow from transcription to collation to analysis. | grc | 1 |
04 May 2021 12:57 UTC | arojascastro/fabulasmitologicas | A collection of Golden Age poems in Spanish in TEI and plain text | - | 26 |
30 May 2020 05:30 UTC | JamesWolfe753/First1KGreek | A collection of Greek works from Homer to 250CE that do not already appear in the Perseus Digital Library (Open Greek and Latin Project) | grc, lat, deu, eng, cop, nld, fra, mul, ita, ell | 1373 |
30 May 2020 05:30 UTC | GCDigitalFellows/workshop-resources | A collection of handouts, cheatsheets, and other resources from Digital Fellows workshops. | - | 2 |
09 May 2022 11:41 UTC | nevenjovanovic/laudationes-urbium-dalmaticarum | A collection of Latin texts praising cities in Renaissance Dalmatia; lemmatized, annotated | grc | 11 |
30 May 2020 05:29 UTC | nweiler/LeeToWells | A collection of letters from Vernon Lee to H.G. Wells encoded in TEI. | deu, eng | 16 |
30 Dec 2022 09:45 UTC | amclark42/a-life-in-lists | A collection of lists. Self-maintained metadata, largely for my own use. | - | 13 |
30 May 2020 05:30 UTC | thsh77/textbase | A collection of markdown texts | - | 4801 |
30 May 2020 05:30 UTC | thsh77/xslt | A collection of retired stylesheets. | - | 1 |
30 May 2020 05:30 UTC | bpwilcox/bw-projects | A collection of robotics, control, and machine learning relevant projects over the years | - | 88 |
30 May 2020 05:30 UTC | cstahmer/text_mining_with_r | A collection of scripts for teaching and learning basic text mining methods in R | - | 44 |
30 May 2020 05:29 UTC | csae8092/XML-Tests | a collection of some xml files to play with eXgit module | - | 13 |
26 Oct 2022 17:51 UTC | evt-project/evt-sample-documents | A collection of TEI documents used as edition examples in EVT. | slv, ags, ang, lat, ara, ita, eng, lng, fra, spa | 37 |
30 May 2020 05:29 UTC | dannguyen/scrapespeare | A collection of The Bard's text for basic programming exercises and data mining. | eng, fra, ita, lat, spa | 42 |
30 May 2020 05:30 UTC | dorothealint/William_Combe_Works | A collection of the works of William Combe for literary analysis | - | 23 |
30 May 2020 05:30 UTC | mhbeals/scissorsandpaste | A collection of transcriptions from British newspapers (1789-1850) alongside originals from colonial and American newspapers, where relevant. | - | 443 |
30 May 2020 05:30 UTC | jensopetersen/mopane | A collection of XQuery scripts facilitating standoff markup of TEI documents | - | 2 |
30 May 2020 05:30 UTC | ravenray/Yogurt_Corpus | A collection of posts having to do with the word vulnerability pulled from StackOverflow. | - | 7 |
27 Jul 2020 16:31 UTC | qmoya/Bekker | A command-line utility that reads an Aristotelian work’s XML file from Perseus, and dumps it to the standard output in a Roam-Research-friendly format. | lat, eng, deu | 1 |
30 May 2020 05:29 UTC | ouranobasis/GreekDictionary | A Console app for the LSJ Greek Dictionary | eng, lat, fra | 1 |
13 Oct 2021 08:41 UTC | redewiedergabe/corpus | a corpus annotated for speech, thought and writing representation | - | 3599 |
20 Dec 2021 20:40 UTC | anaistack/cefr-asag-corpus | A corpus of short answers written by learners of English and graded with CEFR levels | - | 708 |
30 May 2020 05:29 UTC | rkurdiov/hb3d-documents | a corpus of TEI encoded text resources for the vienna hofburg | - | 38 |
15 Aug 2022 10:48 UTC | INL/BlackLab | A corpus retrieval engine based on Apache Lucene | - | 7 |
30 May 2020 05:29 UTC | cite-architecture/ohco2 | A cross-platform library for working with collections of texts in the OHCO2 model | grc | 2 |
30 May 2020 05:29 UTC | Eumaeus/cts-demo-corpus | A CTS corpus containing a variety of texts, editions, translations, and exemplars. | eng, grc, lat | 11 |