Sample data set exemplifying an idealized data processing pipeline for didactic purposes
-
Updated
Jun 12, 2024 - Python
Sample data set exemplifying an idealized data processing pipeline for didactic purposes
Dataset of the University of Basel's research seminar "Indexing and Digital Processing of a Historical Image Collection on the Appropriation of Buddhism in the West"
Upload multiple directories as multiple documents to collection in Transkribus using its REST API
django app to interact with Transkribus-API
Custom named entity recognition (persons, locations) using spaCy for German texts annotated in Transkribus
Update the PageXML of multiple documents in a collection in Transkribus using its REST API
A pipeline to transfer ground truth from Transkribus to eScriptorium.
Scripts and utility functions to ingest Transkribus Data into ARCHE
Batch upload and update documents with Transkribus' REST API.
Create ready-to-use Label Studio pre-populated JSON files from popular OCR formats.
Post-process PageXMLs to better the reading order of regions
Add transcriptions to items in Tropy using the Transkribus metagrapho API
Script pour interroger l'API de Transkribus et générer des fichiers XML-TEI et leur métadonnées
A python package providing some utility functions for interacting with the Transkribus-API
Add a description, image, and links to the transkribus topic page so that developers can more easily learn about it.
To associate your repository with the transkribus topic, visit your repo's landing page and select "manage topics."