Text Corpora


Fact Checking Dataset

Sentiment Analysis Datasets

Czech Text Document Corpus

Czech Historical Named Entity Corpus

OCR Corpora and Tools

Text processing logo OCR logo

Image Corpora


ChronSeg: Dataset for Segmentation of Handwritten Historical Chronicles

Unconstrained Facial Images: Database for Face Recognition under Real-world Conditions

Img processing logo Faces logo

Historical Maps Corpora


Historical Map Dataset: Dataset for Detection and Segmentation tasks in Historical Maps

Nomenclature Dataset: Dataset for Detection and Recognition of Handwritten Nomenclatures and toponyms from Historical Cadastral Maps

Map processing logo Map processing logo