Nomenclature Dataset serves for detection and recognition of so-called nomenclatures in historical cadastral maps. The nomenclature is a handwritten piece of text which identifies the position of the individual map sheet in the grid coverinfg a larger region. It covers two tasks: 1. Nomeclature Detection - finding the exact position of the nomenclature text within the map sheet 2. Nomenclature Recognition - transcribe the nomenclature by the means of optical charactre recognition (OCR) or handwritten text recognition (HTR) It is freely available for education and research purposes. However, any other use is strictly excluded!
The dataset contains 800 map sheets in total. It is divided into training, testing and validation parts that contain 650, 100 and 50 sheets respectively.
Two files with a same name are provided for each map sheet:
This dataset is licensed under the Attribution-NonCommercial-ShareAlike 4.0 International, so commercial use in any form is excluded.
Please, cite this paper when you used this dataset in your experiments.
If you have additional questions / comments related to this dataset, please, do not hesitate to contact the authors: Ladislav Lenc llenc@kiv.zcu.cz or Pavel Král pkral@kiv.zcu.cz.