Paper details

Title: Automated Extraction of Labels from Large-Scale Historical Maps

Author: Inga Schlegel

Abstract: Obtained from CrossRef

Abstract. Historical maps are frequently neither readable, searchable nor analyzable by machines due to lacking databases or ancillary information about their content. Identifying and annotating map labels is seen as a first step towards an automated legibility of those. This article investigates a universal and transferable methodology for the work with large-scale historical maps and their comparability to others while reducing manual intervention to a minimum. We present an end-to-end approach which increases the number of true positive identified labels by combining available text detection, recognition, and similarity measuring tools with own enhancements. The comparison of recognized historical with current street names produces a satisfactory accordance which can be used to assign their point-like representatives within a final rough georeferencing. The demonstrated workflow facilitates a spatial orientation within large-scale historical maps by enabling the establishment of relating databases. Assigning the identified labels to the geometries of related map features may contribute to machine-readable and analyzable historical maps.

Codecheck details

Certificate identifier: 2021-007

Codechecker name: Daniel Nüst

Time of codecheck: 2021-06-10 12:00:00

Repository: https://osf.io/anv9r

Codecheck report: https://doi.org/10.17605/osf.io/anv9r

Summary:

The provided workflow could be partially reproduced, notably the text detection and string similarity calculations could be executed and provide sufficiently similar results. Some manual steps were included which could not be reproduced, as well as a final step due to long execution time. The software stack was challenging to install but overall well documented.


https://codecheck.org.uk/ | GitHub codecheckers

© Stephen Eglen & Daniel Nüst

Published under CC BY-SA 4.0

DOI of Zenodo Deposit

CODECHECK is a process for independent execution of computations underlying scholarly research articles.