Towards Machine-Actionable Modules of a Digital Mathematics Library: The Example of DML-CZ

Warning

This publication doesn't include Faculty of Economics and Administration. It includes Faculty of Informatics. Official publication website can be found on muni.cz.
Authors

RŮŽIČKA Michal SOJKA Petr KREJČÍŘ Vlastimil

Year of publication 2013
Type Article in Proceedings
Conference CICM 2013, LNAI 7961
MU Faculty or unit

Faculty of Informatics

Citation
Web
Doi http://dx.doi.org/10.1007/978-3-642-39320-4_17
Field Informatics
Keywords DML-CZ; EuDML; DOI; ParsCit; references; validation; DSpace; OAI-PMH; TeX; LaTeX; Tralics; Infty; machine-actionable digital library; library automation; Google Scholar; webometrics
Description Publishing and archiving mathematical literature presents its own sets of problems. Reaching the goal of building global digital mathematics library (DML), smaller DMLs play an inevitable role in collecting, validating, digitizing and checking data from smaller publishers. In this paper, we overview the technical challenges of building a machine-actionable set of modules we have developed over almost a decade of evolution of the Czech Digital Mathematics Library (DML-CZ). Firstly, we survey methods of effective automated data acquisition from the content providers. Then we show OCR processing of mathematical documents and automated segmentation of plain text references for metadata enhancement and effective DOI look up. Finally we describe connection to the European Digital Mathematics Library (EuDML) project and public interfaces of DML-CZ for the best visibility and accessibility.
Related projects:

You are running an old browser version. We recommend updating your browser to its latest version.