Association Analyzer Implementation: State of the Art: Deliverable 8.1 of project EuDML

Warning

This publication doesn't include Faculty of Economics and Administration. It includes Faculty of Informatics. Official publication website can be found on muni.cz.
Authors

LEE Mark SOJKA Petr SORGE Volker BAKER Josef HURY Wojtek BOLIKOWSKI Łukasz

Year of publication 2010
MU Faculty or unit

Faculty of Informatics

Citation
Description This report focuses on two key technologies: Citation Indexing and Document Clustering. Citation Indexing concerns the automatic parsing and linking of citations to create a network of documents within the collection. This technology is well established in digital libraries and searchable archives such as CiteSeerX, Google Scholar, general projects as DRIVER, and mathematical specific digital libraries such as NUMDAM, DML-CZ or referative databases Zentralblatt MATH and Mathematical Reviews. Document Classification and Clustering are also established technologies within Information Retrieval but have not to date been widely used within digital libraries. In particular, there is very little previous work applying classification and clustering techniques to mathematical documents. However, initial research appears promising and we believe that the addition of these technologies will allow facilities beyond the current state of the art.
Related projects:

You are running an old browser version. We recommend updating your browser to its latest version.