Building the Ultimate Math Search Engine

Warning

This publication doesn't include Faculty of Economics and Administration. It includes Faculty of Informatics. Official publication website can be found on muni.cz.
Authors

LÍŠKA Martin SOJKA Petr

Year of publication 2015
MU Faculty or unit

Faculty of Informatics

Citation
Description Mathematics information retrieval (MIR) is a domain specific branch of Information Retrieval. MIR aims at searching information in documents with significant amount of mathematical content in the form of expressions and formulae. Based on the newly established international MIR evaluation forum and on the number of MIR related research groups around the world, it is definitely on the rise. In this work I have summarized and compared different approaches to math-aware search systems. More detailed description of Math Indexer and Searcher (MIaS) was provided as this is our system created at Faculty of Informatics, Masaryk University, primarily designed and developed by me. MIaS is currently reported as the best performing MIR system in terms of effectiveness. In this work I proposed several topics which are main research interests of my studies. The topics correlate with possible features that can improve the effectiveness of MIR systems. Namely, the proposed topics are math formula substree unification, integration of algebraic computational power into the indexing as well as searching phase, query expansion as a way of increasing recall, query variables, combination of more approaches within one system and a utilization of combination of text and math search. One topic that spans over all other topics is evaluation which is a necessity in a process of continuous improvement of effectiveness.
Related projects:

You are running an old browser version. We recommend updating your browser to its latest version.