The Art of Mathematics Retrieval (invited talk at Informatics Colloquium FI MU, 8.11.2011)
Title in Czech | Umění vyhledávání matematiky (zvaná přednáška na Informatickém kolokviu FI MU, 8.11.2011) |
---|---|
Authors | |
Year of publication | 2011 |
Type | Invited lectures |
MU Faculty or unit | |
Citation | |
Description | The design and architecture of MIaS (Math Indexer and Searcher), a system for mathematics retrieval, are presented, and design decisions are discussed. We argue for an approach based on Presentation MathML that uses the similarity of math subformulae. The system was implemented as a math-aware search engine based on the state-of-the-art system Apache Lucene and is used in the European Digital Mathematics Library (EuDML). Scalability was tested on more than 400,000 arXiv documents containing 158 million mathematical formulae. Almost three billion MathML subformulae were indexed using a Solr-compatible Lucene. |
Related projects: