JBIG2 Supported by OCR
Authors | |
---|---|
Year of publication | 2012 |
Type | Article in Proceedings |
Conference | DML 2012: Towards a Digital Mathematics Library |
MU Faculty or unit | |
Citation | |
Field | Informatics |
Keywords | jbig2enc; JBIG2; PDF size optimization; compression; DML; OCR; pdfJbIm; DML-CZ; EuDML |
Description | Digital Mathematical libraries contain a large volume of PDF documents containing scanned text. In this paper, we describe how this documents can be compressed and thus provide them more effectively to the users. We introduce a JBIG2 standard for compressing bitonal images such as scanned text and we discuss issues if OCR is used for improving the compression ratio of jbig2enc open-source encoder. For this purpose, we have designed API for using OCR in jbig2enc which we describe in this paper together with already achieved results. |
Related projects: |