Slovenské vzory dělení: čas pro změnu?
Title in English | Slovak hyphenation patterns: a time for change? |
---|---|
Authors | |
Year of publication | 2004 |
Type | Article in Periodical |
Magazine / Source | Zpravodaj CSTUG |
MU Faculty or unit | |
Citation | |
Web | |
Doi | http://dx.doi.org/10.5300/2004-3-4/183 |
Field | Use of computers, robotics and its application |
Keywords | hyphenation; hyphenation patterns; patgen; syllabification; Unicode; TeX; syllabic hyphenation; Czech; Slovak |
Description | Hyphenation, or more generally algorithmic segmentation of big wordlist of some language is frequent problem. For Slovak language, there is only version based on the syllable principle available, without coverage of many exceptions. From a wordlist of million collected words we have generated by the PatGen program new freely available patterns that fill this gap. The result is directly usable not only in TeX distributions, but in other systems as well (OpenOffice.org). The techniques of bootstrapping, stratification and patterns generation are handy for solution of plenty of various segmentation tasks. |
Related projects: |