其他
100% pure java project to integrate hunspell .aff/.dic files with Apache Lucene.
We aim to provide features such as stemming, decompounding, spellchecking, normalization, term expansion, etc. that take advantage of the existing lexical resources already created and widely-used in projects like OpenOffice (Available support by language)
These files are commonly used for spellchecking purposes, but many have a wide range of uses for word analysis, and the necessarily language-specific support is represented in the files themselves.
For more background on how these resources can be used for open-source word analysis, see these papers: * Hunmorph: open source word analysis * Leveraging the open source ispell codebase for minority language an
暂无评论