search for: cannotdistinguish

Displaying 1 result from an estimated 1 matches for "cannotdistinguish".

2011 Apr 07
1
GSOC 2011- CJK Support
...the indexing problem in Chinese version of help system for OpenOffice. The OpenOffice use Lucene to implement the indexing . I'll be happy to participate in this project during Google Summer ofCode 2011 program and implement CJK Support. As Chinese letters are not delimited by whitespace, we cannotdistinguish them easily. After my investigation, I find three methods to resolve this issue, and I prefer the last one. 1. Set each letter as a key to index, This is used by Lucene as default. The class is *StandardAnalyzer* 2. Every two letter as a key to index. This is used by Lucene for ?...