org.crosswire.jsword.index.lucene.analysis
Class ChineseLuceneAnalyzer
java.lang.Object
org.apache.lucene.analysis.Analyzer
org.crosswire.jsword.index.lucene.analysis.AbstractBookAnalyzer
org.crosswire.jsword.index.lucene.analysis.ChineseLuceneAnalyzer
- All Implemented Interfaces:
- Closeable
public class ChineseLuceneAnalyzer
- extends AbstractBookAnalyzer
Uses org.apache.lucene.analysis.cn.ChineseAnalyzer Analysis:
ChineseTokenizer, ChineseFilter StopFilter, Stemming not implemented yet
Note: org.apache.lucene.analysis.cn.CJKAnalyzer takes overlapping two
character tokenization approach which leads to larger index size.
- Author:
- Sijo Cherian
- See Also:
The GNU Lesser General Public License for details.
Field Summary |
private org.apache.lucene.analysis.cn.ChineseAnalyzer |
myAnalyzer
|
Fields inherited from class org.apache.lucene.analysis.Analyzer |
overridesTokenStreamMethod |
Methods inherited from class org.apache.lucene.analysis.Analyzer |
close, getOffsetGap, getPositionIncrementGap, getPreviousTokenStream, setOverridesTokenStreamMethod, setPreviousTokenStream |
Methods inherited from class java.lang.Object |
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
myAnalyzer
private org.apache.lucene.analysis.cn.ChineseAnalyzer myAnalyzer
ChineseLuceneAnalyzer
public ChineseLuceneAnalyzer()
tokenStream
public final org.apache.lucene.analysis.TokenStream tokenStream(String fieldName,
Reader reader)
- Specified by:
tokenStream
in class org.apache.lucene.analysis.Analyzer
reusableTokenStream
public final org.apache.lucene.analysis.TokenStream reusableTokenStream(String fieldName,
Reader reader)
throws IOException
- Overrides:
reusableTokenStream
in class org.apache.lucene.analysis.Analyzer
- Throws:
IOException