public class PersianLuceneAnalyzer extends AbstractBookAnalyzer
TokenStream
is built from a
ArabicLetterTokenizer
filtered with LowerCaseFilter
,
ArabicNormalizationFilter
, PersianNormalizationFilter
and
Persian StopFilter
(optional)The GNU Lesser General Public License for details.
Modifier and Type | Field and Description |
---|---|
private org.apache.lucene.util.Version |
matchVersion |
book, doStemming, doStopWords, stopSet
Constructor and Description |
---|
PersianLuceneAnalyzer() |
Modifier and Type | Method and Description |
---|---|
org.apache.lucene.analysis.TokenStream |
reusableTokenStream(String fieldName,
Reader reader)
Returns a (possibly reused)
TokenStream which tokenizes all the
text in the provided Reader . |
org.apache.lucene.analysis.TokenStream |
tokenStream(String fieldName,
Reader reader) |
getBook, getDoStopWords, setBook, setDoStemming, setDoStopWords, setStopWords
public final org.apache.lucene.analysis.TokenStream tokenStream(String fieldName, Reader reader)
tokenStream
in class org.apache.lucene.analysis.Analyzer
public org.apache.lucene.analysis.TokenStream reusableTokenStream(String fieldName, Reader reader) throws IOException
TokenStream
which tokenizes all the
text in the provided Reader
.reusableTokenStream
in class org.apache.lucene.analysis.Analyzer
TokenStream
built from a ArabicLetterTokenizer
filtered with LowerCaseFilter
,
ArabicNormalizationFilter
,
PersianNormalizationFilter
and Persian Stop wordsIOException