public class MorfologikAnalyzer extends Analyzer
Analyzer using Morfologik library.Analyzer.GlobalReuseStrategy, Analyzer.PerFieldReuseStrategy, Analyzer.ReuseStrategy, Analyzer.TokenStreamComponentsGLOBAL_REUSE_STRATEGY, PER_FIELD_REUSE_STRATEGY| Constructor and Description |
|---|
MorfologikAnalyzer(Version version)
Builds an analyzer with the default Morfologik's Polish dictionary.
|
MorfologikAnalyzer(Version version,
String dictionaryResource)
Builds an analyzer with an explicit dictionary resource.
|
| Modifier and Type | Method and Description |
|---|---|
protected Analyzer.TokenStreamComponents |
createComponents(String field,
Reader reader)
Creates a
Analyzer.TokenStreamComponents
which tokenizes all the text in the provided Reader. |
close, getOffsetGap, getPositionIncrementGap, getReuseStrategy, initReader, tokenStream, tokenStreampublic MorfologikAnalyzer(Version version, String dictionaryResource)
version - Lucene compatibility versiondictionaryResource - A constant specifying which dictionary to choose. The
dictionary resource must be named morfologik/dictionaries/{dictionaryResource}.dict
and have an associated .info metadata file. See the Morfologik project
for details.public MorfologikAnalyzer(Version version)
protected Analyzer.TokenStreamComponents createComponents(String field, Reader reader)
Analyzer.TokenStreamComponents
which tokenizes all the text in the provided Reader.createComponents in class Analyzerfield - ignored field namereader - source of tokensAnalyzer.TokenStreamComponents
built from an StandardTokenizer filtered with
StandardFilter and MorfologikFilter.Copyright © 2000-2014 Apache Software Foundation. All Rights Reserved.