java.lang.Object
org.apache.lucene.analysis.Analyzer
org.apache.lucene.analysis.query.QueryAutoStopWordAnalyzer
public class QueryAutoStopWordAnalyzer extends Analyzer

| Field Summary | |
|---|---|
| static float | defaultMaxDocFreqPercent |

| Constructor Summary | |
|---|---|
| QueryAutoStopWordAnalyzer(Analyzer delegate) | Initializes this analyzer with the Analyzer object that actually produces the tokens. |

| Method Summary | |
|---|---|
| int | addStopWords(IndexReader reader): Automatically adds stop words for all fields with terms exceeding defaultMaxDocFreqPercent. |
| int | addStopWords(IndexReader reader, float maxPercentDocs): Automatically adds stop words for all fields with terms exceeding maxPercentDocs. |
| int | addStopWords(IndexReader reader, int maxDocFreq): Automatically adds stop words for all fields with terms exceeding maxDocFreq. |
| int | addStopWords(IndexReader reader, String fieldName, float maxPercentDocs): Automatically adds stop words for the given field with terms exceeding maxPercentDocs. |
| int | addStopWords(IndexReader reader, String fieldName, int maxDocFreq): Automatically adds stop words for the given field with terms exceeding maxDocFreq. |
| Term[] | getStopWords(): Provides information on which stop words have been identified for all fields. |
| String[] | getStopWords(String fieldName): Provides information on which stop words have been identified for a field. |
| TokenStream | tokenStream(String fieldName, Reader reader): Creates a TokenStream which tokenizes all the text in the provided Reader. |

| Methods inherited from class org.apache.lucene.analysis.Analyzer |
|---|
| getPositionIncrementGap, getPreviousTokenStream, reusableTokenStream, setPreviousTokenStream |

| Methods inherited from class java.lang.Object |
|---|
| clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |

| Field Detail |
|---|

public static final float defaultMaxDocFreqPercent

| Constructor Detail |
|---|

public QueryAutoStopWordAnalyzer(Analyzer delegate)

Parameters:
delegate - The analyzer used to produce the token stream that needs filtering

| Method Detail |
|---|

public int addStopWords(IndexReader reader) throws IOException

Parameters:
reader - The IndexReader that will be consulted to identify potential stop words that exceed the required document frequency

Throws:
IOException

public int addStopWords(IndexReader reader, int maxDocFreq) throws IOException

Parameters:
reader - The IndexReader that will be consulted to identify potential stop words that exceed the required document frequency
maxDocFreq - The maximum number of index documents that can contain a term, after which the term is considered to be a stop word

Throws:
IOException

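The maxDocFreq rule above can be illustrated without an index. The following is a minimal sketch, not Lucene's implementation: it assumes the per-term document frequencies are already available in a plain map, and the class name DocFreqThresholdSketch and method termsExceeding are hypothetical names chosen for illustration. A term is selected only when its document frequency strictly exceeds the threshold.

```java
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.Set;
import java.util.TreeSet;

// Hypothetical illustration class; not part of Lucene.
public class DocFreqThresholdSketch {

    /** Returns, in sorted order, the terms whose document frequency exceeds maxDocFreq. */
    public static Set<String> termsExceeding(Map<String, Integer> docFreqByTerm, int maxDocFreq) {
        Set<String> stopWords = new TreeSet<>();
        for (Map.Entry<String, Integer> entry : docFreqByTerm.entrySet()) {
            // "Exceeding" is read here as strictly greater than the threshold.
            if (entry.getValue() > maxDocFreq) {
                stopWords.add(entry.getKey());
            }
        }
        return stopWords;
    }

    public static void main(String[] args) {
        // Made-up document frequencies for a single field.
        Map<String, Integer> docFreq = new LinkedHashMap<>();
        docFreq.put("the", 950);
        docFreq.put("lucene", 120);
        docFreq.put("of", 800);
        System.out.println(termsExceeding(docFreq, 500)); // prints [of, the]
    }
}
```

A term that appears in exactly maxDocFreq documents is kept under this reading, since it does not exceed the limit.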
public int addStopWords(IndexReader reader, float maxPercentDocs) throws IOException

Parameters:
reader - The IndexReader that will be consulted to identify potential stop words that exceed the required document frequency
maxPercentDocs - The maximum percentage (between 0.0 and 1.0) of index documents that can contain a term, after which the term is considered to be a stop word

Throws:
IOException

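Because maxPercentDocs is a fraction between 0.0 and 1.0, it has to be related to an absolute document count before it can be compared with a term's document frequency. The sketch below shows one plausible conversion, assuming the fraction is multiplied by the number of documents in the index (for example the value reported by IndexReader.numDocs()) and truncated to an int. The class and method names are hypothetical, and the rounding the library itself applies may differ.

```java
// Hypothetical illustration class; not part of Lucene.
public class PercentToDocFreqSketch {

    /** Converts a fraction of documents (0.0 to 1.0) into an absolute document count. */
    public static int toMaxDocFreq(int numDocs, float maxPercentDocs) {
        if (maxPercentDocs < 0.0f || maxPercentDocs > 1.0f) {
            throw new IllegalArgumentException("maxPercentDocs must be between 0.0 and 1.0");
        }
        // Assumption: truncate toward zero rather than round.
        return (int) (numDocs * maxPercentDocs);
    }

    public static void main(String[] args) {
        // With 1000 documents and a fraction of 0.5, terms in more than 500 documents qualify.
        System.out.println(toMaxDocFreq(1000, 0.5f)); // prints 500
    }
}
```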
public int addStopWords(IndexReader reader, String fieldName, float maxPercentDocs) throws IOException

Parameters:
reader - The IndexReader that will be consulted to identify potential stop words that exceed the required document frequency
fieldName - The field for which stop words will be added
maxPercentDocs - The maximum percentage (between 0.0 and 1.0) of index documents that can contain a term, after which the term is considered to be a stop word

Throws:
IOException

public int addStopWords(IndexReader reader, String fieldName, int maxDocFreq) throws IOException

Parameters:
reader - The IndexReader that will be consulted to identify potential stop words that exceed the required document frequency
fieldName - The field for which stop words will be added
maxDocFreq - The maximum number of index documents that can contain a term, after which the term is considered to be a stop word

Throws:
IOException

public TokenStream tokenStream(String fieldName, Reader reader)

Description copied from class: Analyzer
Creates a TokenStream which tokenizes all the text in the provided Reader.

Specified by:
tokenStream in class Analyzer

public String[] getStopWords(String fieldName)

Parameters:
fieldName - The field for which stop words identified in "addStopWords" method calls will be returned

public Term[] getStopWords()
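
The per-field behaviour described by getStopWords(String fieldName) and tokenStream can be pictured as a map from field name to a set of stop words, which is consulted when tokens for that field are produced. The following is a minimal sketch under that assumption. It works on plain strings instead of Lucene token streams, and PerFieldStopWordsSketch and its filter method are hypothetical names, not the library's API.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Collection;
import java.util.Collections;
import java.util.HashMap;
import java.util.List;
import java.util.Map;
import java.util.Set;
import java.util.TreeSet;

// Hypothetical illustration class; not part of Lucene.
public class PerFieldStopWordsSketch {

    // Stop words are tracked separately for each field name.
    private final Map<String, Set<String>> stopWordsByField = new HashMap<>();

    /** Records stop words for one field. */
    public void addStopWords(String fieldName, Collection<String> words) {
        stopWordsByField.computeIfAbsent(fieldName, k -> new TreeSet<>()).addAll(words);
    }

    /** Returns the stop words recorded for a field, or an empty array if there are none. */
    public String[] getStopWords(String fieldName) {
        Set<String> words = stopWordsByField.getOrDefault(fieldName, Collections.emptySet());
        return words.toArray(new String[0]);
    }

    /** Drops any token that is a stop word for the given field. */
    public List<String> filter(String fieldName, List<String> tokens) {
        Set<String> stop = stopWordsByField.getOrDefault(fieldName, Collections.emptySet());
        List<String> kept = new ArrayList<>();
        for (String token : tokens) {
            if (!stop.contains(token)) {
                kept.add(token);
            }
        }
        return kept;
    }

    public static void main(String[] args) {
        PerFieldStopWordsSketch sketch = new PerFieldStopWordsSketch();
        sketch.addStopWords("body", Arrays.asList("the", "of"));
        List<String> tokens = Arrays.asList("the", "history", "of", "lucene");
        System.out.println(sketch.filter("body", tokens));  // prints [history, lucene]
        System.out.println(sketch.filter("title", tokens)); // prints [the, history, of, lucene]
    }
}
```

A field with no recorded stop words passes its tokens through unchanged, which is why the "title" call returns the full list.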