public final class SentenceTokenizer
extends org.apache.lucene.analysis.Tokenizer
The output tokens can then be broken into words with WordTokenFilter
WARNING: The status of the analyzers/smartcn analysis.cn.smart package is experimental. The APIs and file formats introduced here might change in the future and will not be supported anymore in such a case.
| Constructor and Description |
|---|
SentenceTokenizer(org.apache.lucene.util.AttributeSource.AttributeFactory factory,
Reader reader) |
SentenceTokenizer(org.apache.lucene.util.AttributeSource source,
Reader reader) |
SentenceTokenizer(Reader reader) |
| Modifier and Type | Method and Description |
|---|---|
void |
end() |
boolean |
incrementToken() |
void |
reset() |
void |
reset(Reader input) |
getOnlyUseNewAPI, next, next, setOnlyUseNewAPIaddAttribute, addAttributeImpl, captureState, clearAttributes, cloneAttributes, equals, getAttribute, getAttributeClassesIterator, getAttributeFactory, getAttributeImplsIterator, hasAttribute, hasAttributes, hashCode, restoreState, toStringpublic SentenceTokenizer(Reader reader)
public SentenceTokenizer(org.apache.lucene.util.AttributeSource source,
Reader reader)
public SentenceTokenizer(org.apache.lucene.util.AttributeSource.AttributeFactory factory,
Reader reader)
public boolean incrementToken()
throws IOException
incrementToken in class org.apache.lucene.analysis.TokenStreamIOExceptionpublic void reset()
throws IOException
reset in class org.apache.lucene.analysis.TokenStreamIOExceptionpublic void reset(Reader input) throws IOException
reset in class org.apache.lucene.analysis.TokenizerIOExceptionpublic void end()
throws IOException
end in class org.apache.lucene.analysis.TokenStreamIOExceptionCopyright © 2000-2012 Apache Software Foundation. All Rights Reserved.