| Modifier and Type | Field and Description |
|---|---|
protected BeamSearch<String> |
beam
The beam used to search for sequences of chunk tag assignments.
|
static int |
DEFAULT_BEAM_SIZE |
protected MaxentModel |
model
The model used to assign chunk tags to a sequence of tokens.
|
| Constructor and Description |
|---|
ChunkerME(ChunkerModel model)
Initializes the current instance with the specified model.
|
ChunkerME(ChunkerModel model,
int beamSize)
Initializes the current instance with the specified model and
the specified beam size.
|
ChunkerME(ChunkerModel model,
int beamSize,
SequenceValidator<String> sequenceValidator)
Deprecated.
Use
ChunkerME(ChunkerModel, int) instead
and use the ChunkerFactory to configure the SequenceValidator. |
ChunkerME(ChunkerModel model,
int beamSize,
SequenceValidator<String> sequenceValidator,
ChunkerContextGenerator contextGenerator)
Deprecated.
Use
ChunkerME(ChunkerModel, int) instead
and use the ChunkerFactory to configure the SequenceValidator and ChunkerContextGenerator. |
ChunkerME(MaxentModel mod)
Deprecated.
|
ChunkerME(MaxentModel mod,
ChunkerContextGenerator cg)
Deprecated.
|
ChunkerME(MaxentModel mod,
ChunkerContextGenerator cg,
int beamSize)
Deprecated.
|
| Modifier and Type | Method and Description |
|---|---|
List<String> |
chunk(List<String> toks,
List<String> tags)
Deprecated.
|
String[] |
chunk(String[] toks,
String[] tags)
Generates chunk tags for the given sequence returning the result in an array.
|
Span[] |
chunkAsSpans(String[] toks,
String[] tags)
Generates tagged chunk spans for the given sequence returning the result in a span array.
|
double[] |
probs()
Returns an array with the probabilities of the last decoded sequence.
|
void |
probs(double[] probs)
Populates the specified array with the probabilities of the last decoded sequence.
|
Sequence[] |
topKSequences(List<String> sentence,
List<String> tags)
Deprecated.
|
Sequence[] |
topKSequences(String[] sentence,
String[] tags)
Returns the top k chunk sequences for the specified sentence with the specified pos-tags
|
Sequence[] |
topKSequences(String[] sentence,
String[] tags,
double minSequenceScore)
Returns the top k chunk sequences for the specified sentence with the specified pos-tags
|
static ChunkerModel |
train(String lang,
ObjectStream<ChunkSample> in,
ChunkerContextGenerator contextGenerator,
TrainingParameters mlParams)
Deprecated.
Use
#train(String, ObjectStream, ChunkerContextGenerator, TrainingParameters, ChunkerFactory)
instead. |
static ChunkerModel |
train(String lang,
ObjectStream<ChunkSample> in,
int cutoff,
int iterations)
Deprecated.
use
train(String, ObjectStream, ChunkerContextGenerator, TrainingParameters)
instead and pass in a TrainingParameters object. |
static ChunkerModel |
train(String lang,
ObjectStream<ChunkSample> in,
int cutoff,
int iterations,
ChunkerContextGenerator contextGenerator)
Deprecated.
use
train(String, ObjectStream, ChunkerContextGenerator, TrainingParameters)
instead and pass in a TrainingParameters object. |
static ChunkerModel |
train(String lang,
ObjectStream<ChunkSample> in,
TrainingParameters mlParams,
ChunkerFactory factory) |
public static final int DEFAULT_BEAM_SIZE
protected BeamSearch<String> beam
protected MaxentModel model
public ChunkerME(ChunkerModel model, int beamSize, SequenceValidator<String> sequenceValidator, ChunkerContextGenerator contextGenerator)
ChunkerME(ChunkerModel, int) instead
and use the ChunkerFactory to configure the SequenceValidator and ChunkerContextGenerator.model - The model for this chunker.beamSize - The size of the beam that should be used when decoding sequences.sequenceValidator - The SequenceValidator to determines whether the outcome
is valid for the preceding sequence. This can be used to implement constraints
on what sequences are valid.public ChunkerME(ChunkerModel model, int beamSize, SequenceValidator<String> sequenceValidator)
ChunkerME(ChunkerModel, int) instead
and use the ChunkerFactory to configure the SequenceValidator.model - The model for this chunker.beamSize - The size of the beam that should be used when decoding sequences.sequenceValidator - The SequenceValidator to determines whether the outcome
is valid for the preceding sequence. This can be used to implement constraints
on what sequences are valid.public ChunkerME(ChunkerModel model, int beamSize)
model - The model for this chunker.beamSize - The size of the beam that should be used when decoding sequences.public ChunkerME(ChunkerModel model)
model - @Deprecated public ChunkerME(MaxentModel mod)
mod - The maximum entropy model for this chunker.@Deprecated public ChunkerME(MaxentModel mod, ChunkerContextGenerator cg)
mod - The maximum entropy model for this chunker.cg - The context generator to be used by the specified model.@Deprecated public ChunkerME(MaxentModel mod, ChunkerContextGenerator cg, int beamSize)
mod - The maximum entropy model for this chunker.cg - The context generator to be used by the specified model.beamSize - The size of the beam that should be used when decoding sequences.@Deprecated public List<String> chunk(List<String> toks, List<String> tags)
Chunkerpublic String[] chunk(String[] toks, String[] tags)
Chunkerpublic Span[] chunkAsSpans(String[] toks, String[] tags)
ChunkerchunkAsSpans in interface Chunkertoks - an array of the tokens or words of the sequence.tags - an array of the pos tags of the sequence.@Deprecated public Sequence[] topKSequences(List<String> sentence, List<String> tags)
ChunkertopKSequences in interface Chunkersentence - The tokens of the sentence.tags - The pos-tags for the specified sentence.public Sequence[] topKSequences(String[] sentence, String[] tags)
ChunkertopKSequences in interface Chunkersentence - The tokens of the sentence.tags - The pos-tags for the specified sentence.public Sequence[] topKSequences(String[] sentence, String[] tags, double minSequenceScore)
ChunkertopKSequences in interface Chunkersentence - The tokens of the sentence.tags - The pos-tags for the specified sentence.minSequenceScore - A lower bound on the score of a returned sequence.public void probs(double[] probs)
chunk. The
specified array should be at least as large as the numbe of tokens in the previous call to chunk.probs - An array used to hold the probabilities of the last decoded sequence.public double[] probs()
chunk.chunk
when it was last called.public static ChunkerModel train(String lang, ObjectStream<ChunkSample> in, TrainingParameters mlParams, ChunkerFactory factory) throws IOException
IOExceptionpublic static ChunkerModel train(String lang, ObjectStream<ChunkSample> in, ChunkerContextGenerator contextGenerator, TrainingParameters mlParams) throws IOException
#train(String, ObjectStream, ChunkerContextGenerator, TrainingParameters, ChunkerFactory)
instead.IOExceptionpublic static ChunkerModel train(String lang, ObjectStream<ChunkSample> in, int cutoff, int iterations, ChunkerContextGenerator contextGenerator) throws IOException
train(String, ObjectStream, ChunkerContextGenerator, TrainingParameters)
instead and pass in a TrainingParameters object.IOException@Deprecated public static ChunkerModel train(String lang, ObjectStream<ChunkSample> in, int cutoff, int iterations) throws IOException, ObjectStreamException
train(String, ObjectStream, ChunkerContextGenerator, TrainingParameters)
instead and pass in a TrainingParameters object.ChunkerME.in - cutoff - iterations - IOExceptionObjectStreamExceptionCopyright © 2019 The Apache Software Foundation. All rights reserved.