public class DirContentSource extends ContentSource
ContentSource using the Dir collection for its input. Supports
the following configuration parameters (on top of ContentSource):
HTMLParser class to use for
parsing the TREC documents content (default=DemoHTMLParser).
| Modifier and Type | Class and Description |
|---|---|
static class |
DirContentSource.Iterator |
BUFFER_SIZE, encoding, forever, logStep, verbose| Constructor and Description |
|---|
DirContentSource() |
| Modifier and Type | Method and Description |
|---|---|
void |
close()
Called when reading from this content source is no longer required.
|
DocData |
getNextDocData(DocData docData)
Returns the next
DocData from the content source. |
void |
resetInputs()
Resets the input for this content source, so that the test would behave as
if it was just started, input-wise.
|
void |
setConfig(Config config)
Sets the
Config for this content source. |
addBytes, addDoc, collectFiles, getBytesCount, getConfig, getDocsCount, getInputStream, getTotalBytesCount, getTotalDocsCount, shouldLogpublic void close()
throws IOException
ContentSourceclose in class ContentSourceIOExceptionpublic DocData getNextDocData(DocData docData) throws NoMoreDataException, IOException
ContentSourceDocData from the content source.getNextDocData in class ContentSourceNoMoreDataExceptionIOExceptionpublic void resetInputs()
throws IOException
ContentSourceNOTE: the default implementation resets the number of bytes and documents generated since the last reset, so it's important to call super.resetInputs in case you override this method.
resetInputs in class ContentSourceIOExceptionpublic void setConfig(Config config)
ContentSourceConfig for this content source. If you override this
method, you must call super.setConfig.setConfig in class ContentSourceCopyright © 2000-2012 Apache Software Foundation. All Rights Reserved.