public class OfficeParser extends AbstractParser
| Modifier and Type | Class and Description |
|---|---|
| static class | OfficeParser.POIFSDocumentType |
| Constructor and Description |
|---|
| OfficeParser() |
| Modifier and Type | Method and Description |
|---|---|
| Set<MediaType> | getSupportedTypes(ParseContext context): Returns the set of media types supported by this parser when used with the given parse context. |
| protected void | parse(org.apache.poi.poifs.filesystem.DirectoryNode root, ParseContext context, Metadata metadata, XHTMLContentHandler xhtml) |
| void | parse(InputStream stream, ContentHandler handler, Metadata metadata, ParseContext context): Extracts properties and text from an MS Document input stream. |
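To illustrate getSupportedTypes, the following is a minimal sketch that lists the media types the parser reports for a default parse context. It assumes the class lives in the package org.apache.tika.parser.microsoft, which this page does not state. The names SupportedTypesExample and supportedTypeNames are hypothetical and exist only for this illustration.

```java
import java.util.Set;
import java.util.TreeSet;

import org.apache.tika.mime.MediaType;
import org.apache.tika.parser.ParseContext;
// Assumption: the package name is not given on this page.
import org.apache.tika.parser.microsoft.OfficeParser;

public class SupportedTypesExample {

    /** Collects the supported media types as sorted strings (hypothetical helper). */
    static Set<String> supportedTypeNames() {
        OfficeParser parser = new OfficeParser();
        Set<String> names = new TreeSet<>();
        // An empty ParseContext asks for the parser's default set of types.
        for (MediaType type : parser.getSupportedTypes(new ParseContext())) {
            names.add(type.toString());
        }
        return names;
    }

    public static void main(String[] args) {
        for (String name : supportedTypeNames()) {
            System.out.println(name);
        }
    }
}
```

The exact set printed depends on the Tika version on the classpath.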
public Set<MediaType> getSupportedTypes(ParseContext context)

Specified by: getSupportedTypes in interface Parser

Parameters:
context - parse context

public void parse(InputStream stream, ContentHandler handler, Metadata metadata, ParseContext context) throws IOException, SAXException, TikaException

Parameters:
stream - the document stream (input)
handler - handler for the XHTML SAX events (output)
metadata - document metadata (input and output)
context - parse context

Throws:
IOException - if the document stream could not be read
SAXException - if the SAX events could not be processed
TikaException - if the document could not be parsed

protected void parse(org.apache.poi.poifs.filesystem.DirectoryNode root,
ParseContext context,
Metadata metadata,
XHTMLContentHandler xhtml)
throws IOException,
SAXException,
TikaException
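
A call to the public parse method could look like the sketch below. It is not taken from this page and rests on several assumptions: the package org.apache.tika.parser.microsoft, the use of Tika's BodyContentHandler to collect plain text, and the hypothetical names OfficeParserExample and extractText. The input path is supplied by the caller.

```java
import java.io.IOException;
import java.io.InputStream;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;

import org.apache.tika.exception.TikaException;
import org.apache.tika.metadata.Metadata;
import org.apache.tika.parser.ParseContext;
// Assumption: the package name is not given on this page.
import org.apache.tika.parser.microsoft.OfficeParser;
import org.apache.tika.sax.BodyContentHandler;
import org.xml.sax.SAXException;

public class OfficeParserExample {

    /**
     * Hypothetical helper: parses one MS Office file, returns its body text,
     * and fills the given Metadata object as a side effect.
     */
    static String extractText(Path file, Metadata metadata)
            throws IOException, SAXException, TikaException {
        // A write limit of -1 disables BodyContentHandler's character limit.
        BodyContentHandler handler = new BodyContentHandler(-1);
        try (InputStream stream = Files.newInputStream(file)) {
            // handler receives the XHTML SAX events (output);
            // metadata is both input and output.
            new OfficeParser().parse(stream, handler, metadata, new ParseContext());
        }
        return handler.toString();
    }

    public static void main(String[] args) throws Exception {
        Path file = Paths.get(args[0]); // path to a document, given by the caller
        Metadata metadata = new Metadata();
        System.out.println(extractText(file, metadata));
        for (String name : metadata.names()) {
            System.out.println(name + " = " + metadata.get(name));
        }
    }
}
```

The caller opens and closes the stream here, so the try-with-resources block owns its lifetime. The three checked exceptions are declared rather than caught so that each failure mode stays visible to the caller.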
Copyright © 2007–2016 The Apache Software Foundation. All rights reserved.