|
|||||||||
| PREV NEXT | FRAMES NO FRAMES | ||||||||
| Uses of TextExtractor in org.apache.jackrabbit.extractor |
|---|
| Subinterfaces of TextExtractor in org.apache.jackrabbit.extractor | |
|---|---|
interface |
DelegatingTextExtractor
Interface for text extractors that need to delegate the extraction of parts of content documents to another text extractor. |
| Classes in org.apache.jackrabbit.extractor that implement TextExtractor | |
|---|---|
class |
AbstractTextExtractor
Base class for text extractor implementations. |
class |
CompositeTextExtractor
Composite text extractor. |
class |
DefaultTextExtractor
Composite text extractor that by default contains the standard text extractors found in this package. |
class |
EmptyTextExtractor
Dummy text extractor that always returns and empty reader for all documents. |
class |
HTMLTextExtractor
Text extractor for HyperText Markup Language (HTML). |
class |
MsExcelTextExtractor
Text extractor for Microsoft Excel sheets. |
class |
MsOutlookTextExtractor
Text extractor for Microsoft Outlook messages. |
class |
MsPowerPointTextExtractor
Text extractor for Microsoft PowerPoint presentations. |
class |
MsWordTextExtractor
Text extractor for Microsoft Word documents. |
class |
OpenOfficeTextExtractor
Text extractor for OpenOffice documents. |
class |
PdfTextExtractor
Text extractor for Portable Document Format (PDF). |
class |
PlainTextExtractor
Text extractor for plain text. |
class |
PngTextExtractor
Text extractor for png/apng/mng images. |
class |
RTFTextExtractor
Text extractor for Rich Text Format (RTF) |
class |
XMLTextExtractor
Text extractor for XML documents. |
| Methods in org.apache.jackrabbit.extractor with parameters of type TextExtractor | |
|---|---|
void |
CompositeTextExtractor.addTextExtractor(TextExtractor extractor)
Adds a component text extractor. |
void |
DelegatingTextExtractor.setDelegateTextExtractor(TextExtractor extractor)
Sets the text textractor to which this extractor should delegate any partial text extraction tasks. |
|
|||||||||
| PREV NEXT | FRAMES NO FRAMES | ||||||||