|
||||||||||
| PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
| SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD | |||||||||
java.lang.Objectorg.opencms.search.extractors.A_CmsTextExtractor
org.opencms.search.extractors.A_CmsTextExtractorMsOfficeBase
public abstract class A_CmsTextExtractorMsOfficeBase
Base class to extract summary information from MS office documents.
| Field Summary | |
|---|---|
protected static String |
ENCODING_CP1252
Windows Cp1252 endocing (western europe) is used as default for single byte fields. |
protected static String |
ENCODING_UTF16
UTF-16 encoding is used for double byte fields. |
protected static String |
POWERPOINT_EVENT_NAME
Event event name for a MS PowerPoint document. |
protected static int |
PPT_TEXTBYTE_ATOM
PPT text byte atom. |
protected static int |
PPT_TEXTCHAR_ATOM
PPT text char atom. |
| Fields inherited from class org.opencms.search.extractors.A_CmsTextExtractor |
|---|
m_inputBuffer |
| Constructor Summary | |
|---|---|
A_CmsTextExtractorMsOfficeBase()
|
|
| Method Summary | |
|---|---|
protected void |
cleanup()
Cleans up some internal memory. |
protected Map |
extractMetaInformation()
Returns a map with the extracted meta information from the document. |
void |
processPOIFSReaderEvent(POIFSReaderEvent event)
|
| Methods inherited from class org.opencms.search.extractors.A_CmsTextExtractor |
|---|
extractText, extractText, extractText, extractText, getStreamCopy, removeControlChars |
| Methods inherited from class java.lang.Object |
|---|
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
| Field Detail |
|---|
protected static final String ENCODING_CP1252
protected static final String ENCODING_UTF16
protected static final String POWERPOINT_EVENT_NAME
protected static final int PPT_TEXTBYTE_ATOM
protected static final int PPT_TEXTCHAR_ATOM
| Constructor Detail |
|---|
public A_CmsTextExtractorMsOfficeBase()
| Method Detail |
|---|
public void processPOIFSReaderEvent(POIFSReaderEvent event)
processPOIFSReaderEvent in interface POIFSReaderListenerPOIFSReaderListener.processPOIFSReaderEvent(org.apache.poi.poifs.eventfilesystem.POIFSReaderEvent)protected void cleanup()
protected Map extractMetaInformation()
|
||||||||||
| PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
| SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD | |||||||||