Read document content (doc, docx, odt)
Options
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
10-28-2009 12:15 PM
Hi all,
In my first application I've read document content using Alfresco Web Service and apache POI 3.5 and PdfBox libs.
Now I'm developing an action for alfresco 3.2, so I cannot use POI 3.5 because alfresco already contains the 3.1 version.
For pdf documents there's no problem….because I can use the ContentReader inputStream in pdfbox and convert it into plain text (String).
But for doc/docx/odt? How can I read the document content?
Thanks in advance,
Revenge
In my first application I've read document content using Alfresco Web Service and apache POI 3.5 and PdfBox libs.
Now I'm developing an action for alfresco 3.2, so I cannot use POI 3.5 because alfresco already contains the 3.1 version.
For pdf documents there's no problem….because I can use the ContentReader inputStream in pdfbox and convert it into plain text (String).
But for doc/docx/odt? How can I read the document content?
Thanks in advance,
Revenge
Labels:
- Labels:
-
Archive
10 REPLIES 10

Options
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
08-25-2014 05:38 AM
Can anyone tell me how to make use of ContentReader. Do I need to include any library?
