topic Reading parsed content from PDF or DOC files in Alfresco Archive

Reading parsed content from PDF or DOC files

phani_av — Tue, 16 Mar 2010 02:50:49 GMT

Hi,I am trying to read the content in a particular file that has been uploaded into Alfresco. I understand that the text from DOC and PDF files is parsed and stored by Alfresco via Lucene. So for this I would like to know or have some sample code on how to get content on a particular file uploaded t

Re: Reading parsed content from PDF or DOC files

gauchoproluanco — Wed, 29 Sep 2010 11:39:58 GMT

Hi phani,

We are trying to do the same: get the content of an uploaded document in Alfresco. For the moment we only have a simple webScript that retrieves the content of the doc:

var elemento = args.id;

var busqueda = search.luceneSearch("+TYPE:\"cm:content\" +@sys\\:node-uuid:\""+elemento+"\"");

model.resultset = busqueda;‍‍‍‍‍

Our our problem is that we don't know how should the freemarker and the xml be? I mean, the reponse has to be xml, html?

Hope it helps!

Luis

ps: have you find an example of how to invoke the Content Retrieval Web Script?

Re: Reading parsed content from PDF or DOC files

gauchoproluanco — Thu, 30 Sep 2010 06:30:23 GMT

Hi again,

Finally we have been able to invoke the Retrieve Content Web Script in this way:

service/api/node/content/workspace/SpacesStore/uuid'>http://hostort/alfresco/service/api/node/content/workspace/SpacesStore/uuid

uuid is the uuid of the content that you want to download.

Hope it helps,

Luis