<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Reading parsed content from PDF or DOC files in Alfresco Archive</title>
    <link>https://connect.hyland.com/t5/alfresco-archive/reading-parsed-content-from-pdf-or-doc-files/m-p/226759#M179889</link>
    <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;SPAN&gt;Hi,&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;I am trying to read the content in a particular file that has been uploaded into Alfresco. I understand that the text from DOC and PDF files is parsed and stored by Alfresco via Lucene. &lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;So for this I would like to know or have some sample code on how to get content on a particular file uploaded to Alfresco. For this I am looking at using the Content Retrieval CMIS webscript provided OOTB.&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;However, I feel there is not enough documentation around the parameters that need to be passed in. I can read the parameters but there is no enough description on what those values would or should be. A sample call to this web script would really be useful to compensate the lack of enough documentation.&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;Also, I would like to know what are the other web scripts or any other ways that would allow me to read the actual content on a node.&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;Regards,&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;Phani&lt;/SPAN&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
    <pubDate>Tue, 16 Mar 2010 02:50:49 GMT</pubDate>
    <dc:creator>phani_av</dc:creator>
    <dc:date>2010-03-16T02:50:49Z</dc:date>
    <item>
      <title>Reading parsed content from PDF or DOC files</title>
      <link>https://connect.hyland.com/t5/alfresco-archive/reading-parsed-content-from-pdf-or-doc-files/m-p/226759#M179889</link>
      <description>Hi,I am trying to read the content in a particular file that has been uploaded into Alfresco. I understand that the text from DOC and PDF files is parsed and stored by Alfresco via Lucene. So for this I would like to know or have some sample code on how to get content on a particular file uploaded t</description>
      <pubDate>Tue, 16 Mar 2010 02:50:49 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-archive/reading-parsed-content-from-pdf-or-doc-files/m-p/226759#M179889</guid>
      <dc:creator>phani_av</dc:creator>
      <dc:date>2010-03-16T02:50:49Z</dc:date>
    </item>
    <item>
      <title>Re: Reading parsed content from PDF or DOC files</title>
      <link>https://connect.hyland.com/t5/alfresco-archive/reading-parsed-content-from-pdf-or-doc-files/m-p/226760#M179890</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;SPAN&gt;Hi phani,&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;We are trying to do the same: get the content of an uploaded document in Alfresco. For the moment we only have a simple webScript that retrieves the content of the doc: &lt;/SPAN&gt;&lt;BR /&gt;&lt;PRE class="language-none line-numbers"&gt;&lt;CODE&gt;var elemento = args.id;&lt;BR /&gt;&lt;BR /&gt;var busqueda = search.luceneSearch("+TYPE:\"cm:content\" +@sys\\:node-uuid:\""+elemento+"\"");&lt;BR /&gt;&lt;BR /&gt;model.resultset = busqueda;&lt;SPAN class="line-numbers-rows"&gt;&lt;SPAN&gt;‍&lt;/SPAN&gt;&lt;SPAN&gt;‍&lt;/SPAN&gt;&lt;SPAN&gt;‍&lt;/SPAN&gt;&lt;SPAN&gt;‍&lt;/SPAN&gt;&lt;SPAN&gt;‍&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/CODE&gt;&lt;/PRE&gt;&lt;BR /&gt;&lt;SPAN&gt;Our our problem is that we don't know how should the freemarker and the xml be? I mean, the reponse has to be xml, html?&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;Hope it helps!&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;Luis&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;ps: have you find an example of how to invoke the Content Retrieval Web Script?&lt;/SPAN&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Wed, 29 Sep 2010 11:39:58 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-archive/reading-parsed-content-from-pdf-or-doc-files/m-p/226760#M179890</guid>
      <dc:creator>gauchoproluanco</dc:creator>
      <dc:date>2010-09-29T11:39:58Z</dc:date>
    </item>
    <item>
      <title>Re: Reading parsed content from PDF or DOC files</title>
      <link>https://connect.hyland.com/t5/alfresco-archive/reading-parsed-content-from-pdf-or-doc-files/m-p/226761#M179891</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;SPAN&gt;Hi again, &lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;Finally we have been able to invoke the Retrieve Content Web Script in this way:&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;A _jive_internal="true" href="https://community.alfresco.com/host:port/alfresco" rel="nofollow noopener noreferrer"&gt;&lt;STRONG&gt;service/api/node/content/workspace/SpacesStore/uuid&lt;/STRONG&gt;'&amp;gt;http://host&lt;img id="smileytongue" class="emoticon emoticon-smileytongue" src="https://connect.hyland.com/i/smilies/16x16_smiley-tongue.png" alt="Smiley Tongue" title="Smiley Tongue" /&gt;ort/alfresco/&lt;STRONG&gt;service/api/node/content/workspace/SpacesStore/uuid&lt;/STRONG&gt;&lt;/A&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;STRONG&gt;uuid&lt;/STRONG&gt;&lt;SPAN&gt; is the uuid of the content that you want to download.&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;Hope it helps, &lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;Luis&lt;/SPAN&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Thu, 30 Sep 2010 06:30:23 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-archive/reading-parsed-content-from-pdf-or-doc-files/m-p/226761#M179891</guid>
      <dc:creator>gauchoproluanco</dc:creator>
      <dc:date>2010-09-30T06:30:23Z</dc:date>
    </item>
  </channel>
</rss>

