<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Using alfresco-simple-ocr to get the content from a pdf to create a text document in Alfresco Forum</title>
    <link>https://connect.hyland.com/t5/alfresco-forum/using-alfresco-simple-ocr-to-get-the-content-from-a-pdf-to/m-p/49039#M19004</link>
    <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;I all! I'm evaluating Alfresco Community Edition to use in our organization, currently I have installed the 5.2 version on a&amp;nbsp; Ubuntu Server&amp;nbsp;16.04 working with alfresco-simple-ocr using pdfsandwich&amp;nbsp;0.1.4.esearch I&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;I like to know if there is some way to get the content from a image or pdf file to use it as content for a new document. After some&amp;nbsp;research I found a few reference to &lt;A class="link-titled" href="https://docs.alfresco.com/5.2/references/dev-extension-points-content-transformer.html" title="https://docs.alfresco.com/5.2/references/dev-extension-points-content-transformer.html" rel="nofollow noopener noreferrer"&gt;Content Transformers (and Renditions) &lt;/A&gt;&amp;nbsp;but before continue I like to know if&amp;nbsp; no one already do that and if not its it the correct path to follow? I'm new here so any clue is appreciated.&amp;nbsp;&lt;BR /&gt;&lt;BR /&gt;Cheers.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;nueces...&amp;nbsp;&amp;nbsp;&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
    <pubDate>Wed, 05 Dec 2018 22:23:52 GMT</pubDate>
    <dc:creator>nueces</dc:creator>
    <dc:date>2018-12-05T22:23:52Z</dc:date>
    <item>
      <title>Using alfresco-simple-ocr to get the content from a pdf to create a text document</title>
      <link>https://connect.hyland.com/t5/alfresco-forum/using-alfresco-simple-ocr-to-get-the-content-from-a-pdf-to/m-p/49039#M19004</link>
      <description>I all! I'm evaluating Alfresco Community Edition to use in our organization, currently I have installed the 5.2 version on a&amp;nbsp; Ubuntu Server&amp;nbsp;16.04 working with alfresco-simple-ocr using pdfsandwich&amp;nbsp;0.1.4.esearch II like to know if there is some way to get the content from a image or pdf file to use i</description>
      <pubDate>Wed, 05 Dec 2018 22:23:52 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-forum/using-alfresco-simple-ocr-to-get-the-content-from-a-pdf-to/m-p/49039#M19004</guid>
      <dc:creator>nueces</dc:creator>
      <dc:date>2018-12-05T22:23:52Z</dc:date>
    </item>
    <item>
      <title>Re: Using alfresco-simple-ocr to get the content from a pdf to create a text document</title>
      <link>https://connect.hyland.com/t5/alfresco-forum/using-alfresco-simple-ocr-to-get-the-content-from-a-pdf-to/m-p/49040#M19005</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;There can be two reasons to integrate OCR with alfresco, either to make the image with text searchable or to capture specific information from the document in order to do the further operations based on that.&lt;/P&gt;&lt;P&gt;If you just simply want to make images containing text searchable then follow&amp;nbsp;&lt;A class="link-titled" href="http://www.contcentric.com/configuring-ocr-in-alfresco/" title="http://www.contcentric.com/configuring-ocr-in-alfresco/" rel="nofollow noopener noreferrer"&gt;Configuring OCR in Alfresco | ContCentric&lt;/A&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Please mention, for what reason you need OCR with Alfresco?&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Thanks&lt;/P&gt;&lt;P&gt;&lt;A href="http://www.contcentric.com" rel="nofollow noopener noreferrer"&gt;ContCentric&lt;/A&gt;&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Fri, 07 Dec 2018 06:13:55 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-forum/using-alfresco-simple-ocr-to-get-the-content-from-a-pdf-to/m-p/49040#M19005</guid>
      <dc:creator>kintu_barot</dc:creator>
      <dc:date>2018-12-07T06:13:55Z</dc:date>
    </item>
    <item>
      <title>Re: Using alfresco-simple-ocr to get the content from a pdf to create a text document</title>
      <link>https://connect.hyland.com/t5/alfresco-forum/using-alfresco-simple-ocr-to-get-the-content-from-a-pdf-to/m-p/49041#M19006</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;The second one. Capture the content from the image/pdf and generate a new text based document.&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;thanks.&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Fri, 07 Dec 2018 15:29:48 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-forum/using-alfresco-simple-ocr-to-get-the-content-from-a-pdf-to/m-p/49041#M19006</guid>
      <dc:creator>nueces</dc:creator>
      <dc:date>2018-12-07T15:29:48Z</dc:date>
    </item>
  </channel>
</rss>

