<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Configuring OCR in Alfresco - Alfresco Community 5.2 in Alfresco Forum</title>
    <link>https://connect.hyland.com/t5/alfresco-forum/configuring-ocr-in-alfresco-alfresco-community-5-2/m-p/489269#M40085</link>
    <description>&lt;P class=""&gt;Hi,&lt;/P&gt;&lt;P class=""&gt;I am currently working on integrating OCR functionality into &lt;STRONG&gt;Alfresco 7.2&lt;/STRONG&gt;, running on a Windows Server. I have successfully installed the following dependencies:&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;&lt;P class=""&gt;&lt;STRONG&gt;Tesseract&lt;/STRONG&gt;&lt;/P&gt;&lt;/LI&gt;&lt;LI&gt;&lt;P class=""&gt;&lt;STRONG&gt;Ghostscript&lt;/STRONG&gt;&lt;/P&gt;&lt;/LI&gt;&lt;LI&gt;&lt;P class=""&gt;&lt;STRONG&gt;OCRmyPDF&lt;/STRONG&gt;&lt;/P&gt;&lt;/LI&gt;&lt;/UL&gt;&lt;P class=""&gt;I have placed the required JAR files:&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;&lt;P class=""&gt;simple-ocr-repo-2.3.1.jar&lt;/P&gt;&lt;/LI&gt;&lt;LI&gt;&lt;P class=""&gt;simple-ocr-share-2.3.1.jar&lt;/P&gt;&lt;/LI&gt;&lt;/UL&gt;&lt;P class=""&gt;into the appropriate &lt;STRONG&gt;platform&lt;/STRONG&gt; and &lt;STRONG&gt;share&lt;/STRONG&gt; directories of the Alfresco installation.&lt;/P&gt;&lt;P class=""&gt;The following properties have been added to the alfresco-global.properties file:&lt;/P&gt;&lt;DIV class=""&gt;&lt;DIV class=""&gt;ocr.command=C:/Users/admin/AppData/Roaming/Python/Python313/Scripts/ocrmypdf.exe&lt;/DIV&gt;&lt;DIV class=""&gt;ocr.output.verbose=true&lt;/DIV&gt;&lt;DIV class=""&gt;ocr.output.file.prefix.command=&lt;/DIV&gt;&lt;DIV class=""&gt;ocr.extra.commands=--verbose 1 --force-ocr --deskew -l eng+spa+fra&lt;/DIV&gt;&lt;DIV class=""&gt;ocr.server.os=windows&lt;/DIV&gt;&lt;/DIV&gt;&lt;P class=""&gt;However, when I attempt to use the &lt;STRONG&gt;OCR&lt;/STRONG&gt; feature from the document details section, I encounter the following error:&lt;/P&gt;&lt;DIV class=""&gt;&lt;DIV class=""&gt;Exception in thread "defaultAsyncAction1" java.lang.RuntimeException: java.lang.RuntimeException: java.lang.RuntimeException: java.lang.IllegalArgumentException: Invalid uri '${ocr.url}language=--verbose 1 --force-ocr --deskew -l eng+spa+fra&amp;amp;source=H%3A%5CDMS72%5Ctomcat%5Ctemp%5CAlfresco%5COCRTransformWorker_source_8194440309054693312.pdf&amp;amp;target=H%3A%5CDMS62%5Ctomcat%5Ctemp%5CAlfresco%5COCRTransformWorker_source_8194440309054693312_ocr.pdf': incorrect path&lt;/DIV&gt;&lt;DIV class=""&gt;at es.keensoft.alfresco.ocr.OCRExtractAction.executeImplInternal(OCRExtractAction.java:183)&lt;/DIV&gt;&lt;DIV class=""&gt;...&lt;/DIV&gt;&lt;DIV class=""&gt;Caused by: java.lang.IllegalArgumentException: Invalid uri '${ocr.url}language=--verbose 1 --force-ocr --deskew -l eng+spa+fra&amp;amp;source=H%3A%5CDMS62%5Ctomcat%5Ctemp%5CAlfresco%5COCRTransformWorker_source_8194440309054693312.pdf&amp;amp;target=H%3A%5CDMS62%5Ctomcat%5Ctemp%5CAlfresco%5COCRTransformWorker_source_8194440309054693312_ocr.pdf': incorrect path&lt;/DIV&gt;&lt;/DIV&gt;&lt;P class=""&gt;I would appreciate your assistance in identifying the cause and guiding me toward a resolution. Please let me know if you require any further logs, configuration files, or additional details.&lt;/P&gt;&lt;P class=""&gt;Thank you in advance for your support.&lt;/P&gt;</description>
    <pubDate>Thu, 24 Apr 2025 07:44:20 GMT</pubDate>
    <dc:creator>ShivanandaL</dc:creator>
    <dc:date>2025-04-24T07:44:20Z</dc:date>
    <item>
      <title>Configuring OCR in Alfresco - Alfresco Community 5.2</title>
      <link>https://connect.hyland.com/t5/alfresco-forum/configuring-ocr-in-alfresco-alfresco-community-5-2/m-p/110105#M30843</link>
      <description>Hi, I have successfully configured OCR with my Alfresco (Windows installation), but it only works for PNG, TIFF, JPG &amp;amp; JPEG files. I also need it to work for PDF files, because most of our scanned files are in PDF format. My&amp;nbsp;tesseract-ocr-transform-context.xml is: &amp;lt;?xml version='1.0' encoding='UTF-8'?&amp;gt;</description>
      <pubDate>Sun, 02 Jun 2019 15:29:26 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-forum/configuring-ocr-in-alfresco-alfresco-community-5-2/m-p/110105#M30843</guid>
      <dc:creator>anuradha1</dc:creator>
      <dc:date>2019-06-02T15:29:26Z</dc:date>
    </item>
    <item>
      <title>Re: Configuring OCR in Alfresco - Alfresco Community 5.2</title>
      <link>https://connect.hyland.com/t5/alfresco-forum/configuring-ocr-in-alfresco-alfresco-community-5-2/m-p/110106#M30844</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;Please refer to&amp;nbsp;&lt;A class="link-titled" href="http://www.contcentric.com/configuring-ocr-in-alfresco/" title="http://www.contcentric.com/configuring-ocr-in-alfresco/" rel="nofollow noopener noreferrer"&gt;Configuring OCR in Alfresco | ContCentric&lt;/A&gt;.&lt;/P&gt;&lt;P&gt;Thanks,&lt;/P&gt;&lt;P&gt;Vidhi&lt;/P&gt;&lt;P&gt;&lt;A href="http://www.contcentric.com/" rel="nofollow noopener noreferrer"&gt;ContCentric&lt;/A&gt;&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Tue, 04 Jun 2019 04:44:28 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-forum/configuring-ocr-in-alfresco-alfresco-community-5-2/m-p/110106#M30844</guid>
      <dc:creator>vidhipanchal</dc:creator>
      <dc:date>2019-06-04T04:44:28Z</dc:date>
    </item>
    <item>
      <title>Re: Configuring OCR in Alfresco - Alfresco Community 5.2</title>
      <link>https://connect.hyland.com/t5/alfresco-forum/configuring-ocr-in-alfresco-alfresco-community-5-2/m-p/489269#M40085</link>
      <description>&lt;P class=""&gt;Hi,&lt;/P&gt;&lt;P class=""&gt;I am currently working on integrating OCR functionality into &lt;STRONG&gt;Alfresco 7.2&lt;/STRONG&gt;, running on a Windows Server. I have successfully installed the following dependencies:&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;&lt;P class=""&gt;&lt;STRONG&gt;Tesseract&lt;/STRONG&gt;&lt;/P&gt;&lt;/LI&gt;&lt;LI&gt;&lt;P class=""&gt;&lt;STRONG&gt;Ghostscript&lt;/STRONG&gt;&lt;/P&gt;&lt;/LI&gt;&lt;LI&gt;&lt;P class=""&gt;&lt;STRONG&gt;OCRmyPDF&lt;/STRONG&gt;&lt;/P&gt;&lt;/LI&gt;&lt;/UL&gt;&lt;P class=""&gt;I have placed the required JAR files:&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;&lt;P class=""&gt;simple-ocr-repo-2.3.1.jar&lt;/P&gt;&lt;/LI&gt;&lt;LI&gt;&lt;P class=""&gt;simple-ocr-share-2.3.1.jar&lt;/P&gt;&lt;/LI&gt;&lt;/UL&gt;&lt;P class=""&gt;into the appropriate &lt;STRONG&gt;platform&lt;/STRONG&gt; and &lt;STRONG&gt;share&lt;/STRONG&gt; directories of the Alfresco installation.&lt;/P&gt;&lt;P class=""&gt;The following properties have been added to the alfresco-global.properties file:&lt;/P&gt;&lt;DIV class=""&gt;&lt;DIV class=""&gt;ocr.command=C:/Users/admin/AppData/Roaming/Python/Python313/Scripts/ocrmypdf.exe&lt;/DIV&gt;&lt;DIV class=""&gt;ocr.output.verbose=true&lt;/DIV&gt;&lt;DIV class=""&gt;ocr.output.file.prefix.command=&lt;/DIV&gt;&lt;DIV class=""&gt;ocr.extra.commands=--verbose 1 --force-ocr --deskew -l eng+spa+fra&lt;/DIV&gt;&lt;DIV class=""&gt;ocr.server.os=windows&lt;/DIV&gt;&lt;/DIV&gt;&lt;P class=""&gt;However, when I attempt to use the &lt;STRONG&gt;OCR&lt;/STRONG&gt; feature from the document details section, I encounter the following error:&lt;/P&gt;&lt;DIV class=""&gt;&lt;DIV class=""&gt;Exception in thread "defaultAsyncAction1" java.lang.RuntimeException: java.lang.RuntimeException: java.lang.RuntimeException: java.lang.IllegalArgumentException: Invalid uri '${ocr.url}language=--verbose 1 --force-ocr --deskew -l eng+spa+fra&amp;amp;source=H%3A%5CDMS72%5Ctomcat%5Ctemp%5CAlfresco%5COCRTransformWorker_source_8194440309054693312.pdf&amp;amp;target=H%3A%5CDMS62%5Ctomcat%5Ctemp%5CAlfresco%5COCRTransformWorker_source_8194440309054693312_ocr.pdf': incorrect path&lt;/DIV&gt;&lt;DIV class=""&gt;at es.keensoft.alfresco.ocr.OCRExtractAction.executeImplInternal(OCRExtractAction.java:183)&lt;/DIV&gt;&lt;DIV class=""&gt;...&lt;/DIV&gt;&lt;DIV class=""&gt;Caused by: java.lang.IllegalArgumentException: Invalid uri '${ocr.url}language=--verbose 1 --force-ocr --deskew -l eng+spa+fra&amp;amp;source=H%3A%5CDMS62%5Ctomcat%5Ctemp%5CAlfresco%5COCRTransformWorker_source_8194440309054693312.pdf&amp;amp;target=H%3A%5CDMS62%5Ctomcat%5Ctemp%5CAlfresco%5COCRTransformWorker_source_8194440309054693312_ocr.pdf': incorrect path&lt;/DIV&gt;&lt;/DIV&gt;&lt;P class=""&gt;I would appreciate your assistance in identifying the cause and guiding me toward a resolution. Please let me know if you require any further logs, configuration files, or additional details.&lt;/P&gt;&lt;P class=""&gt;Thank you in advance for your support.&lt;/P&gt;</description>
      <pubDate>Thu, 24 Apr 2025 07:44:20 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-forum/configuring-ocr-in-alfresco-alfresco-community-5-2/m-p/489269#M40085</guid>
      <dc:creator>ShivanandaL</dc:creator>
      <dc:date>2025-04-24T07:44:20Z</dc:date>
    </item>
  </channel>
</rss>

