<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: OCRMyPDF(alfresco-simple-ocr) integration with Alfresco 6.1 using Kubernetes &amp; Helm charts in Alfresco Forum</title>
    <link>https://connect.hyland.com/t5/alfresco-forum/ocrmypdf-alfresco-simple-ocr-integration-with-alfresco-6-1-using/m-p/104658#M29676</link>
    <description>&lt;P&gt;Hi&amp;nbsp;&lt;A href="https://migration33.stage.lithium.com/t5/user/viewprofilepage/user-id/52129"&gt;@guilhermekellin&lt;/A&gt;,&lt;/P&gt;
&lt;P&gt;Great that&amp;nbsp;&lt;A href="https://migration33.stage.lithium.com/t5/user/viewprofilepage/user-id/82140"&gt;@SriramG&lt;/A&gt;&amp;nbsp;was able to help you resolve your problem - thanks for reporting back. I've marked this as solved.&lt;/P&gt;
&lt;P&gt;Kind regards,&amp;nbsp;&amp;nbsp;&lt;/P&gt;</description>
    <pubDate>Thu, 16 Jul 2020 13:33:38 GMT</pubDate>
    <dc:creator>EddieMay</dc:creator>
    <dc:date>2020-07-16T13:33:38Z</dc:date>
    <item>
      <title>OCRMyPDF(alfresco-simple-ocr) integration with Alfresco 6.1 using Kubernetes &amp; Helm charts</title>
      <link>https://connect.hyland.com/t5/alfresco-forum/ocrmypdf-alfresco-simple-ocr-integration-with-alfresco-6-1-using/m-p/104654#M29672</link>
      <description>&lt;P&gt;With the approach suggested by Fedorow in &lt;A href="https://hub.alfresco.com/t5/alfresco-content-services-forum/quot-ocr-extract-quot-action-doesn-t-work-well-alfresco-simple/m-p/299661#M19845" target="_blank" rel="noopener nofollow noreferrer"&gt;https://hub.alfresco.com/t5/alfresco-content-services-forum/quot-ocr-extract-quot-action-doesn-t-work-well-alfresco-simple/m-p/299661#M19845&lt;/A&gt;, I was able to make OCR work with Alfresco 6.1.0 and Docker.&lt;/P&gt;&lt;P&gt;I updated&lt;EM&gt;ocr_input&lt;/EM&gt;&lt;SPAN&gt;&amp;nbsp;and&amp;nbsp;&lt;/SPAN&gt;&lt;EM&gt;/ocr_output&lt;/EM&gt;&lt;SPAN&gt;&amp;nbsp;to use /usr/local/tomcat/ocr_input&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN&gt;and /usr/local/tomcat/ocr_out so that alfresco container can access these folders without any access issues.&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;Thanks Fedorow&lt;/P&gt;&lt;P&gt;Below are the changes done to docker-compose.yml and ocrmypdf.sh&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;EM&gt;docker-compose.yml&lt;/EM&gt;&lt;/P&gt;&lt;PRE&gt;...&lt;BR /&gt;services:&lt;BR /&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;alfresco:&lt;BR /&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;...&lt;BR /&gt; &amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;volumes:&lt;BR /&gt; &amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; - ocr-input:&lt;FONT color="#FF0000"&gt;/usr/local/tomcat/ocr_input&lt;/FONT&gt;&lt;BR /&gt;         - ocr-output:&lt;FONT color="#FF0000"&gt;/usr/local/tomcat/ocr_output&lt;/FONT&gt;&lt;BR /&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;...&lt;BR /&gt;&lt;BR /&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;ocrmypdf:&lt;BR /&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;...&lt;BR /&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; volumes:&lt;BR /&gt;             - ocr-input:&lt;FONT color="#FF0000"&gt;/usr/local/tomcat/ocr_input&lt;/FONT&gt;&lt;BR /&gt;             - ocr-output:/&lt;FONT color="#FF0000"&gt;usr/local/tomcat/ocr_output&lt;/FONT&gt;&lt;BR /&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;...&lt;BR /&gt;volumes:&lt;BR /&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;...&lt;BR /&gt;&amp;nbsp; &lt;FONT color="#FF0000"&gt;ocr-input:&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT color="#FF0000"&gt;       external: true&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT color="#FF0000"&gt;  ocr-output:&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT color="#FF0000"&gt;       external: true&lt;/FONT&gt;&lt;BR /&gt;...&lt;BR /&gt;&lt;BR /&gt;&lt;/PRE&gt;&lt;P&gt;&amp;nbsp;&lt;EM&gt;bin/ocrmypdf.sh&lt;/EM&gt;&lt;/P&gt;&lt;PRE&gt;#!/bin/bash&lt;BR /&gt;&lt;BR /&gt;INPUT_DIR=&lt;FONT color="#FF0000"&gt;/usr/local/tomcat/ocr_input&lt;/FONT&gt;&lt;BR /&gt;OUTPUT_DIR=&lt;FONT color="#FF0000"&gt;/usr/local/tomcat/ocr_output&lt;/FONT&gt;&lt;BR /&gt;&lt;BR /&gt;# ocrmypdf hostname&lt;BR /&gt;OCRMYPDF_SERVER="ocrmypdf"&lt;BR /&gt;&lt;BR /&gt;# identify parameters, input and output file&lt;BR /&gt;array=( "$@" )&lt;BR /&gt;len=${#array[@]}&lt;BR /&gt;ARGS=${array[@]:0:$len-2}&lt;BR /&gt;&lt;BR /&gt;LAST_ARGS="${@: -2}"&lt;BR /&gt;INPUT_FILE_PARAM=`echo "$LAST_ARGS" | cut -d ' ' -f 1`&lt;BR /&gt;OUTPUT_FILE_PARAM=`echo "$LAST_ARGS" | cut -d ' ' -f 2`&lt;BR /&gt;&lt;BR /&gt;# extract filenames&lt;BR /&gt;INPUT_FILE=$(basename "$INPUT_FILE_PARAM")&lt;BR /&gt;OUTPUT_FILE=$(basename "$OUTPUT_FILE_PARAM")&lt;BR /&gt;&lt;BR /&gt;# SSH parameters&lt;BR /&gt;SCP=cp&lt;BR /&gt;SSH=ssh&lt;BR /&gt;USER=root&lt;BR /&gt;&lt;BR /&gt;# copy original pdf to ocrmypdf server&lt;BR /&gt;$SCP $INPUT_FILE_PARAM $INPUT_DIR&lt;BR /&gt;&lt;BR /&gt;# execute ocrmypdf program&lt;BR /&gt;$SSH $USER@$OCRMYPDF_SERVER "/usr/bin/ocr.sh $ARGS $INPUT_DIR/$INPUT_FILE $OUTPUT_DIR/$OUTPUT_FILE"&lt;BR /&gt;&lt;BR /&gt;# copy transformed pdf back to alfresco path&lt;BR /&gt;$SCP $OUTPUT_DIR/$OUTPUT_FILE $OUTPUT_FILE_PARAM&lt;BR /&gt;&lt;BR /&gt;# remove temporal files&lt;BR /&gt;rm -f $INPUT_DIR/$INPUT_FILE $OUTPUT_DIR/$OUTPUT_FILE&lt;/PRE&gt;&lt;P&gt;After the above changes I was able to successfully run OCR with Alfresco 6.1.&amp;nbsp;&lt;/P&gt;&lt;P&gt;As we are running our Alfresco instance on Kubernetes and using HELM deployment, I need to configure the&amp;nbsp; volumes in values.yaml file but I am not sure how to configure the volumes in values.yaml file. Any one has any idea on how we can make similar configuration in kubernetes.&lt;/P&gt;&lt;P&gt;Any help apprecaited.&amp;nbsp;&lt;/P&gt;&lt;P&gt;- Sriram&lt;/P&gt;</description>
      <pubDate>Tue, 23 Jun 2020 22:04:50 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-forum/ocrmypdf-alfresco-simple-ocr-integration-with-alfresco-6-1-using/m-p/104654#M29672</guid>
      <dc:creator>SriramG</dc:creator>
      <dc:date>2020-06-23T22:04:50Z</dc:date>
    </item>
    <item>
      <title>Re: OCRMyPDF(alfresco-simple-ocr) integration with Alfresco 6.1 using Kubernetes &amp; Helm charts</title>
      <link>https://connect.hyland.com/t5/alfresco-forum/ocrmypdf-alfresco-simple-ocr-integration-with-alfresco-6-1-using/m-p/104655#M29673</link>
      <description>&lt;P&gt;Hello, I'm having trouble configuring ocrmypdf in Alfresco. I am using the "alfresco-content-repository-community: 6.2.0-ga" version. After I follow the setup instructions, the option to configure the OCR action is not displayed in Alfresco. Would you help me? Follow the link for the project I'm running.&lt;/P&gt;&lt;P&gt;&lt;A href="https://github.com/guilhermekelling/ocr.git" target="_blank" rel="nofollow noopener noreferrer"&gt;https://github.com/guilhermekelling/ocr.git&lt;/A&gt;&lt;/P&gt;&lt;P&gt;Thank you,&lt;/P&gt;&lt;P&gt;Guilherme Kelling&lt;/P&gt;</description>
      <pubDate>Tue, 14 Jul 2020 14:34:04 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-forum/ocrmypdf-alfresco-simple-ocr-integration-with-alfresco-6-1-using/m-p/104655#M29673</guid>
      <dc:creator>guilhermekellin</dc:creator>
      <dc:date>2020-07-14T14:34:04Z</dc:date>
    </item>
    <item>
      <title>Re: OCRMyPDF(alfresco-simple-ocr) integration with Alfresco 6.1 using Kubernetes &amp; Helm charts</title>
      <link>https://connect.hyland.com/t5/alfresco-forum/ocrmypdf-alfresco-simple-ocr-integration-with-alfresco-6-1-using/m-p/104656#M29674</link>
      <description>&lt;P&gt;OCR EXtract is an action which we assign to a folder as rule. Could you please check if the "OCR Extract" action is availble under actions in Folder rule?&lt;/P&gt;&lt;P&gt;If you are not seeing the action in the folder rule then the "simple-ocr-repo-2.3.1.jar" is either not properly installed in alfresco repository or look out for any exceptions around it.&lt;/P&gt;&lt;P&gt;- Sriram&lt;/P&gt;</description>
      <pubDate>Wed, 15 Jul 2020 14:46:59 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-forum/ocrmypdf-alfresco-simple-ocr-integration-with-alfresco-6-1-using/m-p/104656#M29674</guid>
      <dc:creator>SriramG</dc:creator>
      <dc:date>2020-07-15T14:46:59Z</dc:date>
    </item>
    <item>
      <title>Re: OCRMyPDF(alfresco-simple-ocr) integration with Alfresco 6.1 using Kubernetes &amp; Helm charts</title>
      <link>https://connect.hyland.com/t5/alfresco-forum/ocrmypdf-alfresco-simple-ocr-integration-with-alfresco-6-1-using/m-p/104657#M29675</link>
      <description>&lt;P&gt;Good morning Sriram,&lt;/P&gt;&lt;P&gt;You were right, the file "imple-ocr-repo-2.3.1.jar" was not in the right local. After adjusting the configuration, the option was displayed in Alfresco.&lt;/P&gt;&lt;P&gt;Thank you for your help.&lt;/P&gt;</description>
      <pubDate>Thu, 16 Jul 2020 11:56:58 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-forum/ocrmypdf-alfresco-simple-ocr-integration-with-alfresco-6-1-using/m-p/104657#M29675</guid>
      <dc:creator>guilhermekellin</dc:creator>
      <dc:date>2020-07-16T11:56:58Z</dc:date>
    </item>
    <item>
      <title>Re: OCRMyPDF(alfresco-simple-ocr) integration with Alfresco 6.1 using Kubernetes &amp; Helm charts</title>
      <link>https://connect.hyland.com/t5/alfresco-forum/ocrmypdf-alfresco-simple-ocr-integration-with-alfresco-6-1-using/m-p/104658#M29676</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;A href="https://migration33.stage.lithium.com/t5/user/viewprofilepage/user-id/52129"&gt;@guilhermekellin&lt;/A&gt;,&lt;/P&gt;
&lt;P&gt;Great that&amp;nbsp;&lt;A href="https://migration33.stage.lithium.com/t5/user/viewprofilepage/user-id/82140"&gt;@SriramG&lt;/A&gt;&amp;nbsp;was able to help you resolve your problem - thanks for reporting back. I've marked this as solved.&lt;/P&gt;
&lt;P&gt;Kind regards,&amp;nbsp;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Thu, 16 Jul 2020 13:33:38 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-forum/ocrmypdf-alfresco-simple-ocr-integration-with-alfresco-6-1-using/m-p/104658#M29676</guid>
      <dc:creator>EddieMay</dc:creator>
      <dc:date>2020-07-16T13:33:38Z</dc:date>
    </item>
    <item>
      <title>Re: OCRMyPDF(alfresco-simple-ocr) integration with Alfresco 6.1 using Kubernetes &amp; Helm charts</title>
      <link>https://connect.hyland.com/t5/alfresco-forum/ocrmypdf-alfresco-simple-ocr-integration-with-alfresco-6-1-using/m-p/488998#M40064</link>
      <description>&lt;P class=""&gt;Dear Sir,&lt;/P&gt;&lt;P class=""&gt;I am currently working on integrating OCR functionality into &lt;STRONG&gt;Alfresco 6.2, &lt;/STRONG&gt;running on a&amp;nbsp;&lt;STRONG&gt;&lt;STRONG&gt;Windows Server. &lt;/STRONG&gt;&lt;/STRONG&gt;I have successfully installed the following dependencies&lt;STRONG&gt;&lt;STRONG&gt;:&lt;/STRONG&gt;&lt;/STRONG&gt;&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;&lt;P class=""&gt;&lt;STRONG&gt;Tesseract&lt;/STRONG&gt;&lt;/P&gt;&lt;/LI&gt;&lt;LI&gt;&lt;P class=""&gt;&lt;STRONG&gt;Ghostscript&lt;/STRONG&gt;&lt;/P&gt;&lt;/LI&gt;&lt;LI&gt;&lt;P class=""&gt;&lt;STRONG&gt;OCRmyPDF&lt;/STRONG&gt;&lt;/P&gt;&lt;P class=""&gt;I have placed the required JAR files:&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;&lt;P class=""&gt;simple-ocr-repo-2.3.1.jar&lt;/P&gt;&lt;/LI&gt;&lt;LI&gt;&lt;P class=""&gt;simple-ocr-share-2.3.1.jar&lt;/P&gt;&lt;P class=""&gt;into the appropriate &lt;STRONG&gt;platform and &lt;STRONG&gt;share directories of the Alfresco installation.&lt;/STRONG&gt;&lt;/STRONG&gt;&lt;/P&gt;&lt;P class=""&gt;The following properties have been added to the alfresco-global.properties file:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;DIV class=""&gt;&lt;DIV class=""&gt;&amp;nbsp;&lt;DIV class=""&gt;&lt;SPAN&gt;&lt;SPAN&gt;&lt;SPAN class=""&gt;ocr.command=C:/Users/admin/AppData/Roaming/Python/Python313/Scripts/ocrmypdf.exe &lt;SPAN class=""&gt;ocr.output.verbose=&lt;SPAN class=""&gt;true&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;DIV class=""&gt;&lt;SPAN&gt;&lt;SPAN&gt;&lt;SPAN class=""&gt;ocr.output.file.prefix.command=&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;DIV class=""&gt;&lt;SPAN&gt;&lt;SPAN&gt;&lt;SPAN class=""&gt;ocr.extra.commands=--verbose &lt;SPAN class=""&gt;1 --force-ocr --deskew -l eng+spa+fra &lt;SPAN class=""&gt;ocr.server.os=windows&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;DIV class=""&gt;&lt;SPAN&gt;&lt;SPAN&gt;However, when I attempt to use the &lt;STRONG&gt;OCR feature&lt;SPAN&gt; from the document details section, I encounter the following error:&lt;/SPAN&gt;&lt;/STRONG&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;DIV class=""&gt;&lt;DIV class=""&gt;&lt;SPAN&gt;&lt;SPAN&gt;&lt;SPAN class=""&gt;Exception in thread &lt;SPAN class=""&gt;"defaultAsyncAction1" java.lang.&lt;SPAN class=""&gt;RuntimeException: java.lang.&lt;SPAN class=""&gt;RuntimeException: java.lang.&lt;SPAN class=""&gt;RuntimeException: java.lang.IllegalArgumentException: Invalid uri &lt;SPAN class=""&gt;'${ocr.url}language=--verbose 1 --force-ocr --deskew -l eng+spa+fra&amp;amp;source=H%3A%5CDMS62%5Ctomcat%5Ctemp%5CAlfresco%5COCRTransformWorker_source_8194440309054693312.pdf&amp;amp;target=H%3A%5CDMS62%5Ctomcat%5Ctemp%5CAlfresco%5COCRTransformWorker_source_8194440309054693312_ocr.pdf': incorrect path at es.keensoft.alfresco.ocr.OCRExtractAction.&lt;SPAN class=""&gt;executeImplInternal(OCRExtractAction.&lt;SPAN class=""&gt;java:&lt;SPAN class=""&gt;183) ... Caused by: java.lang.IllegalArgumentException: Invalid uri &lt;SPAN class=""&gt;'${ocr.url}language=--verbose 1 --force-ocr --deskew -l eng+spa+fra&amp;amp;source=H%3A%5CDMS62%5Ctomcat%5Ctemp%5CAlfresco%5COCRTransformWorker_source_8194440309054693312.pdf&amp;amp;target=H%3A%5CDMS62%5Ctomcat%5Ctemp%5CAlfresco%5COCRTransformWorker_source_8194440309054693312_ocr.pdf': incorrect path&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;P class=""&gt;I would appreciate your assistance in identifying the cause and guiding me toward a resolution. Please let me know if you require any further logs, configuration files, or additional details.&lt;/P&gt;&lt;P class=""&gt;Thank you in advance for your support.&lt;/P&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/LI&gt;&lt;/UL&gt;&lt;/LI&gt;&lt;/UL&gt;</description>
      <pubDate>Tue, 15 Apr 2025 10:14:41 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-forum/ocrmypdf-alfresco-simple-ocr-integration-with-alfresco-6-1-using/m-p/488998#M40064</guid>
      <dc:creator>ShivanandaL</dc:creator>
      <dc:date>2025-04-15T10:14:41Z</dc:date>
    </item>
  </channel>
</rss>

