cancel
Showing results for 
Search instead for 
Did you mean: 

Alfresco integrate OCR

dev7
Champ in-the-making
Champ in-the-making

I try integrate OCR in Alfresco but no success. I instal

 alfresco-community-installer-201612-win-x64 

and after that i instal tesseract in C:\Program Files (x86)\Tesseract-OCR

Then I try to integrate tesseract in alfresco,

first i create in C:\alfresco-community bat file OCR.bat

REM to see what happens echo from %1 to %2 >>C:\tmp\ocrtransform.log  copy /Y %1 C:\TMP\%~n1%~x1  REM  call tesseract and redirect output to $TARGET "C:\Program Files (x86)\Tesseract-OCR\tesseract.exe" C:\TMP\%~n1%~x1 %~d2%~p2%~n2 -l eng del C:\TMP\%~n1%~x1

Then three transformation xml added in "C:\alfresco-community\tomcat\shared\classes\alfresco\extension"

ocrpng-transform-context.xml  ocrjpeg-transform-context.xml  ocrtiff-transform-context.xml

then create an empty file name with ocrtransform.log in "C:\TMP"

and Downolad this jar from this link(https://docs.google.com/file/d/0B94FD2QmPSJCNHpuUVlicW95UjA/edit) and place this jar in this path "C:\Alfresco\tomcat\lib".

I use this link

http://alfrescosolutionsv.blogspot.com/2014/07/alfresco-integration-with-tesseract.html

but not working. Where I make mistake? Do I need to add anything else?

Any suggestion, idea?

1 REPLY 1

afaust
Legendary Innovator
Legendary Innovator

The blog post you found is quite old and based on an older Alfresco version. The changes suggested there may not be compatible with the newest version of Alfresco that you are using. Also, the distribution method for that extension / customisation is quite unprofessional. You should probably stick to more professional solutions, like GitHub - keensoft/alfresco-simple-ocr: Simple OCR action for Alfresco  from the company ‌ works for.