Tesseract Alfresco Integration

kavilash23
Champ on-the-rise
Hi Guys,   

I have a custom script that converts JPG to text using Tesseract 3.01. When I execute the script from the command line, the conversion works fine, but when the same script is executed from within Alfresco 4.2.2, the application apparently cannot execute the tesseract command, while the other commands in the script run successfully. There are no errors in the Alfresco log files. The issue occurs on my Red Hat 6.2 box but not on my Ubuntu 12.04 box.

#!/bin/sh
SOURCE=$1
TARGET=ocrOutput
echo "Starting tesseract"
tesseract $SOURCE $TARGET
echo " Finished"


Can someone please advise?

4 REPLIES

afaust
Legendary Innovator
Hello,

Simple/dumb question: does the Linux user that runs the Alfresco process have all the required privileges? E.g., is that user allowed to see/execute the tesseract executable?

Regards
Axel

kavilash23
Champ on-the-rise
Hi Axel,

Alfresco is being run as the root user, so permissions should not be a problem. SELinux is also disabled on the box.

Any other suggestions?

Thanks.
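Since the Alfresco logs show no error, one way to narrow this down is to have the script record the environment it actually runs in and capture tesseract's own error output. The following is only a diagnostic sketch, not the fix referenced later in this thread. The function name run_ocr, the OCR_DEBUG_LOG variable, and the default log path /tmp/ocr-debug.log are assumptions made for illustration.

```shell
#!/bin/sh
# Diagnostic sketch (hypothetical names): logs the user, working directory,
# PATH and tesseract location seen by the script, then runs tesseract with
# stdout and stderr appended to the same log file.

run_ocr() {
    source_file=$1
    target_base=${2:-ocrOutput}
    # Assumed log location; override with the OCR_DEBUG_LOG variable.
    log=${OCR_DEBUG_LOG:-/tmp/ocr-debug.log}

    {
        echo "user: $(id -un)"
        echo "cwd:  $(pwd)"
        echo "PATH: $PATH"
        echo "tesseract: $(command -v tesseract || echo 'not found on PATH')"
    } >>"$log"

    # Capture anything tesseract prints, including error messages.
    tesseract "$source_file" "$target_base" >>"$log" 2>&1
    status=$?
    echo "tesseract exit status: $status" >>"$log"
    return "$status"
}

# Run only when the script is given arguments, e.g. ./ocr-debug.sh scan.jpg
if [ "$#" -gt 0 ]; then
    run_ocr "$@"
fi
```

Comparing the log produced by a command-line run with the one produced by a run from within Alfresco should show whether PATH, the working directory, or the exit status differs between the two.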

rvp29
Champ in-the-making
Has this issue been resolved? I'm having similar issues on my RHEL 6.5 box.

# rpm -qa | grep -i tess
tesseract-eng-3.01-2.el6.noarch
tesseract-3.01-2.el6.x86_64
#

Thanks


tesseract-3.01-2.el6.x86_64 is installed.

kavilash23
Champ on-the-rise
Hi,

We did indeed solve this issue.
See our blog post for the solution.
http://www.seedim.com.au/content/alfresco-search-pdf-images-using-transformations-and-tesseract-ocr