<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Configuring OCR in Alfresco - Alfresco Community 5.2 in Alfresco Forum</title>
    <link>https://connect.hyland.com/t5/alfresco-forum/configuring-ocr-in-alfresco-alfresco-community-5-2/m-p/110105#M30843</link>
    <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;I am successfully configured OCR with my alfresco (windows installation). But, it only working for PNG, TIFF, JPG &amp;amp; GPEG. But i need it for PDF extension also because most of scanned files are in pdf format.&lt;/P&gt;&lt;P&gt;My&amp;nbsp;tesseract-ocr-transform-context.xml is,&lt;/P&gt;&lt;BLOCKQUOTE class="jive_macro_quote jive-quote jive_text_macro"&gt;&lt;P style="padding-left: 30px;"&gt;&amp;lt;?xml version='1.0' encoding='UTF-8'?&amp;gt;&lt;BR /&gt;&amp;lt;!DOCTYPE beans PUBLIC '-//SPRING//DTD BEAN//EN' 'http://www.springframework.org/dtd/spring-beans.dtd'&amp;gt;&lt;BR /&gt;&amp;lt;beans default-lazy-init="false" default-autowire="no" default-dependency-check="none"&amp;gt;&lt;BR /&gt; &amp;lt;bean id="transformer.worker.ocr.tiff" class="org.alfresco.repo.content.transform.RuntimeExecutableContentTransformerWorker" lazy-init="default" autowire="default" dependency-check="default"&amp;gt;&lt;BR /&gt; &amp;lt;property name="mimetypeService"&amp;gt;&lt;BR /&gt; &amp;lt;ref bean="mimetypeService" /&amp;gt;&lt;BR /&gt; &amp;lt;/property&amp;gt;&lt;BR /&gt; &amp;lt;property name="checkCommand"&amp;gt;&lt;BR /&gt; &amp;lt;bean class="org.alfresco.util.exec.RuntimeExec" lazy-init="default" autowire="default" dependency-check="default"&amp;gt;&lt;BR /&gt; &amp;lt;property name="commandsAndArguments"&amp;gt;&lt;BR /&gt; &amp;lt;map&amp;gt;&lt;BR /&gt; &amp;lt;entry key="Windows.*"&amp;gt;&lt;BR /&gt; &amp;lt;list&amp;gt;&lt;BR /&gt; &amp;lt;value&amp;gt;C:\Windows\System32\cmd.exe&amp;lt;/value&amp;gt;&lt;BR /&gt; &amp;lt;value&amp;gt;/C&amp;lt;/value&amp;gt;&lt;BR /&gt; &amp;lt;value&amp;gt;dir C:\alfresco-community\ocr.bat&amp;lt;/value&amp;gt;&lt;BR /&gt; &amp;lt;/list&amp;gt;&lt;BR /&gt; &amp;lt;/entry&amp;gt;&lt;BR /&gt; &amp;lt;/map&amp;gt;&lt;BR /&gt; &amp;lt;/property&amp;gt;&lt;BR /&gt; &amp;lt;property name="errorCodes"&amp;gt;&lt;BR /&gt; &amp;lt;value&amp;gt;1&amp;lt;/value&amp;gt;&lt;BR /&gt; &amp;lt;/property&amp;gt;&lt;BR /&gt; &amp;lt;/bean&amp;gt;&lt;BR /&gt; &amp;lt;/property&amp;gt;&lt;BR /&gt; &amp;lt;property name="transformCommand"&amp;gt;&lt;BR /&gt; &amp;lt;bean class="org.alfresco.util.exec.RuntimeExec" lazy-init="default" autowire="default" dependency-check="default"&amp;gt;&lt;BR /&gt; &amp;lt;property name="commandsAndArguments"&amp;gt;&lt;BR /&gt; &amp;lt;map&amp;gt;&lt;BR /&gt; &amp;lt;entry key="Windows.*"&amp;gt;&lt;BR /&gt; &amp;lt;list&amp;gt;&lt;BR /&gt; &amp;lt;value&amp;gt;C:\Windows\System32\cmd.exe&amp;lt;/value&amp;gt;&lt;BR /&gt; &amp;lt;value&amp;gt;/C&amp;lt;/value&amp;gt;&lt;BR /&gt; &amp;lt;value&amp;gt;C:\alfresco-community\ocr.bat&amp;lt;/value&amp;gt;&lt;BR /&gt; &amp;lt;value&amp;gt;"${source}"&amp;lt;/value&amp;gt;&lt;BR /&gt; &amp;lt;value&amp;gt;"${target}"&amp;lt;/value&amp;gt;&lt;BR /&gt; &amp;lt;/list&amp;gt;&lt;BR /&gt; &amp;lt;/entry&amp;gt;&lt;BR /&gt; &amp;lt;/map&amp;gt;&lt;BR /&gt; &amp;lt;/property&amp;gt;&lt;BR /&gt; &amp;lt;property name="errorCodes"&amp;gt;&lt;BR /&gt; &amp;lt;value&amp;gt;1,2&amp;lt;/value&amp;gt;&lt;BR /&gt; &amp;lt;/property&amp;gt;&lt;BR /&gt; &amp;lt;/bean&amp;gt;&lt;BR /&gt; &amp;lt;/property&amp;gt;&lt;BR /&gt; &amp;lt;property name="explicitTransformations"&amp;gt;&lt;BR /&gt; &amp;lt;list&amp;gt;&lt;BR /&gt; &amp;lt;bean class="org.alfresco.repo.content.transform.ExplictTransformationDetails" lazy-init="default" autowire="default" dependency-check="default"&amp;gt;&lt;BR /&gt; &amp;lt;property name="sourceMimetype"&amp;gt;&lt;BR /&gt; &amp;lt;value&amp;gt;image/tiff&amp;lt;/value&amp;gt;&lt;BR /&gt; &amp;lt;/property&amp;gt;&lt;BR /&gt; &amp;lt;property name="targetMimetype"&amp;gt;&lt;BR /&gt; &amp;lt;value&amp;gt;text/plain&amp;lt;/value&amp;gt;&lt;BR /&gt; &amp;lt;/property&amp;gt;&lt;BR /&gt; &amp;lt;/bean&amp;gt;&lt;BR /&gt; &amp;lt;bean class="org.alfresco.repo.content.transform.ExplictTransformationDetails" lazy-init="default" autowire="default" dependency-check="default"&amp;gt;&lt;BR /&gt; &amp;lt;property name="sourceMimetype"&amp;gt;&lt;BR /&gt; &amp;lt;value&amp;gt;image/png&amp;lt;/value&amp;gt;&lt;BR /&gt; &amp;lt;/property&amp;gt;&lt;BR /&gt; &amp;lt;property name="targetMimetype"&amp;gt;&lt;BR /&gt; &amp;lt;value&amp;gt;text/plain&amp;lt;/value&amp;gt;&lt;BR /&gt; &amp;lt;/property&amp;gt;&lt;BR /&gt; &amp;lt;/bean&amp;gt;&lt;BR /&gt; &amp;lt;bean class="org.alfresco.repo.content.transform.ExplictTransformationDetails" lazy-init="default" autowire="default" dependency-check="default"&amp;gt;&lt;BR /&gt; &amp;lt;property name="sourceMimetype"&amp;gt;&lt;BR /&gt; &amp;lt;value&amp;gt;image/jpeg&amp;lt;/value&amp;gt;&lt;BR /&gt; &amp;lt;/property&amp;gt;&lt;BR /&gt; &amp;lt;property name="targetMimetype"&amp;gt;&lt;BR /&gt; &amp;lt;value&amp;gt;text/plain&amp;lt;/value&amp;gt;&lt;BR /&gt; &amp;lt;/property&amp;gt;&lt;BR /&gt; &amp;lt;/bean&amp;gt;&lt;BR /&gt; &amp;lt;bean class="org.alfresco.repo.content.transform.ExplictTransformationDetails" lazy-init="default" autowire="default" dependency-check="default"&amp;gt;&lt;BR /&gt; &amp;lt;property name="sourceMimetype"&amp;gt;&lt;BR /&gt; &amp;lt;value&amp;gt;image/jpg&amp;lt;/value&amp;gt;&lt;BR /&gt; &amp;lt;/property&amp;gt;&lt;BR /&gt; &amp;lt;property name="targetMimetype"&amp;gt;&lt;BR /&gt; &amp;lt;value&amp;gt;text/plain&amp;lt;/value&amp;gt;&lt;BR /&gt; &amp;lt;/property&amp;gt;&lt;BR /&gt; &amp;lt;/bean&amp;gt;&lt;BR /&gt; &amp;lt;/list&amp;gt;&lt;BR /&gt; &amp;lt;/property&amp;gt;&lt;BR /&gt; &amp;lt;/bean&amp;gt;&lt;BR /&gt; &amp;lt;bean id="transformer.ocr.tiff" class="org.alfresco.repo.content.transform.ProxyContentTransformer" parent="baseContentTransformer" lazy-init="default" autowire="default" dependency-check="default"&amp;gt;&lt;BR /&gt; &amp;lt;property name="worker"&amp;gt;&lt;BR /&gt; &amp;lt;ref bean="transformer.worker.ocr.tiff" /&amp;gt;&lt;BR /&gt; &amp;lt;/property&amp;gt;&lt;BR /&gt; &amp;lt;/bean&amp;gt;&lt;BR /&gt;&amp;lt;/beans&amp;gt;&lt;/P&gt;&lt;BLOCKQUOTE class="jive_macro_quote jive-quote jive_text_macro"&gt;&lt;P&gt;&lt;/P&gt;&lt;BLOCKQUOTE class="jive_macro_quote jive-quote jive_text_macro"&gt;&lt;P&gt;&lt;/P&gt;&lt;BLOCKQUOTE class="jive_macro_quote jive-quote jive_text_macro"&gt;&lt;P&gt;&lt;/P&gt;&lt;/BLOCKQUOTE&gt;&lt;/BLOCKQUOTE&gt;&lt;/BLOCKQUOTE&gt;&lt;/BLOCKQUOTE&gt;&lt;P&gt;I changed it into this including another bean (red colored),&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;BLOCKQUOTE class="jive_macro_quote jive-quote jive_text_macro"&gt;&lt;P&gt;&amp;lt;?xml version='1.0' encoding='UTF-8'?&amp;gt;&lt;BR /&gt;&amp;lt;!DOCTYPE beans PUBLIC '-//SPRING//DTD BEAN//EN' 'http://www.springframework.org/dtd/spring-beans.dtd'&amp;gt;&lt;BR /&gt;&amp;lt;beans default-lazy-init="false" default-autowire="no" default-dependency-check="none"&amp;gt;&lt;BR /&gt; &amp;lt;bean id="transformer.worker.ocr.tiff" class="org.alfresco.repo.content.transform.RuntimeExecutableContentTransformerWorker" lazy-init="default" autowire="default" dependency-check="default"&amp;gt;&lt;BR /&gt; &amp;lt;property name="mimetypeService"&amp;gt;&lt;BR /&gt; &amp;lt;ref bean="mimetypeService" /&amp;gt;&lt;BR /&gt; &amp;lt;/property&amp;gt;&lt;BR /&gt; &amp;lt;property name="checkCommand"&amp;gt;&lt;BR /&gt; &amp;lt;bean class="org.alfresco.util.exec.RuntimeExec" lazy-init="default" autowire="default" dependency-check="default"&amp;gt;&lt;BR /&gt; &amp;lt;property name="commandsAndArguments"&amp;gt;&lt;BR /&gt; &amp;lt;map&amp;gt;&lt;BR /&gt; &amp;lt;entry key="Windows.*"&amp;gt;&lt;BR /&gt; &amp;lt;list&amp;gt;&lt;BR /&gt; &amp;lt;value&amp;gt;C:\Windows\System32\cmd.exe&amp;lt;/value&amp;gt;&lt;BR /&gt; &amp;lt;value&amp;gt;/C&amp;lt;/value&amp;gt;&lt;BR /&gt; &amp;lt;value&amp;gt;dir C:\alfresco-community\ocr.bat&amp;lt;/value&amp;gt;&lt;BR /&gt; &amp;lt;/list&amp;gt;&lt;BR /&gt; &amp;lt;/entry&amp;gt;&lt;BR /&gt; &amp;lt;/map&amp;gt;&lt;BR /&gt; &amp;lt;/property&amp;gt;&lt;BR /&gt; &amp;lt;property name="errorCodes"&amp;gt;&lt;BR /&gt; &amp;lt;value&amp;gt;1&amp;lt;/value&amp;gt;&lt;BR /&gt; &amp;lt;/property&amp;gt;&lt;BR /&gt; &amp;lt;/bean&amp;gt;&lt;BR /&gt; &amp;lt;/property&amp;gt;&lt;BR /&gt; &amp;lt;property name="transformCommand"&amp;gt;&lt;BR /&gt; &amp;lt;bean class="org.alfresco.util.exec.RuntimeExec" lazy-init="default" autowire="default" dependency-check="default"&amp;gt;&lt;BR /&gt; &amp;lt;property name="commandsAndArguments"&amp;gt;&lt;BR /&gt; &amp;lt;map&amp;gt;&lt;BR /&gt; &amp;lt;entry key="Windows.*"&amp;gt;&lt;BR /&gt; &amp;lt;list&amp;gt;&lt;BR /&gt; &amp;lt;value&amp;gt;C:\Windows\System32\cmd.exe&amp;lt;/value&amp;gt;&lt;BR /&gt; &amp;lt;value&amp;gt;/C&amp;lt;/value&amp;gt;&lt;BR /&gt; &amp;lt;value&amp;gt;C:\alfresco-community\ocr.bat&amp;lt;/value&amp;gt;&lt;BR /&gt; &amp;lt;value&amp;gt;"${source}"&amp;lt;/value&amp;gt;&lt;BR /&gt; &amp;lt;value&amp;gt;"${target}"&amp;lt;/value&amp;gt;&lt;BR /&gt; &amp;lt;/list&amp;gt;&lt;BR /&gt; &amp;lt;/entry&amp;gt;&lt;BR /&gt; &amp;lt;/map&amp;gt;&lt;BR /&gt; &amp;lt;/property&amp;gt;&lt;BR /&gt; &amp;lt;property name="errorCodes"&amp;gt;&lt;BR /&gt; &amp;lt;value&amp;gt;1,2&amp;lt;/value&amp;gt;&lt;BR /&gt; &amp;lt;/property&amp;gt;&lt;BR /&gt; &amp;lt;/bean&amp;gt;&lt;BR /&gt; &amp;lt;/property&amp;gt;&lt;BR /&gt; &amp;lt;property name="explicitTransformations"&amp;gt;&lt;BR /&gt; &amp;lt;list&amp;gt;&lt;BR /&gt; &amp;lt;bean class="org.alfresco.repo.content.transform.ExplictTransformationDetails" lazy-init="default" autowire="default" dependency-check="default"&amp;gt;&lt;BR /&gt; &amp;lt;property name="sourceMimetype"&amp;gt;&lt;BR /&gt; &amp;lt;value&amp;gt;image/tiff&amp;lt;/value&amp;gt;&lt;BR /&gt; &amp;lt;/property&amp;gt;&lt;BR /&gt; &amp;lt;property name="targetMimetype"&amp;gt;&lt;BR /&gt; &amp;lt;value&amp;gt;text/plain&amp;lt;/value&amp;gt;&lt;BR /&gt; &amp;lt;/property&amp;gt;&lt;BR /&gt; &amp;lt;/bean&amp;gt;&lt;BR /&gt; &amp;lt;bean class="org.alfresco.repo.content.transform.ExplictTransformationDetails" lazy-init="default" autowire="default" dependency-check="default"&amp;gt;&lt;BR /&gt; &amp;lt;property name="sourceMimetype"&amp;gt;&lt;BR /&gt; &amp;lt;value&amp;gt;image/png&amp;lt;/value&amp;gt;&lt;BR /&gt; &amp;lt;/property&amp;gt;&lt;BR /&gt; &amp;lt;property name="targetMimetype"&amp;gt;&lt;BR /&gt; &amp;lt;value&amp;gt;text/plain&amp;lt;/value&amp;gt;&lt;BR /&gt; &amp;lt;/property&amp;gt;&lt;BR /&gt; &amp;lt;/bean&amp;gt;&lt;BR /&gt; &lt;STRONG style="color: #ff6600; "&gt;&amp;lt;bean class="org.alfresco.repo.content.transform.ExplictTransformationDetails" lazy-init="default" autowire="default" dependency-check="default"&amp;gt;&lt;/STRONG&gt;&lt;BR /&gt;&lt;STRONG style="color: #ff6600; "&gt; &amp;lt;property name="sourceMimetype"&amp;gt;&lt;/STRONG&gt;&lt;BR /&gt;&lt;STRONG style="color: #ff6600; "&gt; &amp;lt;value&amp;gt;image/pdf&amp;lt;/value&amp;gt;&lt;/STRONG&gt;&lt;BR /&gt;&lt;STRONG style="color: #ff6600; "&gt; &amp;lt;/property&amp;gt;&lt;/STRONG&gt;&lt;BR /&gt;&lt;STRONG style="color: #ff6600; "&gt; &amp;lt;property name="targetMimetype"&amp;gt;&lt;/STRONG&gt;&lt;BR /&gt;&lt;STRONG style="color: #ff6600; "&gt; &amp;lt;value&amp;gt;text/plain&amp;lt;/value&amp;gt;&lt;/STRONG&gt;&lt;BR /&gt;&lt;STRONG style="color: #ff6600; "&gt; &amp;lt;/property&amp;gt;&lt;/STRONG&gt;&lt;BR /&gt;&lt;STRONG style="color: #ff6600; "&gt; &amp;lt;/bean&amp;gt;&lt;/STRONG&gt;&lt;BR /&gt; &amp;lt;bean class="org.alfresco.repo.content.transform.ExplictTransformationDetails" lazy-init="default" autowire="default" dependency-check="default"&amp;gt;&lt;BR /&gt; &amp;lt;property name="sourceMimetype"&amp;gt;&lt;BR /&gt; &amp;lt;value&amp;gt;image/jpeg&amp;lt;/value&amp;gt;&lt;BR /&gt; &amp;lt;/property&amp;gt;&lt;BR /&gt; &amp;lt;property name="targetMimetype"&amp;gt;&lt;BR /&gt; &amp;lt;value&amp;gt;text/plain&amp;lt;/value&amp;gt;&lt;BR /&gt; &amp;lt;/property&amp;gt;&lt;BR /&gt; &amp;lt;/bean&amp;gt;&lt;BR /&gt; &amp;lt;bean class="org.alfresco.repo.content.transform.ExplictTransformationDetails" lazy-init="default" autowire="default" dependency-check="default"&amp;gt;&lt;BR /&gt; &amp;lt;property name="sourceMimetype"&amp;gt;&lt;BR /&gt; &amp;lt;value&amp;gt;image/jpg&amp;lt;/value&amp;gt;&lt;BR /&gt; &amp;lt;/property&amp;gt;&lt;BR /&gt; &amp;lt;property name="targetMimetype"&amp;gt;&lt;BR /&gt; &amp;lt;value&amp;gt;text/plain&amp;lt;/value&amp;gt;&lt;BR /&gt; &amp;lt;/property&amp;gt;&lt;BR /&gt; &amp;lt;/bean&amp;gt;&lt;BR /&gt; &amp;lt;/list&amp;gt;&lt;BR /&gt; &amp;lt;/property&amp;gt;&lt;BR /&gt; &amp;lt;/bean&amp;gt;&lt;BR /&gt; &amp;lt;bean id="transformer.ocr.tiff" class="org.alfresco.repo.content.transform.ProxyContentTransformer" parent="baseContentTransformer" lazy-init="default" autowire="default" dependency-check="default"&amp;gt;&lt;BR /&gt; &amp;lt;property name="worker"&amp;gt;&lt;BR /&gt; &amp;lt;ref bean="transformer.worker.ocr.tiff" /&amp;gt;&lt;BR /&gt; &amp;lt;/property&amp;gt;&lt;BR /&gt; &amp;lt;/bean&amp;gt;&lt;BR /&gt;&amp;lt;/beans&amp;gt;&lt;/P&gt;&lt;/BLOCKQUOTE&gt;&lt;P&gt;but, it gives me bellow error.&lt;/P&gt;&lt;BLOCKQUOTE class="jive_macro_quote jive-quote jive_text_macro"&gt;&lt;P&gt;org.springframework.beans.factory.BeanCreationException: Error creating bean with name 'transformer.ocr.tiff' defined in file [C:\alfresco-community\tomcat\shared\classes\alfresco\extension\tesseract-ocr-transform-context.xml]: Invocation of init method failed; nested exception is java.lang.IllegalArgumentException: content.transformer.ocr.tiff.extensions.bin.txt.priority has been specified more than once&lt;BR /&gt;2019-06-02 20:00:46,019 INFO [org.alfresco.repo.management.subsystems.ChildApplicationContextFactory] [localhost-startStop-1] Stopping 'Transformers' subsystem, ID: [Transformers, default]&lt;BR /&gt;2019-06-02 20:00:46,020 INFO [org.alfresco.repo.management.subsystems.ChildApplicationContextFactory] [localhost-startStop-1] Stopped 'Transformers' subsystem, ID: [Transformers, default]&lt;BR /&gt;2019-06-02 20:00:46,021 INFO [org.alfresco.repo.management.subsystems.ChildApplicationContextFactory] [localhost-startStop-1] Stopping 'Authentication' subsystem, ID: [Authentication, managed, alfrescoNtlm1]&lt;BR /&gt;2019-06-02 20:00:46,021 INFO [org.alfresco.repo.management.subsystems.ChildApplicationContextFactory] [localhost-startStop-1] Stopped 'Authentication' subsystem, ID: [Authentication, managed, alfrescoNtlm1]&lt;BR /&gt;2019-06-02 20:00:46,030 ERROR [org.springframework.web.context.ContextLoader] [localhost-startStop-1] Context initialization failed&lt;BR /&gt;org.springframework.beans.factory.BeanCreationException: Error creating bean with name 'transformer.ocr.tiff' defined in file [C:\alfresco-community\tomcat\shared\classes\alfresco\extension\tesseract-ocr-transform-context.xml]: Invocation of init method failed; nested exception is java.lang.IllegalArgumentException: content.transformer.ocr.tiff.extensions.bin.txt.priority has been specified more than once&lt;BR /&gt; at org.springframework.beans.factory.support.AbstractAutowireCapableBeanFactory.initializeBean(AbstractAutowireCapableBeanFactory.java:1514)&lt;BR /&gt; at org.springframework.beans.factory.support.AbstractAutowireCapableBeanFactory.doCreateBean(AbstractAutowireCapableBeanFactory.java:521)&lt;BR /&gt; at org.springframework.beans.factory.support.AbstractAutowireCapableBeanFactory.createBean(AbstractAutowireCapableBeanFactory.java:458)&lt;BR /&gt; at org.springframework.beans.factory.support.AbstractBeanFactory$1.getObject(AbstractBeanFactory.java:293)&lt;BR /&gt; at org.springframework.beans.factory.support.DefaultSingletonBeanRegistry.getSingleton(DefaultSingletonBeanRegistry.java:223)&lt;BR /&gt; at org.springframework.beans.factory.support.AbstractBeanFactory.doGetBean(AbstractBeanFactory.java:290)&lt;BR /&gt; at org.springframework.beans.factory.support.AbstractBeanFactory.getBean(AbstractBeanFactory.java:191)&lt;BR /&gt; at org.springframework.beans.factory.support.DefaultListableBeanFactory.preInstantiateSingletons(DefaultListableBeanFactory.java:636)&lt;BR /&gt; at org.springframework.context.support.AbstractApplicationContext.finishBeanFactoryInitialization(AbstractApplicationContext.java:938)&lt;BR /&gt; at org.springframework.context.support.AbstractApplicationContext.refresh(AbstractApplicationContext.java:479)&lt;BR /&gt; at org.springframework.web.context.ContextLoader.configureAndRefreshWebApplicationContext(ContextLoader.java:410)&lt;BR /&gt; at org.springframework.web.context.ContextLoader.initWebApplicationContext(ContextLoader.java:306)&lt;BR /&gt; at org.springframework.web.context.ContextLoaderListener.contextInitialized(ContextLoaderListener.java:112)&lt;BR /&gt; at org.alfresco.web.app.ContextLoaderListener.contextInitialized(ContextLoaderListener.java:70)&lt;BR /&gt; at org.apache.catalina.core.StandardContext.listenerStart(StandardContext.java:5118)&lt;BR /&gt; at org.apache.catalina.core.StandardContext.startInternal(StandardContext.java:5634)&lt;BR /&gt; at org.apache.catalina.util.LifecycleBase.start(LifecycleBase.java:145)&lt;BR /&gt; at org.apache.catalina.core.ContainerBase.addChildInternal(ContainerBase.java:899)&lt;BR /&gt; at org.apache.catalina.core.ContainerBase.addChild(ContainerBase.java:875)&lt;BR /&gt; at org.apache.catalina.core.StandardHost.addChild(StandardHost.java:652)&lt;BR /&gt; at org.apache.catalina.startup.HostConfig.deployDescriptor(HostConfig.java:679)&lt;BR /&gt; at org.apache.catalina.startup.HostConfig$DeployDescriptor.run(HostConfig.java:1966)&lt;BR /&gt; at java.util.concurrent.Executors$RunnableAdapter.call(Unknown Source)&lt;BR /&gt; at java.util.concurrent.FutureTask.run(Unknown Source)&lt;BR /&gt; at java.util.concurrent.ThreadPoolExecutor.runWorker(Unknown Source)&lt;BR /&gt; at java.util.concurrent.ThreadPoolExecutor$Worker.run(Unknown Source)&lt;BR /&gt; at java.lang.Thread.run(Unknown Source)&lt;BR /&gt;Caused by: java.lang.IllegalArgumentException: content.transformer.ocr.tiff.extensions.bin.txt.priority has been specified more than once&lt;BR /&gt; at org.alfresco.repo.content.transform.TransformerPropertySetter.setProperties(TransformerPropertySetter.java:119)&lt;BR /&gt; at org.alfresco.repo.content.transform.TransformerConfigImpl.setProperties(TransformerConfigImpl.java:239)&lt;BR /&gt; at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)&lt;BR /&gt; at sun.reflect.NativeMethodAccessorImpl.invoke(Unknown Source)&lt;BR /&gt; at sun.reflect.DelegatingMethodAccessorImpl.invoke(Unknown Source)&lt;BR /&gt; at java.lang.reflect.Method.invoke(Unknown Source)&lt;BR /&gt; at org.alfresco.repo.management.subsystems.SubsystemProxyFactory$1.invoke(SubsystemProxyFactory.java:79)&lt;BR /&gt; at org.springframework.aop.framework.ReflectiveMethodInvocation.proceed(ReflectiveMethodInvocation.java:172)&lt;BR /&gt; at org.springframework.aop.framework.JdkDynamicAopProxy.invoke(JdkDynamicAopProxy.java:204)&lt;BR /&gt; at com.sun.proxy.$Proxy25.setProperties(Unknown Source)&lt;BR /&gt; at org.alfresco.repo.content.transform.ContentTransformerHelper.logDeprecatedSetter(ContentTransformerHelper.java:251)&lt;BR /&gt; at org.alfresco.repo.content.transform.ProxyContentTransformer.register(ProxyContentTransformer.java:74)&lt;BR /&gt; at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)&lt;BR /&gt; at sun.reflect.NativeMethodAccessorImpl.invoke(Unknown Source)&lt;BR /&gt; at sun.reflect.DelegatingMethodAccessorImpl.invoke(Unknown Source)&lt;BR /&gt; at java.lang.reflect.Method.invoke(Unknown Source)&lt;BR /&gt; at org.springframework.beans.factory.support.AbstractAutowireCapableBeanFactory.invokeCustomInitMethod(AbstractAutowireCapableBeanFactory.java:1640)&lt;BR /&gt; at org.springframework.beans.factory.support.AbstractAutowireCapableBeanFactory.invokeInitMethods(AbstractAutowireCapableBeanFactory.java:1581)&lt;BR /&gt; at org.springframework.beans.factory.support.AbstractAutowireCapableBeanFactory.initializeBean(AbstractAutowireCapableBeanFactory.java:1511)&lt;BR /&gt; ... 26 more&lt;/P&gt;&lt;/BLOCKQUOTE&gt;&lt;P&gt;Did i missed anything or there is another way to do this?&lt;/P&gt;&lt;P&gt;please help me.&lt;/P&gt;&lt;P&gt;Sorry for my English.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Thank you.&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
    <pubDate>Sun, 02 Jun 2019 15:29:26 GMT</pubDate>
    <dc:creator>anuradha1</dc:creator>
    <dc:date>2019-06-02T15:29:26Z</dc:date>
    <item>
      <title>Configuring OCR in Alfresco - Alfresco Community 5.2</title>
      <link>https://connect.hyland.com/t5/alfresco-forum/configuring-ocr-in-alfresco-alfresco-community-5-2/m-p/110105#M30843</link>
      <description>Hi,I am successfully configured OCR with my alfresco (windows installation). But, it only working for PNG, TIFF, JPG &amp;amp; GPEG. But i need it for PDF extension also because most of scanned files are in pdf format.My&amp;nbsp;tesseract-ocr-transform-context.xml is,&amp;lt;?xml version='1.0' encoding='UTF-8'?&amp;gt;</description>
      <pubDate>Sun, 02 Jun 2019 15:29:26 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-forum/configuring-ocr-in-alfresco-alfresco-community-5-2/m-p/110105#M30843</guid>
      <dc:creator>anuradha1</dc:creator>
      <dc:date>2019-06-02T15:29:26Z</dc:date>
    </item>
    <item>
      <title>Re: Configuring OCR in Alfresco - Alfresco Community 5.2</title>
      <link>https://connect.hyland.com/t5/alfresco-forum/configuring-ocr-in-alfresco-alfresco-community-5-2/m-p/110106#M30844</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Please refer&amp;nbsp;&lt;A class="link-titled" href="http://www.contcentric.com/configuring-ocr-in-alfresco/" title="http://www.contcentric.com/configuring-ocr-in-alfresco/" rel="nofollow noopener noreferrer"&gt;Configuring OCR in Alfresco | ContCentric&lt;/A&gt;&amp;nbsp;.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Thanks,&lt;/P&gt;&lt;P&gt;Vidhi&lt;/P&gt;&lt;P&gt;&lt;A href="http://www.contcentric.com/" rel="nofollow noopener noreferrer"&gt;ContCentric&lt;/A&gt;&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Tue, 04 Jun 2019 04:44:28 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-forum/configuring-ocr-in-alfresco-alfresco-community-5-2/m-p/110106#M30844</guid>
      <dc:creator>vidhipanchal</dc:creator>
      <dc:date>2019-06-04T04:44:28Z</dc:date>
    </item>
    <item>
      <title>Re: Configuring OCR in Alfresco - Alfresco Community 5.2</title>
      <link>https://connect.hyland.com/t5/alfresco-forum/configuring-ocr-in-alfresco-alfresco-community-5-2/m-p/489269#M40085</link>
      <description>&lt;P class=""&gt;Hi,&lt;/P&gt;&lt;P class=""&gt;I am currently working on integrating OCR functionality into &lt;STRONG&gt;Alfresco 7.2, running on a Windows Server. I have successfully installed the following dependencies:&lt;/STRONG&gt;&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;&lt;P class=""&gt;&lt;STRONG&gt;Tesseract&lt;/STRONG&gt;&lt;/P&gt;&lt;/LI&gt;&lt;LI&gt;&lt;P class=""&gt;&lt;STRONG&gt;Ghostscript&lt;/STRONG&gt;&lt;/P&gt;&lt;/LI&gt;&lt;LI&gt;&lt;P class=""&gt;&lt;STRONG&gt;OCRmyPDF&lt;/STRONG&gt;&lt;/P&gt;&lt;P class=""&gt;I have placed the required JAR files:&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;&lt;P class=""&gt;simple-ocr-repo-2.3.1.jar&lt;/P&gt;&lt;/LI&gt;&lt;LI&gt;&lt;P class=""&gt;simple-ocr-share-2.3.1.jar&lt;/P&gt;&lt;P class=""&gt;into the appropriate &lt;STRONG&gt;platform and share directories of the Alfresco installation.&lt;/STRONG&gt;&lt;/P&gt;&lt;P class=""&gt;The following properties have been added to the alfresco-global.properties file:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;DIV class=""&gt;&lt;DIV class=""&gt;&amp;nbsp;&lt;DIV class=""&gt;&lt;SPAN&gt;&lt;SPAN class=""&gt;ocr.command=C:/Users/admin/AppData/Roaming/Python/Python313/Scripts/ocrmypdf.exe ocr.output.verbose=true&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;DIV class=""&gt;&lt;SPAN&gt;&lt;SPAN class=""&gt;ocr.output.file.prefix.command=&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;DIV class=""&gt;&lt;SPAN&gt;&lt;SPAN class=""&gt;ocr.extra.commands=--verbose 1 --force-ocr --deskew -l eng+spa+fra ocr.server.os=windows&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;DIV class=""&gt;&lt;SPAN&gt;However, when I attempt to use the &lt;STRONG&gt;OCR feature from the document details section, I encounter the following error:&lt;/STRONG&gt;&lt;/SPAN&gt;&lt;DIV class=""&gt;&lt;DIV class=""&gt;&lt;SPAN&gt;&lt;SPAN class=""&gt;Exception in thread "defaultAsyncAction1" java.lang.RuntimeException: java.lang.RuntimeException: java.lang.RuntimeException: java.lang.IllegalArgumentException: Invalid uri '${ocr.url}language=--verbose 1 --force-ocr --deskew -l eng+spa+fra&amp;amp;source=H%3A%5CDMS72%5Ctomcat%5Ctemp%5CAlfresco%5COCRTransformWorker_source_8194440309054693312.pdf&amp;amp;target=H%3A%5CDMS62%5Ctomcat%5Ctemp%5CAlfresco%5COCRTransformWorker_source_8194440309054693312_ocr.pdf': incorrect path at es.keensoft.alfresco.ocr.OCRExtractAction.executeImplInternal(OCRExtractAction.java:183) ... Caused by: java.lang.IllegalArgumentException: Invalid uri '${ocr.url}language=--verbose 1 --force-ocr --deskew -l eng+spa+fra&amp;amp;source=H%3A%5CDMS62%5Ctomcat%5Ctemp%5CAlfresco%5COCRTransformWorker_source_8194440309054693312.pdf&amp;amp;target=H%3A%5CDMS62%5Ctomcat%5Ctemp%5CAlfresco%5COCRTransformWorker_source_8194440309054693312_ocr.pdf': incorrect path&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;P class=""&gt;I would appreciate your assistance in identifying the cause and guiding me toward a resolution. Please let me know if you require any further logs, configuration files, or additional details.&lt;/P&gt;&lt;P class=""&gt;Thank you in advance for your support.&lt;/P&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/LI&gt;&lt;/UL&gt;&lt;/LI&gt;&lt;/UL&gt;</description>
      <pubDate>Thu, 24 Apr 2025 07:44:20 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-forum/configuring-ocr-in-alfresco-alfresco-community-5-2/m-p/489269#M40085</guid>
      <dc:creator>ShivanandaL</dc:creator>
      <dc:date>2025-04-24T07:44:20Z</dc:date>
    </item>
  </channel>
</rss>

