<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Alfresc indexing slow due to transformation in Alfresco Archive</title>
    <link>https://connect.hyland.com/t5/alfresco-archive/alfresc-indexing-slow-due-to-transformation/m-p/264684#M217814</link>
    <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;SPAN&gt;By the way we have enough free space on hard drive so this is not an issue.&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;Additionally I tried to run Alfresco with 2G max heap size configured and saw exactly the same issue.&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;After some time (5-15 min) memory was pumped up to max but it was still handling all http requests properly.&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;And eventually it slowed down and hanged. CPU utilization showed with JConsole was 100% and it caused by GC called too often trying to release memory.&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;We have small amount of users working with Alfresco (10-20 maximum).&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;We have reporting tool traversing JCR tree to generate report but JProfiler didn't show any memory leaks related that.&lt;/SPAN&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
    <pubDate>Thu, 12 May 2011 22:19:45 GMT</pubDate>
    <dc:creator>dmorozov</dc:creator>
    <dc:date>2011-05-12T22:19:45Z</dc:date>
    <item>
      <title>Alfresc indexing slow due to transformation</title>
      <link>https://connect.hyland.com/t5/alfresco-archive/alfresc-indexing-slow-due-to-transformation/m-p/264683#M217813</link>
      <description>Hello,I have been fighting last week with Alfresco going terribly slow because (I think) of Tika transformations happening in background.Please provide an advice how to solve this issue.We have Alfresco 3.4.d installed on Ubuntu 64 bit server.RAM: 16GCPU: 4JVM settings: -Djava.awt.headless=true -ser</description>
      <pubDate>Thu, 12 May 2011 22:14:24 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-archive/alfresc-indexing-slow-due-to-transformation/m-p/264683#M217813</guid>
      <dc:creator>dmorozov</dc:creator>
      <dc:date>2011-05-12T22:14:24Z</dc:date>
    </item>
    <item>
      <title>Re: Alfresc indexing slow due to transformation</title>
      <link>https://connect.hyland.com/t5/alfresco-archive/alfresc-indexing-slow-due-to-transformation/m-p/264684#M217814</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;SPAN&gt;By the way we have enough free space on hard drive so this is not an issue.&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;Additionally I tried to run Alfresco with 2G max heap size configured and saw exactly the same issue.&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;After some time (5-15 min) memory was pumped up to max but it was still handling all http requests properly.&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;And eventually it slowed down and hanged. CPU utilization showed with JConsole was 100% and it caused by GC called too often trying to release memory.&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;We have small amount of users working with Alfresco (10-20 maximum).&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;We have reporting tool traversing JCR tree to generate report but JProfiler didn't show any memory leaks related that.&lt;/SPAN&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Thu, 12 May 2011 22:19:45 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-archive/alfresc-indexing-slow-due-to-transformation/m-p/264684#M217814</guid>
      <dc:creator>dmorozov</dc:creator>
      <dc:date>2011-05-12T22:19:45Z</dc:date>
    </item>
    <item>
      <title>Re: Alfresc indexing slow due to transformation</title>
      <link>https://connect.hyland.com/t5/alfresco-archive/alfresc-indexing-slow-due-to-transformation/m-p/264685#M217815</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;SPAN&gt;Hi,&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;we have the same problem with 3.4d. &lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;For us the problem is caused when Alfresco tries to index a .xlsx file that have one million rows.&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;the threaddump:&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;"schedulerFactory_Worker-4" - Thread t@30&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;&amp;nbsp;&amp;nbsp; java.lang.Thread.State: RUNNABLE&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;at java.lang.ClassLoader.defineClass1(Native Method)&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;at java.lang.ClassLoader.defineClassCond(Unknown Source)&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;at java.lang.ClassLoader.defineClass(Unknown Source)&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;at java.security.SecureClassLoader.defineClass(Unknown Source)&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;at org.apache.catalina.loader.WebappClassLoader.findClassInternal(WebappClassLoader.java:2733)&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;- locked org.apache.catalina.loader.WebappClassLoader@1dec1dd&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;at org.apache.catalina.loader.WebappClassLoader.findClass(WebappClassLoader.java:1124)&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;at org.apache.catalina.loader.WebappClassLoader.loadClass(WebappClassLoader.java:1612)&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;- locked org.apache.catalina.loader.WebappClassLoader@1dec1dd&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;at org.apache.catalina.loader.WebappClassLoader.loadClass(WebappClassLoader.java:1491)&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;at java.lang.Class.forName0(Native Method)&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;at java.lang.Class.forName(Unknown Source)&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;at org.apache.xmlbeans.impl.schema.SchemaTypeImpl.getJavaImplClass(SchemaTypeImpl.java:1709)&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;at org.apache.xmlbeans.impl.schema.SchemaTypeImpl.getJavaImplConstructor(SchemaTypeImpl.java:1725)&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;at org.apache.xmlbeans.impl.schema.SchemaTypeImpl.createUnattachedNode(SchemaTypeImpl.java:1853)&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;at org.apache.xmlbeans.impl.schema.SchemaTypeImpl.createElementType(SchemaTypeImpl.java:1021)&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;at org.apache.xmlbeans.impl.values.XmlObjectBase.create_element_user(XmlObjectBase.java:893)&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;at org.apache.xmlbeans.impl.store.Xobj.getUser(Xobj.java:1657)&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;at org.apache.xmlbeans.impl.store.Xobj.find_element_user(Xobj.java:2062)&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;at org.openxmlformats.schemas.spreadsheetml.x2006.main.impl.CTRowImpl.getCArray(Unknown Source)&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;- locked org.apache.xmlbeans.impl.store.Locale@3f3b50&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;at org.openxmlformats.schemas.spreadsheetml.x2006.main.impl.CTRowImpl$1CList.get(Unknown Source)&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;at org.openxmlformats.schemas.spreadsheetml.x2006.main.impl.CTRowImpl$1CList.get(Unknown Source)&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;at java.util.AbstractList$Itr.next(Unknown Source)&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;at org.apache.poi.xssf.usermodel.XSSFRow.&amp;lt;init&amp;gt;(XSSFRow.java:66)&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;at org.apache.poi.xssf.usermodel.XSSFSheet.initRows(XSSFSheet.java:178)&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;at org.apache.poi.xssf.usermodel.XSSFSheet.read(XSSFSheet.java:147)&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;at org.apache.poi.xssf.usermodel.XSSFSheet.onDocumentRead(XSSFSheet.java:134)&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;at org.apache.poi.xssf.usermodel.XSSFWorkbook.onDocumentRead(XSSFWorkbook.java:234)&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;at org.apache.poi.POIXMLDocument.load(POIXMLDocument.java:190)&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;at org.apache.poi.xssf.usermodel.XSSFWorkbook.&amp;lt;init&amp;gt;(XSSFWorkbook.java:182)&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;at org.apache.poi.xssf.extractor.XSSFExcelExtractor.&amp;lt;init&amp;gt;(XSSFExcelExtractor.java:56)&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;at org.apache.poi.extractor.ExtractorFactory.createExtractor(ExtractorFactory.java:172)&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;at org.apache.poi.extractor.ExtractorFactory.createExtractor(ExtractorFactory.java:152)&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;at org.apache.tika.parser.microsoft.ooxml.OOXMLExtractorFactory.parse(OOXMLExtractorFactory.java:65)&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;at org.apache.tika.parser.microsoft.ooxml.OOXMLParser.parse(OOXMLParser.java:68)&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;at org.alfresco.repo.content.TikaOfficeDetectParser.parse(TikaOfficeDetectParser.java:78)&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;at org.alfresco.repo.content.transform.TikaPoweredContentTransformer.transformInternal(TikaPoweredContentTransformer.java:185)&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;at org.alfresco.repo.content.transform.AbstractContentTransformer2.transform(AbstractContentTransformer2.java:161)&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;at org.alfresco.repo.content.transform.AbstractContentTransformer2.transform(AbstractContentTransformer2.java:137)&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;at org.alfresco.repo.search.impl.lucene.ADMLuceneIndexerImpl.indexProperty(ADMLuceneIndexerImpl.java:944)&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;at org.alfresco.repo.search.impl.lucene.ADMLuceneIndexerImpl.createDocumentsImpl(ADMLuceneIndexerImpl.java:620)&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;at org.alfresco.repo.search.impl.lucene.ADMLuceneIndexerImpl.createDocuments(ADMLuceneIndexerImpl.java:585)&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;at org.alfresco.repo.search.impl.lucene.ADMLuceneIndexerImpl.updateFullTextSearch(ADMLuceneIndexerImpl.java:1580)&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;at org.alfresco.repo.search.impl.lucene.fts.FullTextSearchIndexerImpl.index(FullTextSearchIndexerImpl.java:217)&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;at sun.reflect.GeneratedMethodAccessor406.invoke(Unknown Source)&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;at sun.reflect.DelegatingMethodAccessorImpl.invoke(Unknown Source)&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;at java.lang.reflect.Method.invoke(Unknown Source)&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;at org.springframework.aop.support.AopUtils.invokeJoinpointUsingReflection(AopUtils.java:307)&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;at org.springframework.aop.framework.ReflectiveMethodInvocation.invokeJoinpoint(ReflectiveMethodInvocation.java:183)&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;at org.springframework.aop.framework.ReflectiveMethodInvocation.proceed(ReflectiveMethodInvocation.java:150)&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;at org.springframework.transaction.interceptor.TransactionInterceptor.invoke(TransactionInterceptor.java:107)&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;at org.springframework.aop.framework.ReflectiveMethodInvocation.proceed(ReflectiveMethodInvocation.java:172)&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;at org.springframework.aop.framework.JdkDynamicAopProxy.invoke(JdkDynamicAopProxy.java:202)&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;at $Proxy82.index(Unknown Source)&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;at org.alfresco.repo.search.impl.lucene.fts.FTSIndexerJob.execute(FTSIndexerJob.java:46)&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;at org.quartz.core.JobRunShell.run(JobRunShell.java:216)&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;at org.quartz.simpl.SimpleThreadPool$WorkerThread.run(SimpleThreadPool.java:549)&lt;/SPAN&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Fri, 13 May 2011 10:10:55 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-archive/alfresc-indexing-slow-due-to-transformation/m-p/264685#M217815</guid>
      <dc:creator>vmm</dc:creator>
      <dc:date>2011-05-13T10:10:55Z</dc:date>
    </item>
    <item>
      <title>Re: Alfresc indexing slow due to transformation</title>
      <link>https://connect.hyland.com/t5/alfresco-archive/alfresc-indexing-slow-due-to-transformation/m-p/264686#M217816</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;SPAN&gt;I'm not aware of a way to prevent indexing of a specific file, but if you know what file it is, put a password on the file. That way transformation will fail (rather than halting the whole server). If needed, write the password in the description field so that those who needs to open can do so.&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;It would just be a workaround, and not a terribly good one, just thought that it may be worth having as an option.&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;Another would be to disable content transformations for all excel files until properly resolved. Have a look in content-services-context.xml to find out what beans to change/disable. Maybe comment out the bean &amp;lt;bean id="transformer.Poi", that way I think it will user OpenOffice instead. But you have to try this out.&lt;/SPAN&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Fri, 13 May 2011 10:39:17 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-archive/alfresc-indexing-slow-due-to-transformation/m-p/264686#M217816</guid>
      <dc:creator>loftux</dc:creator>
      <dc:date>2011-05-13T10:39:17Z</dc:date>
    </item>
    <item>
      <title>Re: Alfresc indexing slow due to transformation</title>
      <link>https://connect.hyland.com/t5/alfresco-archive/alfresc-indexing-slow-due-to-transformation/m-p/264687#M217817</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;SPAN&gt;I've had a chat with our resident Tika Guru who pointed out that this sounds like TIKA-521 which was fixed late last year.&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;So you may like to try the latest version of Tika.&lt;/SPAN&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Fri, 13 May 2011 12:49:34 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-archive/alfresc-indexing-slow-due-to-transformation/m-p/264687#M217817</guid>
      <dc:creator>mrogers</dc:creator>
      <dc:date>2011-05-13T12:49:34Z</dc:date>
    </item>
    <item>
      <title>Re: Alfresc indexing slow due to transformation</title>
      <link>https://connect.hyland.com/t5/alfresco-archive/alfresc-indexing-slow-due-to-transformation/m-p/264688#M217818</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;SPAN&gt;We tried with tika 0.9 and had the same problem.&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;For the moment we have comment the mapping for xlsx and xls files in mimetype-map.xml, so Alfresco stores excel files like binarys.&lt;/SPAN&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Mon, 16 May 2011 12:37:26 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-archive/alfresc-indexing-slow-due-to-transformation/m-p/264688#M217818</guid>
      <dc:creator>vmm</dc:creator>
      <dc:date>2011-05-16T12:37:26Z</dc:date>
    </item>
    <item>
      <title>Re: Alfresc indexing slow due to transformation</title>
      <link>https://connect.hyland.com/t5/alfresco-archive/alfresc-indexing-slow-due-to-transformation/m-p/264689#M217819</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;SPAN&gt;You need version 1.0 or above…&lt;/SPAN&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Mon, 16 May 2011 13:29:37 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-archive/alfresc-indexing-slow-due-to-transformation/m-p/264689#M217819</guid>
      <dc:creator>mrogers</dc:creator>
      <dc:date>2011-05-16T13:29:37Z</dc:date>
    </item>
    <item>
      <title>Re: Alfresc indexing slow due to transformation</title>
      <link>https://connect.hyland.com/t5/alfresco-archive/alfresc-indexing-slow-due-to-transformation/m-p/264690#M217820</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;SPAN&gt;Thank you very much it helped.&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;So here is solution for others:&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;1. Checkout Tika sources trunk (google for it)&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;2. Build. &lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;Notes: It will create tika 1.0 snapshot version. &lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;Only one issue I got with compilation is missed jdom artifact in tika-parsers submodule. Just add this dependency into tika-parsers/pom.xml and build.&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;Additionally make sure that you will update all dependent libraries. For example it brings new versions of Apache POI (3.8-beta2) and PDFBox (1.5) libraries.&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;In my case I just created empty web application with maven and put tika-core and tika-parsers as dependencies. Maven will collect all required libs for you.&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;Then you will just need to make sure that in your Alfresco you have correct versions. Add extra libraries resolved by maven just in case.&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;After patching Tika I able to run my Alfresco with 2G heap size configured and average memory usage is about 600M with jumps up to 2G while documents re-indexing.&lt;/SPAN&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Fri, 20 May 2011 18:40:31 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-archive/alfresc-indexing-slow-due-to-transformation/m-p/264690#M217820</guid>
      <dc:creator>dmorozov</dc:creator>
      <dc:date>2011-05-20T18:40:31Z</dc:date>
    </item>
    <item>
      <title>Re: Alfresc indexing slow due to transformation</title>
      <link>https://connect.hyland.com/t5/alfresco-archive/alfresc-indexing-slow-due-to-transformation/m-p/264691#M217821</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;SPAN&gt;I am having the exact same issue,&amp;nbsp; Could you give me any step by step instructions on how you fixed this.&amp;nbsp; Thanks.&lt;/SPAN&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Tue, 07 Feb 2012 19:35:01 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-archive/alfresc-indexing-slow-due-to-transformation/m-p/264691#M217821</guid>
      <dc:creator>promethius</dc:creator>
      <dc:date>2012-02-07T19:35:01Z</dc:date>
    </item>
    <item>
      <title>Re: Alfresc indexing slow due to transformation</title>
      <link>https://connect.hyland.com/t5/alfresco-archive/alfresc-indexing-slow-due-to-transformation/m-p/264692#M217822</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;SPAN&gt;Just upgrade to 4.0.&lt;/SPAN&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Tue, 07 Feb 2012 20:44:07 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-archive/alfresc-indexing-slow-due-to-transformation/m-p/264692#M217822</guid>
      <dc:creator>mrogers</dc:creator>
      <dc:date>2012-02-07T20:44:07Z</dc:date>
    </item>
    <item>
      <title>Re: Alfresc indexing slow due to transformation</title>
      <link>https://connect.hyland.com/t5/alfresco-archive/alfresc-indexing-slow-due-to-transformation/m-p/264693#M217823</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;SPAN&gt;We have the same problem with 3.4d, and an upgrade to 4.0 isn't possible yet.&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;Because the worker hogs so much cpu, other problems arise.&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;Is there any way to sum up all the new .JARs that need to be installed for the indexing to work properly again and stop it locking up?&lt;/SPAN&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Mon, 12 Mar 2012 11:36:35 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-archive/alfresc-indexing-slow-due-to-transformation/m-p/264693#M217823</guid>
      <dc:creator>ebogaard</dc:creator>
      <dc:date>2012-03-12T11:36:35Z</dc:date>
    </item>
  </channel>
</rss>

