cancel
Showing results for 
Search instead for 
Did you mean: 

Error after rebuilding lucene indexes

manfred99
Champ in-the-making
Champ in-the-making
Hi@all,

for some days, openoffice was not running in the background (Linux) and a lot of documents entered into Alfresco 2.1 during this period of time were not indexed. So I decided to rebuild lucene indexes:
1. stop tomcat5
2. rename lucene-indexes to old_lucene-indexes
3. edit properties with index.recovery.mode=FULL
4. start tomcat5)

which - according to catalina.out - ended at 100%. But then the following error occured.
After Alfresco comes up, only documents are found by search which were indexed before the rebuild. But that was exactly my problem before rebuilding the indexex…
Another point: The time used to rebuild makes me wonder: 10min for about 10GB of content? I remember a rebuild with version 1.4 of Alfresco which took more than 12 hours (but on other hardware!).

Can anyone please help?
Thank you!


2007-09-29 11:03:57,084 [main] INFO  org.apache.catalina.core.StandardHostDeployer - Installing web application at context path /alfresco from URL file:/srv/www/tomcat5/base/webapps/alfresco
11:04:22,446 WARN  [remoting.rmi.RmiRegistryFactoryBean] Could not detect RMI registry - creating new one
11:04:26,640 INFO  [domain.schema.SchemaBootstrap] Schema managed by database dialect org.hibernate.dialect.MySQLInnoDBDialect.
11:04:28,837 INFO  [domain.schema.SchemaBootstrap] No changes were made to the schema.
11:04:31,106 INFO  [node.index.FullIndexRecoveryComponent] Index recovery started: 30,776 transactions.
11:05:18,737 INFO  [node.index.FullIndexRecoveryComponent]    10 % complete.
11:05:55,945 INFO  [node.index.FullIndexRecoveryComponent]    20 % complete.
11:06:32,260 INFO  [node.index.FullIndexRecoveryComponent]    30 % complete.
11:07:11,650 INFO  [node.index.FullIndexRecoveryComponent]    40 % complete.
11:07:53,048 INFO  [node.index.FullIndexRecoveryComponent]    50 % complete.
11:08:31,698 INFO  [node.index.FullIndexRecoveryComponent]    60 % complete.
11:09:12,202 INFO  [node.index.FullIndexRecoveryComponent]    70 % complete.
11:09:54,657 INFO  [node.index.FullIndexRecoveryComponent]    80 % complete.
11:10:39,368 INFO  [node.index.FullIndexRecoveryComponent]    90 % complete.
11:11:24,662 INFO  [node.index.FullIndexRecoveryComponent]    100 % complete.
11:11:24,678 INFO  [node.index.FullIndexRecoveryComponent] Index recovery completed.
java.lang.reflect.InvocationTargetException
   at sun.reflect.NativeConstructorAccessorImpl.newInstance0(Native Method)
   at sun.reflect.NativeConstructorAccessorImpl.newInstance(NativeConstructorAccessorImpl.java:39)
   at sun.reflect.DelegatingConstructorAccessorImpl.newInstance(DelegatingConstructorAccessorImpl.java:27)
   at java.lang.reflect.Constructor.newInstance(Constructor.java:494)
   at org.apache.poi.hssf.record.RecordFactory.createRecord(RecordFactory.java:224)
   at org.apache.poi.hssf.record.RecordFactory.createRecords(RecordFactory.java:160)
   at org.apache.poi.hssf.usermodel.HSSFWorkbook.<init>(HSSFWorkbook.java:163)
   at org.apache.poi.hssf.usermodel.HSSFWorkbook.<init>(HSSFWorkbook.java:210)
   at org.apache.poi.hssf.usermodel.HSSFWorkbook.<init>(HSSFWorkbook.java:191)
   at org.alfresco.repo.content.transform.PoiHssfContentTransformer.transformInternal(PoiHssfContentTransformer.java:87)
   at org.alfresco.repo.content.transform.AbstractContentTransformer.transform(AbstractContentTransformer.java:246)
   at org.alfresco.repo.content.transform.AbstractContentTransformer.transform(AbstractContentTransformer.java:210)
   at org.alfresco.repo.search.impl.lucene.ADMLuceneIndexerImpl.indexProperty(ADMLuceneIndexerImpl.java:821)
   at org.alfresco.repo.search.impl.lucene.ADMLuceneIndexerImpl.createDocuments(ADMLuceneIndexerImpl.java:546)
   at org.alfresco.repo.search.impl.lucene.AbstractLuceneIndexerImpl.indexImpl(AbstractLuceneIndexerImpl.java:631)
   at org.alfresco.repo.search.impl.lucene.AbstractLuceneIndexerImpl.indexImpl(AbstractLuceneIndexerImpl.java:656)
   at org.alfresco.repo.search.impl.lucene.AbstractLuceneIndexerImpl.flushPending(AbstractLuceneIndexerImpl.java:788)
   at org.alfresco.repo.search.impl.lucene.ADMLuceneIndexerImpl.doPrepare(ADMLuceneIndexerImpl.java:1337)
   at org.alfresco.repo.search.impl.lucene.AbstractLuceneIndexerImpl.prepare(AbstractLuceneIndexerImpl.java:471)
   at org.alfresco.repo.search.impl.lucene.AbstractLuceneIndexerAndSearcherFactory.prepare(AbstractLuceneIndexerAndSearcherFactory.java:679)
   at org.alfresco.repo.transaction.AlfrescoTransactionSupport$TransactionSynchronizationImpl.beforeCommit(AlfrescoTransactionSupport.java:634)
   at org.springframework.transaction.support.TransactionSynchronizationUtils.triggerBeforeCommit(TransactionSynchronizationUtils.java:48)
   at org.springframework.transaction.support.AbstractPlatformTransactionManager.triggerBeforeCommit(AbstractPlatformTransactionManager.java:821)
   at org.springframework.transaction.support.AbstractPlatformTransactionManager.processCommit(AbstractPlatformTransactionManager.java:637)
   at org.springframework.transaction.support.AbstractPlatformTransactionManager.commit(AbstractPlatformTransactionManager.java:624)
   at org.springframework.transaction.interceptor.TransactionAspectSupport.commitTransactionAfterReturning(TransactionAspectSupport.java:307)
   at org.alfresco.util.transaction.SpringAwareUserTransaction.commit(SpringAwareUserTransaction.java:420)
   at org.alfresco.repo.transaction.RetryingTransactionHelper.doInTransaction(RetryingTransactionHelper.java:239)
   at org.alfresco.repo.transaction.RetryingTransactionHelper.doInTransaction(RetryingTransactionHelper.java:155)
   at org.alfresco.repo.node.index.AbstractReindexComponent.reindex(AbstractReindexComponent.java:220)
   at org.alfresco.repo.node.index.IndexRecoveryBootstrapBean.onBootstrap(IndexRecoveryBootstrapBean.java:35)
   at org.alfresco.util.AbstractLifecycleBean.onApplicationEvent(AbstractLifecycleBean.java:62)
   at org.springframework.context.event.SimpleApplicationEventMulticaster$1.run(SimpleApplicationEventMulticaster.java:77)
   at org.springframework.core.task.SyncTaskExecutor.execute(SyncTaskExecutor.java:49)
   at org.springframework.context.event.SimpleApplicationEventMulticaster.multicastEvent(SimpleApplicationEventMulticaster.java:75)
   at org.springframework.context.support.AbstractApplicationContext.publishEvent(AbstractApplicationContext.java:241)
   at org.springframework.context.support.AbstractApplicationContext.refresh(AbstractApplicationContext.java:349)
   at org.springframework.web.context.support.AbstractRefreshableWebApplicationContext.refresh(AbstractRefreshableWebApplicationContext.java:156)
   at org.springframework.web.context.ContextLoader.createWebApplicationContext(ContextLoader.java:246)
   at org.springframework.web.context.ContextLoader.initWebApplicationContext(ContextLoader.java:184)
   at org.springframework.web.context.ContextLoaderListener.contextInitialized(ContextLoaderListener.java:49)
   at org.apache.catalina.core.StandardContext.listenerStart(StandardContext.java:3831)
   at org.apache.catalina.core.StandardContext.start(StandardContext.java:4323)
   at org.apache.catalina.core.ContainerBase.addChildInternal(ContainerBase.java:823)
   at org.apache.catalina.core.ContainerBase.addChild(ContainerBase.java:807)
   at org.apache.catalina.core.StandardHost.addChild(StandardHost.java:595)
   at org.apache.catalina.core.StandardHostDeployer.install(StandardHostDeployer.java:277)
   at org.apache.catalina.core.StandardHost.install(StandardHost.java:832)
   at org.apache.catalina.startup.HostConfig.deployWARs(HostConfig.java:625)
   at org.apache.catalina.startup.HostConfig.deployApps(HostConfig.java:431)
   at org.apache.catalina.startup.HostConfig.start(HostConfig.java:983)
   at org.apache.catalina.startup.HostConfig.lifecycleEvent(HostConfig.java:349)
   at org.apache.catalina.util.LifecycleSupport.fireLifecycleEvent(LifecycleSupport.java:119)
   at org.apache.catalina.core.ContainerBase.start(ContainerBase.java:1091)
   at org.apache.catalina.core.StandardHost.start(StandardHost.java:789)
   at org.apache.catalina.core.ContainerBase.start(ContainerBase.java:1083)
   at org.apache.catalina.core.StandardEngine.start(StandardEngine.java:478)
   at org.apache.catalina.core.StandardService.start(StandardService.java:480)
   at org.apache.catalina.core.StandardServer.start(StandardServer.java:2313)
   at org.apache.catalina.startup.Catalina.start(Catalina.java:556)
   at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
   at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
   at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
   at java.lang.reflect.Method.invoke(Method.java:585)
   at org.apache.catalina.startup.Bootstrap.start(Bootstrap.java:287)
   at org.apache.catalina.startup.Bootstrap.main(Bootstrap.java:425)
Caused by: java.lang.ArrayIndexOutOfBoundsException
   at java.lang.System.arraycopy(Native Method)
   at org.apache.poi.hssf.record.UnknownRecord.<init>(UnknownRecord.java:62)
   at org.apache.poi.hssf.record.SubRecord.createSubRecord(SubRecord.java:57)
   at org.apache.poi.hssf.record.ObjRecord.fillFields(ObjRecord.java:99)
   at org.apache.poi.hssf.record.Record.fillFields(Record.java:90)
   at org.apache.poi.hssf.record.Record.<init>(Record.java:55)
   at org.apache.poi.hssf.record.ObjRecord.<init>(ObjRecord.java:61)
   … 66 more

1 REPLY 1

kevinr
Star Contributor
Star Contributor
The index rebuild is complete as per the success message. However, after the main index rebuild, the full-text background indexing of all the documents detected in the new indexes will occur. This will take some time!! And you won't see the results immediately. You will probably see the CPU high during this time.

The exception you posted is one of the documents failing to index due to an exception in the Excel->Txt transformer - it will not cause the reindexing of other documents to fail. Basically you just need to leave the machine to get on with the reindexing of document text in the background.

Try a search in a hour or two to see if it's got further.

Thanks,

Kevin