Relationship between alf_transaction and index recovery

iblanco
Confirmed Champ
I have a repository with around 25,000 documents. We have just uploaded all of them and executed some actions that set metadata on them. The "alf_node" table has around 145,000 rows and alf_transaction around 275,000. In our setup we have 2 Alfresco Community 4.0.d machines in a cluster, using ehcache and index tracking for synchronization. We are not using SOLR, just plain old Lucene.

Much of the content is still not indexed, but as far as I know this is done in the background, based on the existence of FTSSTATUS fields in the Lucene indexes. The problem is that after some trouble we stopped both machines, and now when I start them with index recovery set to AUTO the process starts but seems to take too long: over 7 hours now and still going. I understand that the full content indexing might take a long time, but "the other indexing", the one that is done in the foreground, should be quite fast.

I suspect it might be related to the fact that the recovery process is trying to "recover" all the transactions, and there seem to be too many of them.
Does the recovery process consider all the transactions in alf_transaction or just the last N? Is it time-based? I think alf_transaction is emptied by a scheduled job, but when is it safe to delete a transaction entry from this table?

I know those are quite a lot of questions, but if someone could shed some light on how the transaction table and the index recovery process are related, that would be really helpful.

Thank you very much.
10 REPLIES

qsdmv
Champ in-the-making
I had a webscript to split the big transaction, written by borrowing parts of existing code. It did split the transaction (with 240,000 nodes) into small grouped transactions of 50 nodes each. But full indexing still takes a long time, with high CPU utilization and 82% memory usage, and it eventually fails with an OutOfMemory error (physical memory 8 GB, JVM maximum heap size 6 GB, more than 6 million content items). Could you let me know how many content items you have, what maximum heap size you set, and how much physical memory you have available? Thanks in advance.
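
For readers following along, the batching idea described above can be sketched roughly as follows. This is not the actual webscript (which is not shown in this thread) and it does not touch any Alfresco API; the function name `split_into_batches` and the use of plain integers as node IDs are assumptions made purely for illustration.

```python
from typing import Iterator, List, Sequence, TypeVar

T = TypeVar("T")


def split_into_batches(node_ids: Sequence[T], batch_size: int = 50) -> Iterator[List[T]]:
    """Yield consecutive groups of at most batch_size node IDs.

    Hypothetical helper: in a real repository each group would be
    processed in its own, smaller transaction.
    """
    if batch_size <= 0:
        raise ValueError("batch_size must be positive")
    for start in range(0, len(node_ids), batch_size):
        yield list(node_ids[start:start + batch_size])


# Stand-in for the 240,000 nodes of the single large transaction.
node_ids = range(240_000)
batches = list(split_into_batches(node_ids, 50))
print(len(batches))  # 4800 groups of 50 nodes each
```

Splitting like this only reduces the size of each unit of work; as noted above, the total amount of indexing is unchanged, so it may not help much with overall run time or memory pressure.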