SOLR4 content folder

pero
Champ in-the-making
I have disabled content indexing in SOLR4 on my Alfresco 5.0.1 installation by setting alfresco.index.transformContent=false in solrcore.properties. However, the solr4/content folder still fills up with 5,000,000 files and 10 GB of data during an index rebuild. Why is that? Shouldn't it be empty?
2 REPLIES

afaust
Legendary Innovator
Hello,

The SOLR 4 content folder is not filled with what you consider to be "content", that is, the full-text contents of the documents being indexed. The content folder holds compressed JSON files containing index data for all documents / folders. This includes metadata and path information, not just full-text data. So even when you disable full-text indexing, SOLR will still create those files, but they should be very, very small and only contain metadata, ACLs and such.

As far as I can see, there is no way to disable this behaviour. It is intended as a cache mechanism so that SOLR does not have to re-fetch all the data for a node whenever it has to re-index a document without a change to the document itself (or with only a limited change), e.g. when a path changes because some folder in the hierarchy was renamed, or when an ACL is updated.
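If you want to check that the files really are as small as expected, a rough sketch like the following could help. It only counts files and sums their sizes; it does not open or decode them. The function names are made up for illustration, and the folder you pass in (for example the solr4/content directory of your installation) is an assumption about your setup.

```python
import os


def summarize_folder(root):
    """Return (file_count, total_bytes) for all regular files under root."""
    count = 0
    total = 0
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            path = os.path.join(dirpath, name)
            try:
                total += os.path.getsize(path)
            except OSError:
                # The file may vanish while an index rebuild is running; skip it.
                continue
            count += 1
    return count, total


def average_size(count, total):
    """Average bytes per file, or 0.0 for an empty folder."""
    return total / count if count else 0.0
```

With the numbers from the question, and taking 10 GB as 10 * 10**9 bytes, average_size(5_000_000, 10 * 10**9) works out to 2000 bytes per file, which fits small per-node metadata files rather than full-text content.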

Regards
Axel

pero
Champ in-the-making
Thanks for clarifying on that.