11-06-2016 04:39 AM
hello
I use Alfresco 5.0.c with Solr4.
I introduced in Alfresco about 120.000 doc and the sizes on disc are:
1. 87GB di contentStore with about 288.000 files
2. 64GB di index (for folder d:\alfresco\alf_data\solr4\index\workspace) with about 830 files
3. 15GB di content (for folder d:\alfresco\alf_data\solr4\content\_DEFAULT_) with about 175.000 files
sssMy questions are:
1) 1) Are the sizes normal (64GB of index for 87GB of content) ?
22 2) Is it possible indexing only certain folders? in other word: my contents are organized for year and i think to indexing only last years ( 2016,2015 ). How can do it ?
very thanks in advance
2)
22
Ia
11-06-2016 08:59 AM
"Normal" sizes are relative. It depends heavily on the type of documents that you store, specifically if / how much textual content Alfresco can extract from those files and index. 64GiB sound like a bit much considering 87 GiB of content, but if that content is all text or even stored in compressed format, it could be reasonable.
You can use an aspect in Alfresco to exclude files / folders from indexing. The cm:indexControl aspect can be used to do this. This aspect provides a cm:isIndexed property which can be set to false to completely exclude the item from indexing, while another property cm:isContentIndexed can be set to false to exclude the file content from being indexed while the file itself is still indexed using its metadata.
11-06-2016 01:21 PM
Hope these links helps:
- How to avoid indexing full content in Alfresco - zylk
- How to deactivate metadata extraction in Alfresco - zylk
Regards.
--C.
Tags
Find what you came for
We want to make your experience in Hyland Connect as valuable as possible, so we put together some helpful links.