<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: SOLR4 content folder in Alfresco Archive</title>
    <link>https://connect.hyland.com/t5/alfresco-archive/solr4-content-folder/m-p/309093#M262223</link>
    <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;SPAN&gt;Hello,&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;SOLR 4 content folder is not filled with what you consider to be "content", that is full text contents of documents being index. The content folder holds compressed JSON files containing index data for all documents / folders. This includes metadata and path information, not just full text data. So even when you disable full text indexing will SOLR create those files but they should be very, very small and only contain metadata, ACLs and such.&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;As far as I can see, there is no way to disable this behaviour. It is intended as a cache mechanism to avoid SOLR having to re-fetch all the data for a node whenever it has to re-index a document without a change to the document itself (or with only a limited change), i.e. when a path changes due to some folder in the hierarchy being renamed or an ACL is updated.&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;Regards&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;Axel&lt;/SPAN&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
    <pubDate>Mon, 02 Nov 2015 08:35:54 GMT</pubDate>
    <dc:creator>afaust</dc:creator>
    <dc:date>2015-11-02T08:35:54Z</dc:date>
    <item>
      <title>SOLR4 content folder</title>
      <link>https://connect.hyland.com/t5/alfresco-archive/solr4-content-folder/m-p/309092#M262222</link>
      <description>I have disabled content indexing in SOLR4 on my Alfresco 5.0.1 installation with alfresco.index.transformContent=false in solrcore.properties. However solr4/content folder still gets filled up with 5.000.000 files and 10GB of space during index rebuild. Why is that? Shouldn't it be empty?</description>
      <pubDate>Sun, 01 Nov 2015 15:12:17 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-archive/solr4-content-folder/m-p/309092#M262222</guid>
      <dc:creator>pero</dc:creator>
      <dc:date>2015-11-01T15:12:17Z</dc:date>
    </item>
    <item>
      <title>Re: SOLR4 content folder</title>
      <link>https://connect.hyland.com/t5/alfresco-archive/solr4-content-folder/m-p/309093#M262223</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;SPAN&gt;Hello,&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;SOLR 4 content folder is not filled with what you consider to be "content", that is full text contents of documents being index. The content folder holds compressed JSON files containing index data for all documents / folders. This includes metadata and path information, not just full text data. So even when you disable full text indexing will SOLR create those files but they should be very, very small and only contain metadata, ACLs and such.&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;As far as I can see, there is no way to disable this behaviour. It is intended as a cache mechanism to avoid SOLR having to re-fetch all the data for a node whenever it has to re-index a document without a change to the document itself (or with only a limited change), i.e. when a path changes due to some folder in the hierarchy being renamed or an ACL is updated.&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;Regards&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;Axel&lt;/SPAN&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Mon, 02 Nov 2015 08:35:54 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-archive/solr4-content-folder/m-p/309093#M262223</guid>
      <dc:creator>afaust</dc:creator>
      <dc:date>2015-11-02T08:35:54Z</dc:date>
    </item>
    <item>
      <title>Re: SOLR4 content folder</title>
      <link>https://connect.hyland.com/t5/alfresco-archive/solr4-content-folder/m-p/309094#M262224</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;SPAN&gt;Thanks for clarifying on that. &lt;/SPAN&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Wed, 04 Nov 2015 10:34:41 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-archive/solr4-content-folder/m-p/309094#M262224</guid>
      <dc:creator>pero</dc:creator>
      <dc:date>2015-11-04T10:34:41Z</dc:date>
    </item>
  </channel>
</rss>

