<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Lucene indexes are around 5 times larger than contentstore? in Alfresco Archive</title>
    <link>https://connect.hyland.com/t5/alfresco-archive/lucene-indexes-are-around-5-times-larger-than-contentstore/m-p/118879#M83853</link>
    <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;SPAN&gt;sweet. thats got it.&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;After a load (without closes):&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;25456&amp;nbsp;&amp;nbsp; contentstore&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;51460&amp;nbsp;&amp;nbsp; lucene-indexes&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;After a load (with closes):&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;15164&amp;nbsp;&amp;nbsp; contentstore&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;16296&amp;nbsp;&amp;nbsp; lucene-indexes&lt;/SPAN&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
    <pubDate>Mon, 29 Oct 2007 18:37:56 GMT</pubDate>
    <dc:creator>chatch</dc:creator>
    <dc:date>2007-10-29T18:37:56Z</dc:date>
    <item>
      <title>Lucene indexes are around 5 times larger than contentstore?</title>
      <link>https://connect.hyland.com/t5/alfresco-archive/lucene-indexes-are-around-5-times-larger-than-contentstore/m-p/118873#M83847</link>
      <description>We've run around 13,000 word and rtf documents into an Alfresco 2.1 instance on Linux. We're finding the lucene-indexes are five times larger than the content we are indexing? Has anyone else seen this?&amp;nbsp;&amp;nbsp;&amp;nbsp; -&amp;gt; 2.8 Gb contentstore&amp;nbsp;&amp;nbsp;&amp;nbsp; -&amp;gt; 15.4 Gb lucene-indexes&amp;nbsp;&amp;nbsp;&amp;nbsp; Regards,Damon.</description>
      <pubDate>Wed, 24 Oct 2007 13:24:57 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-archive/lucene-indexes-are-around-5-times-larger-than-contentstore/m-p/118873#M83847</guid>
      <dc:creator>damonrand</dc:creator>
      <dc:date>2007-10-24T13:24:57Z</dc:date>
    </item>
    <item>
      <title>Re: Lucene indexes are around 5 times larger than contentstore?</title>
      <link>https://connect.hyland.com/t5/alfresco-archive/lucene-indexes-are-around-5-times-larger-than-contentstore/m-p/118874#M83848</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;SPAN&gt;We've now done some further testing around indexes sizes to see why they are so much larger than our content store..&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;Below are different tests and the results on the indexes sizes for each.&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;These were done sequentially.&amp;nbsp; The last one is the most interesting in that&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;blowing indexes away results in a large decrease. It seems old and &lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;presumably unused indexes are hanging around???&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;Damon.&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;After Bootstrap Alfresco:&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;4.3M&amp;nbsp;&amp;nbsp;&amp;nbsp; live/lucene-indexes&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;4.0K&amp;nbsp;&amp;nbsp;&amp;nbsp; live/contentstore.deleted&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;4.6M&amp;nbsp;&amp;nbsp;&amp;nbsp; live/contentstore&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;4.0K&amp;nbsp;&amp;nbsp;&amp;nbsp; live/audit.contentstore&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;8.8M&amp;nbsp;&amp;nbsp;&amp;nbsp; live/&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;After migrating 9 folders with a few hundred files:&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;27M&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; live/lucene-indexes&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;4.0K&amp;nbsp;&amp;nbsp;&amp;nbsp; live/contentstore.deleted&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;13M&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; live/contentstore&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;4.0K&amp;nbsp;&amp;nbsp;&amp;nbsp; live/audit.contentstore&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;40M&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; live/&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;lucene-indexes directory breakdown:&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;80K&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; ./lucene-indexes/user&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;4.0K&amp;nbsp;&amp;nbsp;&amp;nbsp; ./lucene-indexes/locks&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;48K&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; ./lucene-indexes/archive&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;100K&amp;nbsp;&amp;nbsp;&amp;nbsp; ./lucene-indexes/system&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;27M&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; ./lucene-indexes/workspace&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;After server restarted actually went down a little:&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;26M&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; ./lucene-indexes&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;4.0K&amp;nbsp;&amp;nbsp;&amp;nbsp; ./contentstore.deleted&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;13M&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; ./contentstore&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;4.0K&amp;nbsp;&amp;nbsp;&amp;nbsp; ./audit.contentstore&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;Set index.recovery.mode=FULL and restarted the server:&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;36M&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; ./lucene-indexes&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;4.0K&amp;nbsp;&amp;nbsp;&amp;nbsp; ./contentstore.deleted&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;13M&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; ./contentstore&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;4.0K&amp;nbsp;&amp;nbsp;&amp;nbsp; ./audit.contentstore&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;49M&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; .&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;Set blew away index and set index.recovery.mode=FULL and restarted the &lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;server:&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;9.6M&amp;nbsp;&amp;nbsp;&amp;nbsp; ./lucene-indexes&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;4.0K&amp;nbsp;&amp;nbsp;&amp;nbsp; ./contentstore.deleted&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;13M&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; ./contentstore&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;4.0K&amp;nbsp;&amp;nbsp;&amp;nbsp; ./audit.contentstore&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;23M&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; .&lt;/SPAN&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Thu, 25 Oct 2007 13:24:30 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-archive/lucene-indexes-are-around-5-times-larger-than-contentstore/m-p/118874#M83848</guid>
      <dc:creator>damonrand</dc:creator>
      <dc:date>2007-10-25T13:24:30Z</dc:date>
    </item>
    <item>
      <title>Re: Lucene indexes are around 5 times larger than contentstore?</title>
      <link>https://connect.hyland.com/t5/alfresco-archive/lucene-indexes-are-around-5-times-larger-than-contentstore/m-p/118875#M83849</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;SPAN&gt;Hi&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;How are you loading this data?&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;Andy&lt;/SPAN&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Mon, 29 Oct 2007 15:45:43 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-archive/lucene-indexes-are-around-5-times-larger-than-contentstore/m-p/118875#M83849</guid>
      <dc:creator>andy</dc:creator>
      <dc:date>2007-10-29T15:45:43Z</dc:date>
    </item>
    <item>
      <title>Re: Lucene indexes are around 5 times larger than contentstore?</title>
      <link>https://connect.hyland.com/t5/alfresco-archive/lucene-indexes-are-around-5-times-larger-than-contentstore/m-p/118876#M83850</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;SPAN&gt;The data is loaded using a migration script which loads using a combination of the following calls:&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;nodeService.createNode followed by a contentWriter.putContent into that node&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;fileFolderService.create&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;fileFolderService.copy&amp;nbsp;&amp;nbsp;&amp;nbsp; (from space templates)&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;Chris&lt;/SPAN&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Mon, 29 Oct 2007 16:03:58 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-archive/lucene-indexes-are-around-5-times-larger-than-contentstore/m-p/118876#M83850</guid>
      <dc:creator>chatch</dc:creator>
      <dc:date>2007-10-29T16:03:58Z</dc:date>
    </item>
    <item>
      <title>Re: Lucene indexes are around 5 times larger than contentstore?</title>
      <link>https://connect.hyland.com/t5/alfresco-archive/lucene-indexes-are-around-5-times-larger-than-contentstore/m-p/118877#M83851</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;SPAN&gt;Hi &lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;Do you do any queries? Do you make sure you close the result sets?&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;Andy&lt;/SPAN&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Mon, 29 Oct 2007 16:42:06 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-archive/lucene-indexes-are-around-5-times-larger-than-contentstore/m-p/118877#M83851</guid>
      <dc:creator>andy</dc:creator>
      <dc:date>2007-10-29T16:42:06Z</dc:date>
    </item>
    <item>
      <title>Re: Lucene indexes are around 5 times larger than contentstore?</title>
      <link>https://connect.hyland.com/t5/alfresco-archive/lucene-indexes-are-around-5-times-larger-than-contentstore/m-p/118878#M83852</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;SPAN&gt;yes I do and no I don't! I'll close all handles and re run and see what happens … cheers.&lt;/SPAN&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Mon, 29 Oct 2007 17:30:15 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-archive/lucene-indexes-are-around-5-times-larger-than-contentstore/m-p/118878#M83852</guid>
      <dc:creator>chatch</dc:creator>
      <dc:date>2007-10-29T17:30:15Z</dc:date>
    </item>
    <item>
      <title>Re: Lucene indexes are around 5 times larger than contentstore?</title>
      <link>https://connect.hyland.com/t5/alfresco-archive/lucene-indexes-are-around-5-times-larger-than-contentstore/m-p/118879#M83853</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;SPAN&gt;sweet. thats got it.&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;After a load (without closes):&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;25456&amp;nbsp;&amp;nbsp; contentstore&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;51460&amp;nbsp;&amp;nbsp; lucene-indexes&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;After a load (with closes):&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;15164&amp;nbsp;&amp;nbsp; contentstore&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;16296&amp;nbsp;&amp;nbsp; lucene-indexes&lt;/SPAN&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Mon, 29 Oct 2007 18:37:56 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-archive/lucene-indexes-are-around-5-times-larger-than-contentstore/m-p/118879#M83853</guid>
      <dc:creator>chatch</dc:creator>
      <dc:date>2007-10-29T18:37:56Z</dc:date>
    </item>
  </channel>
</rss>

