<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Performing problems as repository nodes increases. in Alfresco Archive</title>
    <link>https://connect.hyland.com/t5/alfresco-archive/performing-problems-as-repository-nodes-increases/m-p/169091#M122504</link>
    <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;SPAN&gt;I ran an index recovery and the index went from 23 folders down to three. It took five hours and brought performance up to 54 docs per minute, which is still less than expected. A few questions for anybody who knows:&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;1.- How can I check that the automatic process that merges index segments in Lucene is working?&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;2.- Why, after inserting only 760 docs right after the index recovery, were 15 folders created in lucene-index/SpaceStore?&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;3.- Maybe it is obvious, but not to me: if searching for docs uses the Lucene indexes, what is the use of the Oracle indexes?&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; Are we duplicating information for nothing?&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;4.- Is there any tuning in Alfresco for Lucene, or anything else needed to get performance back?&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;5.- Is Alfresco a good repository for millions of small documents with lots of custom properties?&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;6.- Has anybody had performance problems when inserting approx. 15,000 docs daily, and how did they solve it?&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;7.- It is a curious thing, but searching is really fast while inserting is very slow.&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;8.- Where can I find documentation on how Alfresco uses Lucene, in case the problem is Lucene?&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;Thanks in advance.&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;I'm a little scared imagining the repository with 10 million documents, running an index recovery every week and spending another week to complete it.&lt;/SPAN&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
    <pubDate>Thu, 19 Jun 2008 11:14:14 GMT</pubDate>
    <dc:creator>sidi</dc:creator>
    <dc:date>2008-06-19T11:14:14Z</dc:date>
    <item>
      <title>Performing problems as repository nodes increases.</title>
      <link>https://connect.hyland.com/t5/alfresco-archive/performing-problems-as-repository-nodes-increases/m-p/169087#M122500</link>
      <description>Via web services we are inserting scanned documents into a specific space. At the beginning of the process the rate was 100 documents per minute, for nodes of 34 KB with 13 searchable properties. Now, after 150,000 documents inserted into Alfresco, the rate has dropped to 10 docs per minute. We hav</description>
      <pubDate>Mon, 16 Jun 2008 10:11:47 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-archive/performing-problems-as-repository-nodes-increases/m-p/169087#M122500</guid>
      <dc:creator>sidi</dc:creator>
      <dc:date>2008-06-16T10:11:47Z</dc:date>
    </item>
    <item>
      <title>Re: Performing problems as repository nodes increases.</title>
      <link>https://connect.hyland.com/t5/alfresco-archive/performing-problems-as-repository-nodes-increases/m-p/169088#M122501</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;SPAN&gt;How many documents do you have, on average, in each Space?&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;Or are you archiving all documents in the same Space?&lt;/SPAN&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Mon, 16 Jun 2008 19:37:11 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-archive/performing-problems-as-repository-nodes-increases/m-p/169088#M122501</guid>
      <dc:creator>theorbix</dc:creator>
      <dc:date>2008-06-16T19:37:11Z</dc:date>
    </item>
    <item>
      <title>Re: Performing problems as repository nodes increases.</title>
      <link>https://connect.hyland.com/t5/alfresco-archive/performing-problems-as-repository-nodes-increases/m-p/169089#M122502</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;SPAN&gt;They are mostly in one space. With index.recovery.mode=VALIDATE, what exactly happens? If I stop and restart Alfresco, will the index be rebuilt?&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;Thanks for your reply.&lt;/SPAN&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Tue, 17 Jun 2008 09:03:13 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-archive/performing-problems-as-repository-nodes-increases/m-p/169089#M122502</guid>
      <dc:creator>sidi</dc:creator>
      <dc:date>2008-06-17T09:03:13Z</dc:date>
    </item>
    <item>
      <title>Re: Performing problems as repository nodes increases.</title>
      <link>https://connect.hyland.com/t5/alfresco-archive/performing-problems-as-repository-nodes-increases/m-p/169090#M122503</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;SPAN&gt;Mmmm... I might be wrong, but I think I see where your problem is.&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;Most document management systems have problems when the quantity of documents contained in a virtual "folder" (or space, to use Alfresco's terminology) increases.&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;I would also expect performance problems when running queries on very large spaces.&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;In the past I worked on commercial products that showed noticeable performance degradation (mostly in searching and browsing content in folders) when the quantity of documents in a folder was over 500-1000 items.&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;Maybe Alfresco's architecture is different and it can easily handle spaces with hundreds of thousands of documents, or even millions of documents.&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;But in designing your application, I would suggest implementing an automatic "space splitting" algorithm capable of keeping the number of items in each space down to a reasonable limit, and seeing whether this improves the performance of your application.&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;From a broader point of view, a comment from Alfresco's engineers here would be greatly appreciated:&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;1) What is the "reasonable" number of items that - according to the product architecture and the tests you've done so far (I'm thinking about the Unisys benchmark, for instance) - can be stored in a Space without incurring performance degradation?&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;2) What "repository design" practices are recommended for applications that need to store more than one million documents?&lt;/SPAN&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Tue, 17 Jun 2008 09:19:24 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-archive/performing-problems-as-repository-nodes-increases/m-p/169090#M122503</guid>
      <dc:creator>theorbix</dc:creator>
      <dc:date>2008-06-17T09:19:24Z</dc:date>
    </item>
    <item>
      <title>Re: Performing problems as repository nodes increases.</title>
      <link>https://connect.hyland.com/t5/alfresco-archive/performing-problems-as-repository-nodes-increases/m-p/169091#M122504</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;SPAN&gt;I ran an index recovery and the index went from 23 folders down to three. It took five hours and brought performance up to 54 docs per minute, which is still less than expected. A few questions for anybody who knows:&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;1.- How can I check that the automatic process that merges index segments in Lucene is working?&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;2.- Why, after inserting only 760 docs right after the index recovery, were 15 folders created in lucene-index/SpaceStore?&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;3.- Maybe it is obvious, but not to me: if searching for docs uses the Lucene indexes, what is the use of the Oracle indexes?&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; Are we duplicating information for nothing?&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;4.- Is there any tuning in Alfresco for Lucene, or anything else needed to get performance back?&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;5.- Is Alfresco a good repository for millions of small documents with lots of custom properties?&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;6.- Has anybody had performance problems when inserting approx. 15,000 docs daily, and how did they solve it?&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;7.- It is a curious thing, but searching is really fast while inserting is very slow.&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;8.- Where can I find documentation on how Alfresco uses Lucene, in case the problem is Lucene?&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;Thanks in advance.&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;I'm a little scared imagining the repository with 10 million documents, running an index recovery every week and spending another week to complete it.&lt;/SPAN&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Thu, 19 Jun 2008 11:14:14 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-archive/performing-problems-as-repository-nodes-increases/m-p/169091#M122504</guid>
      <dc:creator>sidi</dc:creator>
      <dc:date>2008-06-19T11:14:14Z</dc:date>
    </item>
    <item>
      <title>Re: Performing problems as repository nodes increases.</title>
      <link>https://connect.hyland.com/t5/alfresco-archive/performing-problems-as-repository-nodes-increases/m-p/169092#M122505</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;SPAN&gt;I am doing something similar.&amp;nbsp; Currently the ingestion process I am using (via web services) creates folders in a space.&amp;nbsp; We have noticed a couple of performance issues and were looking for recommendations.&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;1. On ingestion, I search for the existence of a folder.&amp;nbsp; If I search using an XPATH query, the web service call appears to take between 15 and 20 seconds to return.&amp;nbsp; If I use a query.getChildren() approach, then I need to page the result sets, as these are limited to 1000 records per set.&amp;nbsp; This approach starts out fast, then slows over time as the number of folders increases.&amp;nbsp; Any ideas why the XPATH query would take so long to return?&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;2. What is the suggested limit of folders in a space before there is noticeable degradation in performance?&amp;nbsp; In the DM, it seems that once there are more than 1000 folders in a space, it takes some time to complete the query and render the page.&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;I am using Alfresco 3.2 in Tomcat 6.0.18 with MySQL 5 on the backend.&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;Thanks,&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;Colin.&lt;/SPAN&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Wed, 23 Jun 2010 17:59:31 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-archive/performing-problems-as-repository-nodes-increases/m-p/169092#M122505</guid>
      <dc:creator>colindstephenso</dc:creator>
      <dc:date>2010-06-23T17:59:31Z</dc:date>
    </item>
  </channel>
</rss>

