<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Lucene query and it's performance results in Alfresco Archive</title>
    <link>https://connect.hyland.com/t5/alfresco-archive/lucene-query-and-it-s-performance-results/m-p/169196#M122603</link>
    <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;SPAN&gt;Hi&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;If you let me nkow the version of Alfresco the explanation is different for older versions ….&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;Paths with //* do the structural query in two parts - the directory and then the leaves.&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;(it is too expensive to reindex a doc each time its parents structiure may change in lucene as you may do for structural queries using an XML database)&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;So /blah/blah//* finds all directories that match the path and then re-queries to find all children. The requery is ordered and about as good as it can gets, although not grouped which may help. It basically depends on the number of directories found.&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;Lucene shuffles which predicte is used to scan through the parts of a conjunctions. However, the PATH work is done up front so should not have any effect.&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;It is done for all of your queries.&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;So it is most likely you are seeing caching improvements to performance.&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;Do you queries either all cold or all warm (ignoring the first result and running each at least twice).&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;Other background processes can also have an efect.&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt; Andy&lt;/SPAN&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
    <pubDate>Mon, 28 Jul 2008 19:52:01 GMT</pubDate>
    <dc:creator>andy</dc:creator>
    <dc:date>2008-07-28T19:52:01Z</dc:date>
    <item>
      <title>Lucene query and it's performance results</title>
      <link>https://connect.hyland.com/t5/alfresco-archive/lucene-query-and-it-s-performance-results/m-p/169191#M122598</link>
      <description>Hello,I don't understand the performance differences of the following queries.First we started of with this query (1) &lt;IMG id="smileytongue" class="emoticon emoticon-smileytongue" src="https://migration33.stage.lithium.com/i/smilies/16x16_smiley-tongue.png" alt="Smiley Tongue" title="Smiley Tongue" /&gt;ATH:"/app:company_home/cm:A_Company/cm:A_Space//*" ‍This query (2) includes a custom property check and was equal in performance&lt;IMG id="smileytongue" class="emoticon emoticon-smileytongue" src="https://migration33.stage.lithium.com/i/smilies/16x16_smiley-tongue.png" alt="Smiley Tongue" title="Smiley Tongue" /&gt;ATH:"/app:company_home/cm:A_Company/cm:A_Space//*"</description>
      <pubDate>Fri, 16 May 2008 09:17:11 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-archive/lucene-query-and-it-s-performance-results/m-p/169191#M122598</guid>
      <dc:creator>kdejaeger</dc:creator>
      <dc:date>2008-05-16T09:17:11Z</dc:date>
    </item>
    <item>
      <title>Re: Lucene query and it's performance results</title>
      <link>https://connect.hyland.com/t5/alfresco-archive/lucene-query-and-it-s-performance-results/m-p/169192#M122599</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;SPAN&gt;I have no answer but that is very interesting.&amp;nbsp; I do alot of custom searches where I use the Path quite a bit so I am very interested in this.&amp;nbsp; Does anyone at Alfresco have the answers to this?&lt;/SPAN&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Mon, 19 May 2008 14:16:28 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-archive/lucene-query-and-it-s-performance-results/m-p/169192#M122599</guid>
      <dc:creator>nyronian</dc:creator>
      <dc:date>2008-05-19T14:16:28Z</dc:date>
    </item>
    <item>
      <title>Re: Lucene query and it's performance results</title>
      <link>https://connect.hyland.com/t5/alfresco-archive/lucene-query-and-it-s-performance-results/m-p/169193#M122600</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;SPAN&gt;I'm by no means a Lucene expert, but wildcards always give me pause - that would be my guess as to why the queries that include the PATH field are slower.&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;Cheers,&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;Peter&lt;/SPAN&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Mon, 19 May 2008 15:23:07 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-archive/lucene-query-and-it-s-performance-results/m-p/169193#M122600</guid>
      <dc:creator>pmonks</dc:creator>
      <dc:date>2008-05-19T15:23:07Z</dc:date>
    </item>
    <item>
      <title>Re: Lucene query and it's performance results</title>
      <link>https://connect.hyland.com/t5/alfresco-archive/lucene-query-and-it-s-performance-results/m-p/169194#M122601</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;SPAN&gt;Not to highjack the question kdejaeger but I have questions around it as well, hopefully my comments help as well.&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;I assume kdejaeger, the reason you are indicating a path is to find "invoices" of a particular company, not all invoices in the system.&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;I have the same issue, I am looking for content under a particular company, so I indicate the path.&amp;nbsp; The * is only to to indicate looking for all children under the path.&amp;nbsp; Otherwise you may have undesirable results and get nodes you do not wish.&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;Is there a another way to acheive the same results?&lt;/SPAN&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Mon, 19 May 2008 17:40:40 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-archive/lucene-query-and-it-s-performance-results/m-p/169194#M122601</guid>
      <dc:creator>nyronian</dc:creator>
      <dc:date>2008-05-19T17:40:40Z</dc:date>
    </item>
    <item>
      <title>Re: Lucene query and it's performance results</title>
      <link>https://connect.hyland.com/t5/alfresco-archive/lucene-query-and-it-s-performance-results/m-p/169195#M122602</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;SPAN&gt;Yes nyronian, that's exactly why I need the path to be there. I somehow think that the //* is doing a slower xpath expansion. We need some professional advise here from an alfresco engineer to know how this gets treated.&amp;nbsp; :wink:&lt;/SPAN&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Tue, 20 May 2008 07:37:39 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-archive/lucene-query-and-it-s-performance-results/m-p/169195#M122602</guid>
      <dc:creator>kdejaeger</dc:creator>
      <dc:date>2008-05-20T07:37:39Z</dc:date>
    </item>
    <item>
      <title>Re: Lucene query and it's performance results</title>
      <link>https://connect.hyland.com/t5/alfresco-archive/lucene-query-and-it-s-performance-results/m-p/169196#M122603</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;SPAN&gt;Hi&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;If you let me nkow the version of Alfresco the explanation is different for older versions ….&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;Paths with //* do the structural query in two parts - the directory and then the leaves.&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;(it is too expensive to reindex a doc each time its parents structiure may change in lucene as you may do for structural queries using an XML database)&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;So /blah/blah//* finds all directories that match the path and then re-queries to find all children. The requery is ordered and about as good as it can gets, although not grouped which may help. It basically depends on the number of directories found.&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;Lucene shuffles which predicte is used to scan through the parts of a conjunctions. However, the PATH work is done up front so should not have any effect.&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;It is done for all of your queries.&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;So it is most likely you are seeing caching improvements to performance.&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;Do you queries either all cold or all warm (ignoring the first result and running each at least twice).&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;Other background processes can also have an efect.&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt; Andy&lt;/SPAN&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Mon, 28 Jul 2008 19:52:01 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-archive/lucene-query-and-it-s-performance-results/m-p/169196#M122603</guid>
      <dc:creator>andy</dc:creator>
      <dc:date>2008-07-28T19:52:01Z</dc:date>
    </item>
  </channel>
</rss>

