<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Lucene PATH wildcard query performance in Alfresco Archive</title>
    <link>https://connect.hyland.com/t5/alfresco-archive/lucene-path-wildcard-query-performance/m-p/249336#M202466</link>
    <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;SPAN&gt;Hi,&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;I'm developing an application that will use Alfresco as a backend content repository.&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;I have a use case I'm implementing currently, where I would need to run frequent PATH wildcard queries&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;on the repository and I'm trying to figure out if this type of query will pose performance problems.&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;The PATH queries contain wildcards in the middle of the path and are of the form:&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;PRE class="language-none line-numbers"&gt;&lt;CODE&gt;PATH:"/app:company_home/cm:foo/cm:bar/*/*/cm:baz/*"&lt;SPAN class="line-numbers-rows"&gt;&lt;SPAN&gt;‍&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/CODE&gt;&lt;/PRE&gt;&lt;BR /&gt;&lt;SPAN&gt;How does Alfresco implement queries of this form?&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;With Lucene query language this type of query is implemented using the Lucene&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;index but the index seems to only store information about the immediate parent of a node&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;(not ancestors or full path). Is this correct?&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;This seems to imply that the Alfresco query engine can't get the results using a simple&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;index search but needs to process each level of wildcards in the middle of the path separetely, right?&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;In our case the hierarchy levels in which the 2 wildcards in the middle appear would&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;contain a fairly large number of nodes (~1000 each).&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;I'm using Alfresco v3.3 and the Alfresco Web Service client API for running the queries.&lt;/SPAN&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
    <pubDate>Thu, 04 Nov 2010 07:47:45 GMT</pubDate>
    <dc:creator>aspa</dc:creator>
    <dc:date>2010-11-04T07:47:45Z</dc:date>
    <item>
      <title>Lucene PATH wildcard query performance</title>
      <link>https://connect.hyland.com/t5/alfresco-archive/lucene-path-wildcard-query-performance/m-p/249336#M202466</link>
      <description>Hi,I'm developing an application that will use Alfresco as a backend content repository.I have a use case I'm implementing currently, where I would need to run frequent PATH wildcard querieson the repository and I'm trying to figure out if this type of query will pose performance problems.The PATH q</description>
      <pubDate>Thu, 04 Nov 2010 07:47:45 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-archive/lucene-path-wildcard-query-performance/m-p/249336#M202466</guid>
      <dc:creator>aspa</dc:creator>
      <dc:date>2010-11-04T07:47:45Z</dc:date>
    </item>
    <item>
      <title>Re: Lucene PATH wildcard query performance</title>
      <link>https://connect.hyland.com/t5/alfresco-archive/lucene-path-wildcard-query-performance/m-p/249337#M202467</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;SPAN&gt;Hi,&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;from my experience the PATH queries are rather slow and if you would have large repository. &lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;To be sure try to generate many spaces and then try you search.&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;Thanks&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;smicyk&lt;/SPAN&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Sat, 06 Nov 2010 19:30:14 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-archive/lucene-path-wildcard-query-performance/m-p/249337#M202467</guid>
      <dc:creator>smicyk</dc:creator>
      <dc:date>2010-11-06T19:30:14Z</dc:date>
    </item>
  </channel>
</rss>

