<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Weird Search Result Issue in Alfresco Archive</title>
    <link>https://connect.hyland.com/t5/alfresco-archive/weird-search-result-issue/m-p/210078#M163208</link>
    <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;BLOCKQUOTE class="jive-quote"&gt;It is not the "index" (I guess you meant the Lucene query) that is cutting the result list, but rather ACL evaluation.&lt;/BLOCKQUOTE&gt;&lt;BR /&gt;&lt;SPAN&gt;To make it clear, I was talking about the indexing-and-searching component as a whole: that thingy you get as the SearchService bean, which one has to use in a black-box manner.&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;Lucene is hidden beneath it, wrapped by an ACL checker, which also cuts the result list due to time constraints.&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;BLOCKQUOTE class="jive-quote"&gt;The number of hits returned from Lucene is correct.&lt;/BLOCKQUOTE&gt;&lt;BR /&gt;&lt;SPAN&gt;Not in a cluster environment. Some transactions might be missing from the local Lucene index of an Alfresco instance.&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;Regards,&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;&amp;nbsp; lothar&lt;/SPAN&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
    <pubDate>Mon, 09 Feb 2009 22:18:34 GMT</pubDate>
    <dc:creator>lotharm</dc:creator>
    <dc:date>2009-02-09T22:18:34Z</dc:date>
    <item>
      <title>Weird Search Result Issue</title>
      <link>https://connect.hyland.com/t5/alfresco-archive/weird-search-result-issue/m-p/210069#M163199</link>
      <description>We are running Alfresco version 2.9 in 4.2.2.GA on Linux 2.6.18-8.el5 (i386).&amp;nbsp; We have created some custom scripts to manage driver documents.&amp;nbsp; They check whether a driver has all the required documents on file and when those documents need to be updated (they have expired or will expire soon).&amp;nbsp; We are currently ha</description>
      <pubDate>Thu, 29 Jan 2009 19:50:23 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-archive/weird-search-result-issue/m-p/210069#M163199</guid>
      <dc:creator>mcook</dc:creator>
      <dc:date>2009-01-29T19:50:23Z</dc:date>
    </item>
    <item>
      <title>Re: Weird Search Result Issue</title>
      <link>https://connect.hyland.com/t5/alfresco-archive/weird-search-result-issue/m-p/210070#M163200</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;SPAN&gt;How many hits do you think your Lucene query typically returns? If the number of Lucene hits is large (close to 1000 or more), then the ACL evaluation process could affect the number of nodes found.&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;There are a few parameters which affect this behaviour:&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;1. the state of the ACL cache before the repository is queried&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;2. the size of the ACL cache&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;3. system.acl.maxPermissionCheckTimeMillis, which defaults to 10000 ms and defines the maximum time that will be spent on ACL evaluation&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;4. system.acl.maxPermissionChecks, which defaults to 1000 and defines the maximum number of ACL evaluations that will be performed on the collection of hits returned from the Lucene query. If the number of hits exceeds this number, the rest will simply be discarded.&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;Hope this helps,&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;Denis&lt;/SPAN&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Thu, 29 Jan 2009 21:56:02 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-archive/weird-search-result-issue/m-p/210070#M163200</guid>
      <dc:creator>dhalupa</dc:creator>
      <dc:date>2009-01-29T21:56:02Z</dc:date>
    </item>
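The effect of the two limits listed above can be illustrated with a small simulation. This is only a sketch, not Alfresco code: the function filterWithLimits, its parameters, and the node names are hypothetical, and the real ACL evaluation inside the repository may differ in detail. It assumes each raw hit costs one permission check and that checking stops as soon as either the check cap or the time budget is reached.

```javascript
// Hypothetical simulation, not Alfresco code: applies a check cap and a time
// budget to a list of raw index hits, in the way the post above describes.
function filterWithLimits(hits, canRead, maxChecks, maxTimeMillis, now) {
  const start = now();
  const allowed = [];
  let checks = 0;
  for (const hit of hits) {
    // Stop once either limit is reached; the remaining hits are dropped unchecked.
    if (checks >= maxChecks || now() - start >= maxTimeMillis) {
      break;
    }
    checks += 1;
    if (canRead(hit)) {
      allowed.push(hit);
    }
  }
  // cutOff is true when some hits were never checked.
  return { allowed: allowed, checked: checks, cutOff: checks !== hits.length };
}

// 1500 readable hits, evaluated with the default values quoted in the post
// (1000 checks, 10000 ms).
const hits = Array.from({ length: 1500 }, (unused, i) => "node-" + i);
const result = filterWithLimits(hits, () => true, 1000, 10000, Date.now);
console.log(result.allowed.length, result.cutOff);
```

With these inputs only the first 1000 of the 1500 hits are kept, even though the caller could read all of them. A search returning about 450 hits stays under the check cap, so in this model it would only be cut by the time budget.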
    <item>
      <title>Re: Weird Search Result Issue</title>
      <link>https://connect.hyland.com/t5/alfresco-archive/weird-search-result-issue/m-p/210071#M163201</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;SPAN&gt;Thank you for the advice.&amp;nbsp; However, I do not think that is the cause of our current issue.&amp;nbsp; Right now, the search is returning about 450 files, but in the future it could return many more.&amp;nbsp; I'll look into changing those flags and hope this fixes things, but I remain skeptical that it will.&lt;/SPAN&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Fri, 30 Jan 2009 14:56:34 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-archive/weird-search-result-issue/m-p/210071#M163201</guid>
      <dc:creator>mcook</dc:creator>
      <dc:date>2009-01-30T14:56:34Z</dc:date>
    </item>
    <item>
      <title>Re: Weird Search Result Issue</title>
      <link>https://connect.hyland.com/t5/alfresco-archive/weird-search-result-issue/m-p/210072#M163202</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;SPAN&gt;If I understood you correctly, this is some kind of system task which runs periodically. If this is the case, then there is one other thing you might consider. You could try using the SearchService without the transaction, security and audit interceptors applied. The id of that bean is "searchService", and in that case you will bypass ACL evaluation completely. The only thing you have to be careful about is that you have to create transaction boundaries yourself, since they will not be handled by Spring. I'm not sure how you would access this bean from JavaScript though, since I'm pretty sure that it is not injected into the JavaScript context.&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;Kind regards,&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;Denis&lt;/SPAN&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Fri, 30 Jan 2009 16:07:28 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-archive/weird-search-result-issue/m-p/210072#M163202</guid>
      <dc:creator>dhalupa</dc:creator>
      <dc:date>2009-01-30T16:07:28Z</dc:date>
    </item>
    <item>
      <title>Re: Weird Search Result Issue</title>
      <link>https://connect.hyland.com/t5/alfresco-archive/weird-search-result-issue/m-p/210073#M163203</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;SPAN&gt;I have solved our issue, but I never found out the root cause of it.&amp;nbsp; I would like to figure that out however, to prevent this from happening again.&amp;nbsp; The documents were either not indexed or improperly indexed.&amp;nbsp; A server restart solved the issue because by default Alfresco does some reindexing on startup.&amp;nbsp; At least this is my best guess.&amp;nbsp; Our server needs to remain up as much as possible (only restarted during maintenance windows); however, if this happens again we will inevitably require a restart unless another solution is found.&amp;nbsp; Does anyone have any thoughts?&lt;/SPAN&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Wed, 04 Feb 2009 21:14:05 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-archive/weird-search-result-issue/m-p/210073#M163203</guid>
      <dc:creator>mcook</dc:creator>
      <dc:date>2009-02-04T21:14:05Z</dc:date>
    </item>
    <item>
      <title>Re: Weird Search Result Issue</title>
      <link>https://connect.hyland.com/t5/alfresco-archive/weird-search-result-issue/m-p/210074#M163204</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;SPAN&gt;How is the daily job triggered? Perhaps it runs as the guest user?&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;However, I would like to comment on the general approach you took, because we also took the same search-the-index approach once, but concluded it is not the right way. &lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;Querying the index for a certain property value (valid, expired or the like) to produce report-like lists seems like a reasonable idea. But it turns out that&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;the result list is not stable. That means running the same query twice &lt;/SPAN&gt;&lt;EM&gt;possibly&lt;/EM&gt;&lt;SPAN&gt; gives a longer list. &lt;/SPAN&gt;&lt;EM&gt;Possibly&lt;/EM&gt;&lt;SPAN&gt;, because it is usually not visible on small systems, but with heavier load it might happen more often. The driving factors are:&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;* A loaded system may hit the value of system.acl.maxPermissionCheckTimeMillis (10 seconds).&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;* A big result list may hit the value of system.acl.maxPermissionChecks (1000).&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;* In a cluster setup, where every instance has its own index, a document might be missing from the index for a short time.&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;…and the result will just be cut. Very difficult to reproduce, too.&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;Using the index for anything other than full-text search is therefore debatable. In my eyes, a lookup by a property value as above is not a job for a full-text index; instead it is a job for the relational database. &lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;I think Alfresco did a good job on the full-text index. It is fast, respects transactions, and the results for full-text search are well ranked. But it has to be used for what it was built for. 
&lt;/SPAN&gt;&lt;EM&gt;By design&lt;/EM&gt;&lt;SPAN&gt; the index is cutting the result list. This improves search speed, and there is no need for the 10,000th result hit in a full-text search. Does anyone ever go to the second page on Google?&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;We solved our issue successfully by using direct database queries through Hibernate.&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;Hoping this gives some guidance,&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;&amp;nbsp; lothar&lt;/SPAN&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Thu, 05 Feb 2009 09:32:26 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-archive/weird-search-result-issue/m-p/210074#M163204</guid>
      <dc:creator>lotharm</dc:creator>
      <dc:date>2009-02-05T09:32:26Z</dc:date>
    </item>
    <item>
      <title>Re: Weird Search Result Issue</title>
      <link>https://connect.hyland.com/t5/alfresco-archive/weird-search-result-issue/m-p/210075#M163205</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;SPAN&gt;Is it possible to do a Hibernate query from the JavaScript API?&amp;nbsp; A quick Google search did not turn up much.&lt;/SPAN&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Mon, 09 Feb 2009 18:04:47 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-archive/weird-search-result-issue/m-p/210075#M163205</guid>
      <dc:creator>mcook</dc:creator>
      <dc:date>2009-02-09T18:04:47Z</dc:date>
    </item>
    <item>
      <title>Re: Weird Search Result Issue</title>
      <link>https://connect.hyland.com/t5/alfresco-archive/weird-search-result-issue/m-p/210076#M163206</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;BLOCKQUOTE class="jive-quote"&gt;How is the daily job triggered? Perhaps it runs as the guest user?&lt;BR /&gt;&lt;EM&gt;By design&lt;/EM&gt; the index is cutting the result list.&lt;/BLOCKQUOTE&gt;&lt;BR /&gt;&lt;SPAN&gt;It is not the "index" (I guess you meant the Lucene query) that is cutting the result list, but rather ACL evaluation. The number of hits returned from Lucene is correct.&lt;/SPAN&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Mon, 09 Feb 2009 18:47:53 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-archive/weird-search-result-issue/m-p/210076#M163206</guid>
      <dc:creator>dhalupa</dc:creator>
      <dc:date>2009-02-09T18:47:53Z</dc:date>
    </item>
    <item>
      <title>Re: Weird Search Result Issue</title>
      <link>https://connect.hyland.com/t5/alfresco-archive/weird-search-result-issue/m-p/210077#M163207</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;BLOCKQUOTE class="jive-quote"&gt;Is it possible to do a Hibernate query from the JavaScript API?&amp;nbsp; A quick Google search did not turn up much.&lt;/BLOCKQUOTE&gt;&lt;BR /&gt;&lt;SPAN&gt;No, it is not possible.&lt;/SPAN&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Mon, 09 Feb 2009 18:49:02 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-archive/weird-search-result-issue/m-p/210077#M163207</guid>
      <dc:creator>dhalupa</dc:creator>
      <dc:date>2009-02-09T18:49:02Z</dc:date>
    </item>
    <item>
      <title>Re: Weird Search Result Issue</title>
      <link>https://connect.hyland.com/t5/alfresco-archive/weird-search-result-issue/m-p/210078#M163208</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;BLOCKQUOTE class="jive-quote"&gt;It is not the "index" (I guess you meant the Lucene query) that is cutting the result list, but rather ACL evaluation.&lt;/BLOCKQUOTE&gt;&lt;BR /&gt;&lt;SPAN&gt;To make it clear, I was talking about the indexing-and-searching component as a whole: that thingy you get as the SearchService bean, which one has to use in a black-box manner.&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;Lucene is hidden beneath it, wrapped by an ACL checker, which also cuts the result list due to time constraints.&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;BLOCKQUOTE class="jive-quote"&gt;The number of hits returned from Lucene is correct.&lt;/BLOCKQUOTE&gt;&lt;BR /&gt;&lt;SPAN&gt;Not in a cluster environment. Some transactions might be missing from the local Lucene index of an Alfresco instance.&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;Regards,&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;&amp;nbsp; lothar&lt;/SPAN&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Mon, 09 Feb 2009 22:18:34 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-archive/weird-search-result-issue/m-p/210078#M163208</guid>
      <dc:creator>lotharm</dc:creator>
      <dc:date>2009-02-09T22:18:34Z</dc:date>
    </item>
    <item>
      <title>Re: Weird Search Result Issue</title>
      <link>https://connect.hyland.com/t5/alfresco-archive/weird-search-result-issue/m-p/210079#M163209</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;BLOCKQUOTE class="jive-quote"&gt;Not in a cluster environment. Some transactions might be missing from the local Lucene index of an Alfresco instance.&lt;/BLOCKQUOTE&gt;&lt;SPAN&gt;Yes, and indexes might also get corrupted, but these are not normal behaviors. I am referring to the situation where indexes are not corrupted and are properly synchronized between cluster nodes. In that case the Lucene query will always return the correct number of results; cutting will occur later, during ACL evaluation.&lt;/SPAN&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Mon, 09 Feb 2009 22:37:50 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-archive/weird-search-result-issue/m-p/210079#M163209</guid>
      <dc:creator>dhalupa</dc:creator>
      <dc:date>2009-02-09T22:37:50Z</dc:date>
    </item>
    <item>
      <title>Re: Weird Search Result Issue</title>
      <link>https://connect.hyland.com/t5/alfresco-archive/weird-search-result-issue/m-p/210080#M163210</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;SPAN&gt;Would you mind showing some examples of using direct database queries through Hibernate?&amp;nbsp; &lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;I thought that direct access to the Alfresco database was not recommended.&amp;nbsp; Did any further problems arise with your solution?&amp;nbsp; &lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;Would an XPath query based on the node service provide consistent/stable results, or would that have the same problem with inconsistent results as Lucene did?&lt;/SPAN&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Mon, 04 May 2009 15:17:08 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-archive/weird-search-result-issue/m-p/210080#M163210</guid>
      <dc:creator>ebell</dc:creator>
      <dc:date>2009-05-04T15:17:08Z</dc:date>
    </item>
    <item>
      <title>Re: Weird Search Result Issue</title>
      <link>https://connect.hyland.com/t5/alfresco-archive/weird-search-result-issue/m-p/210081#M163211</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;SPAN&gt;After some further investigation, I've changed &lt;/SPAN&gt;&lt;PRE class="language-none line-numbers"&gt;&lt;CODE&gt;search.luceneSearch("TYPE:\""+BURRIS_DOC_TYPE+"\" +ASPECT:\""+EXPIREABLE_ASPECT+"\" -ASPECT:\""+SUPERCEDED_ASPECT+"\" -ASPECT:\""+TERMINATED_ASPECT+"\"");&lt;/CODE&gt;&lt;/PRE&gt;&lt;SPAN&gt; to &lt;/SPAN&gt;&lt;PRE class="language-none line-numbers"&gt;&lt;CODE&gt;search.xpathSearch("TYPE:\""+BURRIS_DOC_TYPE+"\" +ASPECT:\""+EXPIREABLE_ASPECT+"\" -ASPECT:\""+SUPERCEDED_ASPECT+"\" -ASPECT:\""+TERMINATED_ASPECT+"\"");&lt;/CODE&gt;&lt;/PRE&gt;&lt;SPAN&gt;etc. in all instances in the JavaScript code.&amp;nbsp; &lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;It still appears that Lucene is being run to do part of the searching.&amp;nbsp; However, as in the earlier post above, I'm looking to create a stable report every time.&amp;nbsp; Am I heading in the right direction, or would using Hibernate be the way to go?&lt;/SPAN&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Wed, 06 May 2009 19:34:54 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-archive/weird-search-result-issue/m-p/210081#M163211</guid>
      <dc:creator>ebell</dc:creator>
      <dc:date>2009-05-06T19:34:54Z</dc:date>
    </item>
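One detail in the snippet above is worth noting: the string keeps its Lucene syntax (TYPE:, +ASPECT:, -ASPECT:) after the switch to search.xpathSearch. As far as I know that method expects an XPath expression, so the two calls are probably not interchangeable. The sketch below only shows how the Lucene form of the query string could be assembled in one place. The helper buildLuceneQuery and the placeholder QNames are hypothetical, since the real values of BURRIS_DOC_TYPE and the aspect constants are not shown in the thread.

```javascript
// Hypothetical helper, not part of the Alfresco API: assembles the Lucene
// query string used in the post from a type and two lists of aspects.
function buildLuceneQuery(type, requiredAspects, excludedAspects) {
  const quote = (name) => '"' + name + '"';
  const parts = ["TYPE:" + quote(type)];
  for (const aspect of requiredAspects) {
    parts.push("+ASPECT:" + quote(aspect));
  }
  for (const aspect of excludedAspects) {
    parts.push("-ASPECT:" + quote(aspect));
  }
  return parts.join(" ");
}

// Placeholder QNames, assumed for illustration only.
const query = buildLuceneQuery(
  "{example.model}driverDoc",
  ["{example.model}expireable"],
  ["{example.model}superceded", "{example.model}terminated"]
);
console.log(query);
// Inside an Alfresco script this string would be passed to search.luceneSearch(query).
```

Building the string in a single function keeps the quoting consistent across every search in the script. It does not change how results are cut; the limits discussed earlier in the thread still apply to any query that goes through the search service.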
  </channel>
</rss>

