<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Lucene query problem (too much results and lacking results) in Alfresco Archive</title>
    <link>https://connect.hyland.com/t5/alfresco-archive/lucene-query-problem-too-much-results-and-lacking-results/m-p/265530#M218660</link>
    <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;SPAN&gt;My goal is to schedule an action for every file in a given directory (recursively). &lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;I started with a simple action that just prints the file name to the log, and set up the cron + Lucene query configuration in XML as described here: &lt;/SPAN&gt;&lt;A href="http://wiki.alfresco.com/wiki/Scheduled_Actions" rel="nofollow noopener noreferrer"&gt;http://wiki.alfresco.com/wiki/Scheduled_Actions&lt;/A&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;Problem instance:&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;I have a 'testDir' in company home with one file 'testFile'.&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;But my action logs 3 filenames: 'testFile', 'doclib', 'webpreview'.&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;I tried with other directories; 'doclib' and 'webpreview' were always there. Sometimes &lt;/SPAN&gt;&lt;EM&gt;guid-like filenames&lt;/EM&gt;&lt;SPAN&gt; were also printed out, and &lt;/SPAN&gt;&lt;SPAN style="color:#FF0000;"&gt;in one case one of the files was omitted.&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;Important parts of my implementation:&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;Scheduling config:&lt;/SPAN&gt;&lt;BR /&gt;&lt;PRE class="language-none line-numbers"&gt;&lt;CODE&gt;&lt;BR /&gt;&amp;lt;property name="queryTemplate"&amp;gt;&lt;BR /&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;lt;value&amp;gt;+PATH:"/app:company_home/cm:testDir//*"&amp;lt;/value&amp;gt;&lt;BR /&gt;&amp;lt;/property&amp;gt;&lt;BR /&gt;&amp;lt;property name="cronExpression"&amp;gt;&lt;BR /&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;lt;value&amp;gt;0 0/1 * * * ?&amp;lt;/value&amp;gt;&lt;BR /&gt;&amp;lt;/property&amp;gt;&lt;/CODE&gt;&lt;/PRE&gt;&lt;BR /&gt;&lt;SPAN&gt;Filename logging in Action:&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;PRE class="language-none line-numbers"&gt;&lt;CODE&gt;String fileName = (String) nodeService.getProperty(actionedUponNodeRef, ContentModel.PROP_NAME);&lt;BR /&gt;logger.debug("filename: " + fileName);&lt;/CODE&gt;&lt;/PRE&gt;&lt;BR /&gt;&lt;SPAN&gt;So the questions are:&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;- Is this Lucene query somehow incomplete? (In one case an existing file was omitted.) &lt;/SPAN&gt;&lt;EM&gt;EDIT: could this be caused by 'stale' Lucene indexes? &lt;/EM&gt;&lt;BR /&gt;&lt;SPAN&gt;- I guess there are some 'special hidden files' that the Share client does not show. How should I write the Lucene query to get only the files that are 'human relevant' (without the doclib, webpreview, and guid-named files)?&lt;/SPAN&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
    <pubDate>Thu, 12 Jul 2012 13:07:45 GMT</pubDate>
    <dc:creator>michalwrobel</dc:creator>
    <dc:date>2012-07-12T13:07:45Z</dc:date>
    <item>
      <title>Lucene query problem (too much results and lacking results)</title>
      <link>https://connect.hyland.com/t5/alfresco-archive/lucene-query-problem-too-much-results-and-lacking-results/m-p/265530#M218660</link>
      <description>My goal is to schedule an action for every file in a given directory (recursively). I started with a simple action that just prints the file name to the log, and set up the cron + Lucene query configuration in XML as described here: http://wiki.alfresco.com/wiki/Scheduled_Actions Problem instance: I have a 'test</description>
      <pubDate>Thu, 12 Jul 2012 13:07:45 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-archive/lucene-query-problem-too-much-results-and-lacking-results/m-p/265530#M218660</guid>
      <dc:creator>michalwrobel</dc:creator>
      <dc:date>2012-07-12T13:07:45Z</dc:date>
    </item>
    <item>
      <title>Re: Lucene query problem (too much results and lacking results)</title>
      <link>https://connect.hyland.com/t5/alfresco-archive/lucene-query-problem-too-much-results-and-lacking-results/m-p/265531#M218661</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;SPAN&gt;Your Lucene query is asking for the members of a folder and any of its descendants. Your query is not filtering on type, aspect, or anything else. So, you're getting exactly what you asked for. &lt;img id="smileyhappy" class="emoticon emoticon-smileyhappy" src="https://connect.hyland.com/i/smilies/16x16_smiley-happy.png" alt="Smiley Happy" title="Smiley Happy" /&gt; The thumbnails that are generated for things like the document library and the preview (technically these are called "renditions") are stored as children of the objects they are thumbnails of.&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;If you don't want to see those, be more specific. Maybe there is a specific type you are interested in, or some types or aspects you could exclude. For example, you could exclude the thumbnails by doing something like:&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;PATH:"/app:company_home/cm:testDir//*" AND -TYPE:"cm:thumbnail"&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;Jeff&lt;/SPAN&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Thu, 12 Jul 2012 14:38:53 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-archive/lucene-query-problem-too-much-results-and-lacking-results/m-p/265531#M218661</guid>
      <dc:creator>jpotts</dc:creator>
      <dc:date>2012-07-12T14:38:53Z</dc:date>
    </item>
    <item>
      <title>Re: Lucene query problem (too much results and lacking results)</title>
      <link>https://connect.hyland.com/t5/alfresco-archive/lucene-query-problem-too-much-results-and-lacking-results/m-p/265532#M218662</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;SPAN&gt;Thank you very much, that solved the problem! &lt;img id="smileywink" class="emoticon emoticon-smileywink" src="https://connect.hyland.com/i/smilies/16x16_smiley-wink.png" alt="Smiley Wink" title="Smiley Wink" /&gt;&lt;/SPAN&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Mon, 16 Jul 2012 07:51:41 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-archive/lucene-query-problem-too-much-results-and-lacking-results/m-p/265532#M218662</guid>
      <dc:creator>michalwrobel</dc:creator>
      <dc:date>2012-07-16T07:51:41Z</dc:date>
    </item>
  </channel>
</rss>

