<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Search content without extension in Alfresco Forum</title>
    <link>https://connect.hyland.com/t5/alfresco-forum/search-content-without-extension/m-p/6872#M3321</link>
    <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;I can't say how SOLR / Lucene handles this at a low level. I just remember running into maxBooleanClause limits with Alfresco SOLR in the past because of the way Alfresco rewrote wildcard queries before sending them off to the SOLR / Lucene layer. This may have changed in Alfresco 5.0 or later versions, though...&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
    <pubDate>Mon, 13 Mar 2017 11:50:31 GMT</pubDate>
    <dc:creator>afaust</dc:creator>
    <dc:date>2017-03-13T11:50:31Z</dc:date>
    <item>
      <title>Search content without extension</title>
      <link>https://connect.hyland.com/t5/alfresco-forum/search-content-without-extension/m-p/6868#M3317</link>
      <description>Hello, I use Alfresco Community 5.0.d and I would like to know if it is possible to search in Alfresco for all the files that don't have an extension. I do not know how to do the search. Thanks,&amp;nbsp;Matthieu</description>
      <pubDate>Fri, 10 Mar 2017 15:01:14 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-forum/search-content-without-extension/m-p/6868#M3317</guid>
      <dc:creator>mattjourdan</dc:creator>
      <dc:date>2017-03-10T15:01:14Z</dc:date>
    </item>
    <item>
      <title>Re: Search content without extension</title>
      <link>https://connect.hyland.com/t5/alfresco-forum/search-content-without-extension/m-p/6869#M3318</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;It depends on which search you want to use. If you use the "Aikau" search in Share or the Alfresco FTS, the search string !=cm:name:*.??? should do it. It should find all nodes whose name does not end with a three-character extension.&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Sat, 11 Mar 2017 18:32:11 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-forum/search-content-without-extension/m-p/6869#M3318</guid>
      <dc:creator>mehe</dc:creator>
      <dc:date>2017-03-11T18:32:11Z</dc:date>
    </item>
    <item>
      <title>Re: Search content without extension</title>
      <link>https://connect.hyland.com/t5/alfresco-forum/search-content-without-extension/m-p/6870#M3319</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;The question isn't necessarily a matter of which UI you use (Aikau faceted search or Node Browser, for instance), but whether the search services support this type of query. The problem with a wildcard-based approach in FTS is that it will, by design, only scale to a certain number of documents in the system. This is a result of how the query is translated to the underlying Lucene system in SOLR. Also, the pattern *.??? assumes that all extensions are three-letter extensions, which might have been the standard in the old DOS 8.3 world, but all modern MS Office extensions are four-letter ones.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Without having done a similar query myself on a large document base (i.e. more than just a couple of tens of thousands of documents), I would assume the best way to handle this is a CMIS query using the LIKE operator on cmis:name. The reasoning is that a CMIS query using LIKE can actually be run against the database instead of the SOLR index, and thus is not limited by the index query rewrite restrictions. The only thing you need to ensure is that the additional indexes for transactional metadata queries have been applied on the database system.&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Mon, 13 Mar 2017 08:42:37 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-forum/search-content-without-extension/m-p/6870#M3319</guid>
      <dc:creator>afaust</dc:creator>
      <dc:date>2017-03-13T08:42:37Z</dc:date>
    </item>
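    <!--
    Editorial sketch of the point made above about the pattern *.??? and four-letter extensions. It uses Python's standard fnmatch module as a stand-in, on the assumption that the * and ? wildcards in the search string behave like shell-style globs in this respect; it does not call Alfresco or SOLR, and the sample file names are hypothetical.

```python
from fnmatch import fnmatchcase

# Hypothetical sample names, not taken from the thread.
SAMPLE_NAMES = ["README", "report.doc", "report.docx", "slides.pptx", "notes.txt"]


def matches_three_char_extension(name: str) -> bool:
    """Return True if the name ends with a dot followed by exactly three characters."""
    return fnmatchcase(name, "*.???")


def reported_as_without_extension(names):
    """Names that a negated '*.???' pattern would report as having no extension."""
    return [n for n in names if not matches_three_char_extension(n)]


if __name__ == "__main__":
    # Four-letter extensions such as .docx slip through as false positives.
    print(reported_as_without_extension(SAMPLE_NAMES))
```

    Under these assumptions, report.docx and slides.pptx are reported alongside README, which is the weakness of the three-character pattern described in the post.
    -->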
    <item>
      <title>Re: Search content without extension</title>
      <link>https://connect.hyland.com/t5/alfresco-forum/search-content-without-extension/m-p/6871#M3320</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Hi Axel, I mentioned "Aikau" because it's the easiest way to test the FTS string. The query performs well on large document sets (tested with a 1,000,000-document repository), but paging through a large result set gets slower with each following page.&lt;/P&gt;&lt;P&gt;It's true that it finds only three-character extensions, but it is easy to adapt &lt;span class="lia-unicode-emoji" title=":slightly_smiling_face:"&gt;🙂&lt;/span&gt;&lt;/P&gt;&lt;P&gt;I used ??? because I thought Solr would internally invert the query string (???.*), which would not be so expensive. Do you know if this is correct?&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Mon, 13 Mar 2017 11:32:17 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-forum/search-content-without-extension/m-p/6871#M3320</guid>
      <dc:creator>mehe</dc:creator>
      <dc:date>2017-03-13T11:32:17Z</dc:date>
    </item>
    <item>
      <title>Re: Search content without extension</title>
      <link>https://connect.hyland.com/t5/alfresco-forum/search-content-without-extension/m-p/6872#M3321</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;I can't say how SOLR / Lucene handles this at a low level. I just remember running into maxBooleanClause limits with Alfresco SOLR in the past because of the way Alfresco rewrote wildcard queries before sending them off to the SOLR / Lucene layer. This may have changed in Alfresco 5.0 or later versions, though...&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Mon, 13 Mar 2017 11:50:31 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-forum/search-content-without-extension/m-p/6872#M3321</guid>
      <dc:creator>afaust</dc:creator>
      <dc:date>2017-03-13T11:50:31Z</dc:date>
    </item>
    <item>
      <title>Re: Search content without extension</title>
      <link>https://connect.hyland.com/t5/alfresco-forum/search-content-without-extension/m-p/6873#M3322</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;The max boolean clauses limit should be no problem here; it hits you when using big "or" conjunctions. I hoped to eliminate that by using the '=' operator. (You see that I used 'hope'; what would we do without it &lt;span class="lia-unicode-emoji" title=":slightly_smiling_face:"&gt;🙂&lt;/span&gt;)&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Mon, 13 Mar 2017 12:17:41 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-forum/search-content-without-extension/m-p/6873#M3322</guid>
      <dc:creator>mehe</dc:creator>
      <dc:date>2017-03-13T12:17:41Z</dc:date>
    </item>
    <item>
      <title>Re: Search content without extension</title>
      <link>https://connect.hyland.com/t5/alfresco-forum/search-content-without-extension/m-p/6874#M3323</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Hi:&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;I tried in Alfresco Search with a small set&amp;nbsp;in a&amp;nbsp;site --&amp;gt; &lt;STRONG&gt;TYPE:"cm:content" AND !=cm:name:*.*&amp;nbsp;&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;And then I played a little bit with the mimetype&amp;nbsp;facet, considering the "Binary File (Octet Stream)", HTML and text mimetypes. I obtained a meaningful list, although not an exactly accurate one.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;- What about a database query for doing LIKEs?&lt;/P&gt;&lt;P&gt;- What about a recursive JavaScript function checking the filenames with some JS regex?&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Regards.&lt;/P&gt;&lt;P&gt;--C.&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Mon, 13 Mar 2017 23:13:48 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-forum/search-content-without-extension/m-p/6874#M3323</guid>
      <dc:creator>cesarista</dc:creator>
      <dc:date>2017-03-13T23:13:48Z</dc:date>
    </item>
    <item>
      <title>Re: Search content without extension</title>
      <link>https://connect.hyland.com/t5/alfresco-forum/search-content-without-extension/m-p/6875#M3324</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Recursive analysis via JavaScript is out of the question. It will be extremely slow, load too much data into memory (overwhelming caches) and potentially lock rows/tables in the DB (some DB systems have lock escalation functionality when too many rows are read in a transaction).&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;As I said, you could try CMIS SQL queries using LIKE against the DB. In that case you would basically only test for the presence of a dot, e.g. with a query using the clause WHERE cmis:name NOT LIKE '%.%'...&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Tue, 14 Mar 2017 07:55:19 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-forum/search-content-without-extension/m-p/6875#M3324</guid>
      <dc:creator>afaust</dc:creator>
      <dc:date>2017-03-14T07:55:19Z</dc:date>
    </item>
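    <!--
    Editorial sketch of the NOT LIKE '%.%' idea from the post above. The CMIS statement in CMIS_QUERY is one plausible way to phrase it (cmis:document, cmis:objectId and cmis:name are standard CMIS names, but the statement was not run against Alfresco here). The function below only illustrates the LIKE semantics, using an in-memory SQLite table as a stand-in for the repository database; the table layout and sample names are hypothetical.

```python
import sqlite3

# A plausible CMIS phrasing of the suggestion; not executed by this sketch.
CMIS_QUERY = (
    "SELECT cmis:objectId, cmis:name FROM cmis:document "
    "WHERE cmis:name NOT LIKE '%.%'"
)


def names_without_dot(names):
    """Return the names containing no dot, selected with SQL NOT LIKE '%.%'.

    SQLite is only a stand-in here to show how the LIKE pattern behaves.
    """
    conn = sqlite3.connect(":memory:")
    try:
        conn.execute("CREATE TABLE document (name TEXT)")
        conn.executemany(
            "INSERT INTO document (name) VALUES (?)", [(n,) for n in names]
        )
        rows = conn.execute(
            "SELECT name FROM document WHERE name NOT LIKE '%.%' ORDER BY name"
        ).fetchall()
        return [row[0] for row in rows]
    finally:
        conn.close()


if __name__ == "__main__":
    print(CMIS_QUERY)
    print(names_without_dot(["README", "report.docx", "Makefile", ".hidden"]))
```

    Note that the pattern tests only for the presence of a dot anywhere in the name, so a name such as .hidden is treated as having an extension.
    -->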
    <item>
      <title>Re: Search content without extension</title>
      <link>https://connect.hyland.com/t5/alfresco-forum/search-content-without-extension/m-p/6876#M3325</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;I think you could use a recursive script, but only for admin purposes. The runtime of the script would be long, and the browser will probably run into a timeout error. There should be no lock problem, though, because the script would only do read access, which results in "shared locks" on the DB that do not cause lock escalations. BUT if there is any insert or update request on the dataset, it will block the script until the update/insert is completed.&lt;/P&gt;&lt;P&gt;But your CMIS variant is far better. Is there any restriction on the number of fetched results when executing a CMIS DB query, as there was in the early search service?&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Tue, 14 Mar 2017 09:00:46 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-forum/search-content-without-extension/m-p/6876#M3325</guid>
      <dc:creator>mehe</dc:creator>
      <dc:date>2017-03-14T09:00:46Z</dc:date>
    </item>
    <item>
      <title>Re: Search content without extension</title>
      <link>https://connect.hyland.com/t5/alfresco-forum/search-content-without-extension/m-p/6877#M3326</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;DIV class=""&gt;&lt;P&gt;Hi,&lt;/P&gt;&lt;P style="min-height: 8pt; padding: 0px;"&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thanks a lot for your answers.&lt;/P&gt;&lt;P&gt;I will try CMIS SQL queries.&lt;/P&gt;&lt;P style="min-height: 8pt; padding: 0px;"&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Matthieu&lt;/P&gt;&lt;/DIV&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Tue, 14 Mar 2017 09:12:49 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-forum/search-content-without-extension/m-p/6877#M3326</guid>
      <dc:creator>mattjourdan</dc:creator>
      <dc:date>2017-03-14T09:12:49Z</dc:date>
    </item>
    <item>
      <title>Re: Search content without extension</title>
      <link>https://connect.hyland.com/t5/alfresco-forum/search-content-without-extension/m-p/6878#M3327</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Hi:&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;You can try CMIS SQL queries in the&amp;nbsp;Node Browser (cmis-strict or cmis-alfresco) or in CMIS Workbench first, before writing custom CMIS code, for example in Java or Python.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Regards.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;--C.&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Tue, 14 Mar 2017 09:39:59 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-forum/search-content-without-extension/m-p/6878#M3327</guid>
      <dc:creator>cesarista</dc:creator>
      <dc:date>2017-03-14T09:39:59Z</dc:date>
    </item>
    <item>
      <title>Re: Search content without extension</title>
      <link>https://connect.hyland.com/t5/alfresco-forum/search-content-without-extension/m-p/6879#M3328</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;A last update for this one:&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;TYPE:"cm:content" !=cm:name:*.?*&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;This did it for me. It found all files without an extension of at least one character. AND is implicit in the newer Alfresco versions.&lt;/P&gt;&lt;P&gt;&lt;A __default_attr="80296" __jive_macro_name="user" _jive_internal="true" data-id="80296" data-objecttype="3" data-type="person" href="https://community.alfresco.com/people/cesarista" rel="nofollow noopener noreferrer"&gt;cesarista&lt;/A&gt;: the cm:content filter was a good idea, I missed it in the first shot (boaah... so many nodes without extension &lt;img id="smileyhappy" class="emoticon emoticon-smileyhappy" src="https://connect.hyland.com/i/smilies/16x16_smiley-happy.png" alt="Smiley Happy" title="Smiley Happy" /&gt;&amp;nbsp;)&amp;nbsp;&lt;/P&gt;&lt;P&gt;I used the slingshot search via an AngularJS SPA, so the count of documents without an extension was available within milliseconds. This is a feature of Solr, which gives you the count of matches directly in the result header.&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Thu, 23 Mar 2017 17:28:18 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-forum/search-content-without-extension/m-p/6879#M3328</guid>
      <dc:creator>mehe</dc:creator>
      <dc:date>2017-03-23T17:28:18Z</dc:date>
    </item>
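    <!--
    Editorial sketch of how the final pattern *.?* differs from the earlier *.* on edge cases. As before, Python's fnmatch is only a stand-in, on the assumption that the wildcards behave like shell-style globs; nothing here queries Alfresco, and the helper names and sample file names are hypothetical.

```python
from fnmatch import fnmatchcase


def has_extension(name: str) -> bool:
    """True if the name contains a dot followed by at least one character."""
    return fnmatchcase(name, "*.?*")


def count_without_extension(names) -> int:
    """Count the names that the negated '*.?*' pattern would select."""
    return sum(1 for n in names if not has_extension(n))


if __name__ == "__main__":
    # A trailing dot matches '*.*' but not '*.?*'.
    for name in ["README", "notes.", "report.docx"]:
        print(name, fnmatchcase(name, "*.*"), has_extension(name))
```

    Under these assumptions, a name with a trailing dot such as notes. counts as having no extension with *.?*, whereas *.* would treat it as having one.
    -->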
    <item>
      <title>Re: Search content without extension</title>
      <link>https://connect.hyland.com/t5/alfresco-forum/search-content-without-extension/m-p/6880#M3329</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;As I wrote in another thread (&lt;A href="https://migration33.stage.lithium.com/thread/230576"&gt;Get all Childs of a Node&lt;/A&gt;):&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;BLOCKQUOTE class="jive_macro_quote jive-quote jive_text_macro"&gt;&lt;P&gt;I totally forgot about&amp;nbsp;&lt;A href="https://github.com/ciber/alfresco-js-batch-executer" rel="nofollow noopener noreferrer" target="_blank"&gt;GitHub - ciber/alfresco-js-batch-executer: Alfresco easy bulk processing with JavaScript&lt;/A&gt;&lt;/P&gt;&lt;P&gt;This gives you a new root scope object which enables you to write a server-side JavaScript that iterates over the elements without the common&amp;nbsp;problems like runtime, memory consumption, transaction size...&lt;/P&gt;&lt;/BLOCKQUOTE&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Just for completeness...&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Tue, 04 Apr 2017 20:42:18 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-forum/search-content-without-extension/m-p/6880#M3329</guid>
      <dc:creator>mehe</dc:creator>
      <dc:date>2017-04-04T20:42:18Z</dc:date>
    </item>
  </channel>
</rss>

