<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: cmis search: jon doe, john p. doe, john doe in Alfresco Archive</title>
    <link>https://connect.hyland.com/t5/alfresco-archive/cmis-search-jon-doe-john-p-doe-john-doe/m-p/259959#M213089</link>
    <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;SPAN&gt;thanks Jeff, but I may have poorly worded the question.&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;Given a full name phrase that a user puts in such as "john doe", how can I pass that given phrase to return similar documents, such as documents that contain the following that would be considered an 'expected' result for the user supplied phrase:&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;Jon Doe&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;John P Doe&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;John Doe&lt;/SPAN&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
    <pubDate>Fri, 21 Dec 2012 18:25:01 GMT</pubDate>
    <dc:creator>dhartford</dc:creator>
    <dc:date>2012-12-21T18:25:01Z</dc:date>
    <item>
      <title>cmis search: jon doe, john p. doe, john doe</title>
      <link>https://connect.hyland.com/t5/alfresco-archive/cmis-search-jon-doe-john-p-doe-john-doe/m-p/259957#M213087</link>
      <description>Hi all,I'm wondering if someone has already found an appropriate approach for dealing with search/score for names like these below with CMIS, such that a search phrase "jon doe" or "john doe" returns all three phrases (i.e. all documents that have cmis:name of any of these 3):Jon DoeJohn DoeJohn P.</description>
      <pubDate>Mon, 17 Dec 2012 15:52:49 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-archive/cmis-search-jon-doe-john-p-doe-john-doe/m-p/259957#M213087</guid>
      <dc:creator>dhartford</dc:creator>
      <dc:date>2012-12-17T15:52:49Z</dc:date>
    </item>
    <item>
      <title>Re: cmis search: jon doe, john p. doe, john doe</title>
      <link>https://connect.hyland.com/t5/alfresco-archive/cmis-search-jon-doe-john-p-doe-john-doe/m-p/259958#M213088</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;SPAN&gt;This would do it:&lt;/SPAN&gt;&lt;BR /&gt;&lt;PRE class="language-none line-numbers"&gt;&lt;CODE&gt;SELECT * FROM cmis:document&amp;nbsp; WHERE&amp;nbsp; cmis:name like 'Jo%Doe'&lt;SPAN class="line-numbers-rows"&gt;&lt;SPAN&gt;‍&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/CODE&gt;&lt;/PRE&gt;&lt;BR /&gt;&lt;SPAN&gt;So would this:&lt;/SPAN&gt;&lt;BR /&gt;&lt;PRE class="language-none line-numbers"&gt;&lt;CODE&gt;SELECT * FROM cmis:document&amp;nbsp; WHERE&amp;nbsp; cmis:name = 'Jon Doe' or cmis:name = 'John Doe' or cmis:name = 'John P. Doe'&lt;SPAN class="line-numbers-rows"&gt;&lt;SPAN&gt;‍&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/CODE&gt;&lt;/PRE&gt;&lt;BR /&gt;&lt;SPAN&gt;Jeff&lt;/SPAN&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Thu, 20 Dec 2012 15:53:17 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-archive/cmis-search-jon-doe-john-p-doe-john-doe/m-p/259958#M213088</guid>
      <dc:creator>jpotts</dc:creator>
      <dc:date>2012-12-20T15:53:17Z</dc:date>
    </item>
    <item>
      <title>Re: cmis search: jon doe, john p. doe, john doe</title>
      <link>https://connect.hyland.com/t5/alfresco-archive/cmis-search-jon-doe-john-p-doe-john-doe/m-p/259959#M213089</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;SPAN&gt;thanks Jeff, but I may have poorly worded the question.&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;Given a full name phrase that a user puts in such as "john doe", how can I pass that given phrase to return similar documents, such as documents that contain the following that would be considered an 'expected' result for the user supplied phrase:&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;Jon Doe&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;John P Doe&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;John Doe&lt;/SPAN&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Fri, 21 Dec 2012 18:25:01 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-archive/cmis-search-jon-doe-john-p-doe-john-doe/m-p/259959#M213089</guid>
      <dc:creator>dhartford</dc:creator>
      <dc:date>2012-12-21T18:25:01Z</dc:date>
    </item>
    <item>
      <title>Re: cmis search: jon doe, john p. doe, john doe</title>
      <link>https://connect.hyland.com/t5/alfresco-archive/cmis-search-jon-doe-john-p-doe-john-doe/m-p/259960#M213090</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;SPAN&gt;The CMIS query examples I gave you query against the name property. If instead you want to search the full-text contents, you can use the CONTAINS keyword in a CMIS query language query. So, for example:&lt;/SPAN&gt;&lt;BR /&gt;&lt;PRE class="language-none line-numbers"&gt;&lt;CODE&gt;SELECT * FROM cmis:document WHERE CONTAINS('John Doe')&lt;SPAN class="line-numbers-rows"&gt;&lt;SPAN&gt;‍&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/CODE&gt;&lt;/PRE&gt;&lt;BR /&gt;&lt;SPAN&gt;Returns the documents that contain "John Doe" in the text. In my test, I also get a hit for a document containing "John P. Doe".&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;The CMIS spec (and the underlying Lucene engine embedded in Alfresco) supports wildcarding in full-text searches. So you might expect to be able to search for:&lt;/SPAN&gt;&lt;BR /&gt;&lt;PRE class="language-none line-numbers"&gt;&lt;CODE&gt;SELECT * FROM cmis:document WHERE CONTAINS('Jo*Doe')&lt;SPAN class="line-numbers-rows"&gt;&lt;SPAN&gt;‍&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/CODE&gt;&lt;/PRE&gt;&lt;BR /&gt;&lt;SPAN&gt;to get back documents containing the names you listed.&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;But in my test on 4.2.c Community Edition with Lucene (not SOLR) this returns zero hits. I think it is because of the word break between "John" and "Doe".&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;So, refining the search to:&lt;/SPAN&gt;&lt;BR /&gt;&lt;PRE class="language-none line-numbers"&gt;&lt;CODE&gt;SELECT * FROM cmis:document WHERE CONTAINS('Doe') and CONTAINS('Jo*')&lt;SPAN class="line-numbers-rows"&gt;&lt;SPAN&gt;‍&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/CODE&gt;&lt;/PRE&gt;&lt;BR /&gt;&lt;SPAN&gt;I get hits for docs containing Jon Doe, John P Doe, and John Doe.&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;Jeff&lt;/SPAN&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Wed, 02 Jan 2013 20:39:12 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-archive/cmis-search-jon-doe-john-p-doe-john-doe/m-p/259960#M213090</guid>
      <dc:creator>jpotts</dc:creator>
      <dc:date>2013-01-02T20:39:12Z</dc:date>
    </item>
    <item>
      <title>Re: cmis search: jon doe, john p. doe, john doe</title>
      <link>https://connect.hyland.com/t5/alfresco-archive/cmis-search-jon-doe-john-p-doe-john-doe/m-p/259961#M213091</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;SPAN&gt;thanks for looking into this Jeff.&amp;nbsp; Unfortunately, it looks like it should be better controlled during the content model design phase by making sure 'first name' and 'last name' are distinct fields if it is a critical search field for documents that only have one name.&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;If a document has multiple names, or using fulltext on an OCR'd document, just have to manage expectations (particularly with multi-part names with middle initial, hypenated last names, generational suffixes, etc)&amp;nbsp; &lt;span class="lia-unicode-emoji" title=":slightly_smiling_face:"&gt;🙂&lt;/span&gt;&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;-D&lt;/SPAN&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Thu, 03 Jan 2013 19:52:54 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-archive/cmis-search-jon-doe-john-p-doe-john-doe/m-p/259961#M213091</guid>
      <dc:creator>dhartford</dc:creator>
      <dc:date>2013-01-03T19:52:54Z</dc:date>
    </item>
    <item>
      <title>Re: cmis search: jon doe, john p. doe, john doe</title>
      <link>https://connect.hyland.com/t5/alfresco-archive/cmis-search-jon-doe-john-p-doe-john-doe/m-p/259962#M213092</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;SPAN&gt;Hi&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;You should be able to combine phrase slop and wild-carding&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;CONTAINS('cmis:name:\\'Jo*n Doe\\'~1')&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;~2 will allow more of a gap but will also allow tokens in reverse order but not require then to be next to each other.&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;(~1 will match tokens at the same position which can be odd using lucene as opposed to SOLR)&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;~1 allows a token to be out of place by one it is two moves to reverse the order &lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;Hope this helps&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;Span queries etc is still on the list ….&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;Andy&lt;/SPAN&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Mon, 07 Jan 2013 21:14:18 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-archive/cmis-search-jon-doe-john-p-doe-john-doe/m-p/259962#M213092</guid>
      <dc:creator>andy</dc:creator>
      <dc:date>2013-01-07T21:14:18Z</dc:date>
    </item>
  </channel>
</rss>

