<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: RC 1 : search : strange search behaviour in Alfresco Archive</title>
    <link>https://connect.hyland.com/t5/alfresco-archive/rc-1-search-strange-search-behaviour/m-p/4272#M568</link>
    <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;SPAN&gt;Hi,&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;I have use company-relatively-secret document so I'm afraid I can't export my space, right now. If you can't reproduce the bug, I'll try to take them off.&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;My document is a two lines html document of french spoonerisms. Below is a cut and paste of the source. I can also send it by email if necessary.&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;&amp;lt;html&amp;gt;&amp;lt;head&amp;gt;&amp;lt;/head&amp;gt;&amp;lt;body&amp;gt;&amp;lt;p&amp;gt;&amp;lt;font size="2"&amp;gt;La &amp;lt;u&amp;gt;Ch&amp;lt;/u&amp;gt;ine se dresse ÃƒÂ&amp;nbsp; la vue des ni&amp;lt;u&amp;gt;pp&amp;lt;/u&amp;gt;ons&amp;lt;/font&amp;gt;&amp;lt;/p&amp;gt;&amp;lt;p&amp;gt;&amp;lt;font size="2"&amp;gt;&amp;lt;u&amp;gt;T&amp;lt;/u&amp;gt;aisez-vous, en &amp;lt;u&amp;gt;b&amp;lt;/u&amp;gt;as&amp;lt;/font&amp;gt;&amp;lt;/p&amp;gt;&amp;lt;p&amp;gt;&amp;lt;font size="2"&amp;gt;&amp;lt;/font&amp;gt;&amp;lt;/p&amp;gt;&amp;lt;/body&amp;gt;&amp;lt;/html&amp;gt;&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;The file looks like that.&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;La &lt;/SPAN&gt;&lt;SPAN style="text-decoration: underline;"&gt;Ch&lt;/SPAN&gt;&lt;SPAN&gt;ine se dresse ÃƒÂ&amp;nbsp; la vue des ni&lt;/SPAN&gt;&lt;SPAN style="text-decoration: underline;"&gt;pp&lt;/SPAN&gt;&lt;SPAN&gt;ons&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN style="text-decoration: underline;"&gt;T&lt;/SPAN&gt;&lt;SPAN&gt;aisez-vous, en &lt;/SPAN&gt;&lt;SPAN style="text-decoration: underline;"&gt;b&lt;/SPAN&gt;&lt;SPAN&gt;as&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;It is located in the company home of my repository.&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;The unsuccessful search was on "chine", the document contains "&lt;/SPAN&gt;&lt;SPAN style="text-decoration: underline;"&gt;Ch&lt;/SPAN&gt;&lt;SPAN&gt;ine". Unsuccessful too for "nippons", "taisez", "bas" which are the other underlined words.&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;Looking for "vue" or "dresse" retrieves the document.&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;HIH&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;Jerome BATON&lt;/SPAN&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
    <pubDate>Mon, 10 Oct 2005 10:04:32 GMT</pubDate>
    <dc:creator>jbaton</dc:creator>
    <dc:date>2005-10-10T10:04:32Z</dc:date>
    <item>
      <title>RC 1 : search : strange search behaviour</title>
      <link>https://connect.hyland.com/t5/alfresco-archive/rc-1-search-strange-search-behaviour/m-p/4270#M566</link>
      <description>Hi all,I created a very simple document with the web client's editor.Some words were underlined. Searching for those words did not retrieve the document.I use RC1 on w2k</description>
      <pubDate>Mon, 10 Oct 2005 08:58:14 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-archive/rc-1-search-strange-search-behaviour/m-p/4270#M566</guid>
      <dc:creator>jbaton</dc:creator>
      <dc:date>2005-10-10T08:58:14Z</dc:date>
    </item>
    <item>
      <title>Re: RC 1 : search : strange search behaviour</title>
      <link>https://connect.hyland.com/t5/alfresco-archive/rc-1-search-strange-search-behaviour/m-p/4271#M567</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;SPAN&gt;Hi,&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;Thanks for reporting the issue.&amp;nbsp; We're attempting to reproduce but could do with some more detail.&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;Could you provide:&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;a) the text you've entered into the in-line editor &lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;b) the search criteria&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;Also, if you could provide an export file (.acp) of your data that would also help as we'll able to see the structure of your spaces etc.&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;Thanks.&lt;/SPAN&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Mon, 10 Oct 2005 09:41:42 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-archive/rc-1-search-strange-search-behaviour/m-p/4271#M567</guid>
      <dc:creator>davidc</dc:creator>
      <dc:date>2005-10-10T09:41:42Z</dc:date>
    </item>
    <item>
      <title>Re: RC 1 : search : strange search behaviour</title>
      <link>https://connect.hyland.com/t5/alfresco-archive/rc-1-search-strange-search-behaviour/m-p/4272#M568</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;SPAN&gt;Hi,&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;I have use company-relatively-secret document so I'm afraid I can't export my space, right now. If you can't reproduce the bug, I'll try to take them off.&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;My document is a two lines html document of french spoonerisms. Below is a cut and paste of the source. I can also send it by email if necessary.&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;&amp;lt;html&amp;gt;&amp;lt;head&amp;gt;&amp;lt;/head&amp;gt;&amp;lt;body&amp;gt;&amp;lt;p&amp;gt;&amp;lt;font size="2"&amp;gt;La &amp;lt;u&amp;gt;Ch&amp;lt;/u&amp;gt;ine se dresse ÃƒÂ&amp;nbsp; la vue des ni&amp;lt;u&amp;gt;pp&amp;lt;/u&amp;gt;ons&amp;lt;/font&amp;gt;&amp;lt;/p&amp;gt;&amp;lt;p&amp;gt;&amp;lt;font size="2"&amp;gt;&amp;lt;u&amp;gt;T&amp;lt;/u&amp;gt;aisez-vous, en &amp;lt;u&amp;gt;b&amp;lt;/u&amp;gt;as&amp;lt;/font&amp;gt;&amp;lt;/p&amp;gt;&amp;lt;p&amp;gt;&amp;lt;font size="2"&amp;gt;&amp;lt;/font&amp;gt;&amp;lt;/p&amp;gt;&amp;lt;/body&amp;gt;&amp;lt;/html&amp;gt;&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;The file looks like that.&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;La &lt;/SPAN&gt;&lt;SPAN style="text-decoration: underline;"&gt;Ch&lt;/SPAN&gt;&lt;SPAN&gt;ine se dresse ÃƒÂ&amp;nbsp; la vue des ni&lt;/SPAN&gt;&lt;SPAN style="text-decoration: underline;"&gt;pp&lt;/SPAN&gt;&lt;SPAN&gt;ons&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN style="text-decoration: underline;"&gt;T&lt;/SPAN&gt;&lt;SPAN&gt;aisez-vous, en &lt;/SPAN&gt;&lt;SPAN style="text-decoration: underline;"&gt;b&lt;/SPAN&gt;&lt;SPAN&gt;as&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;It is located in the company home of my repository.&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;The unsuccessful search was on "chine", the document contains "&lt;/SPAN&gt;&lt;SPAN style="text-decoration: underline;"&gt;Ch&lt;/SPAN&gt;&lt;SPAN&gt;ine". Unsuccessful too for "nippons", "taisez", "bas" which are the other underlined words.&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;Looking for "vue" or "dresse" retrieves the document.&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;HIH&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;Jerome BATON&lt;/SPAN&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Mon, 10 Oct 2005 10:04:32 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-archive/rc-1-search-strange-search-behaviour/m-p/4272#M568</guid>
      <dc:creator>jbaton</dc:creator>
      <dc:date>2005-10-10T10:04:32Z</dc:date>
    </item>
    <item>
      <title>Re: RC 1 : search : strange search behaviour</title>
      <link>https://connect.hyland.com/t5/alfresco-archive/rc-1-search-strange-search-behaviour/m-p/4273#M569</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;SPAN&gt;Hi David,&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;Well, I was a bit puzzled with the behaviour described above. &lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;You will notice that the search produces this type of results with word which typo is not the same on all letters.&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;Due to morning inspiration, I can tell you that &lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;STRONG&gt;traffic&lt;/STRONG&gt;&lt;SPAN&gt;jam causes the same results as the previous post's spoonerisms.&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;HIH&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;Can you reproduce the 'bug' ? &lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;I don't know if you are the lucene guy in your team but I think it is worth looking at the lucene buglist about the html-document-indexer&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;Jerome&lt;/SPAN&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Tue, 11 Oct 2005 08:46:18 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-archive/rc-1-search-strange-search-behaviour/m-p/4273#M569</guid>
      <dc:creator>jbaton</dc:creator>
      <dc:date>2005-10-11T08:46:18Z</dc:date>
    </item>
    <item>
      <title>Re: RC 1 : search : strange search behaviour</title>
      <link>https://connect.hyland.com/t5/alfresco-archive/rc-1-search-strange-search-behaviour/m-p/4274#M570</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;SPAN&gt;Jerome,&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;We can now reproduce the bug as you've reported.&amp;nbsp; The underlying html text extraction is at fault, so we'll be able to resolve this in a build soon.&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;Use &lt;/SPAN&gt;&lt;A href="http://www.alfresco.org/jira/browse/AR-163" rel="nofollow noopener noreferrer"&gt;http://www.alfresco.org/jira/browse/AR-163&lt;/A&gt;&lt;SPAN&gt; to track progress of fix.&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;Thanks.&lt;/SPAN&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Tue, 11 Oct 2005 09:00:24 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-archive/rc-1-search-strange-search-behaviour/m-p/4274#M570</guid>
      <dc:creator>davidc</dc:creator>
      <dc:date>2005-10-11T09:00:24Z</dc:date>
    </item>
  </channel>
</rss>

