<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: solr performance issues in Alfresco Archive</title>
    <link>https://connect.hyland.com/t5/alfresco-archive/solr-performance-issues/m-p/265369#M218499</link>
    <pubDate>Thu, 10 Jan 2013 14:56:09 GMT</pubDate>
    <dc:creator>aweber1nj</dc:creator>
    <dc:date>2013-01-10T14:56:09Z</dc:date>
    <item>
      <title>solr performance issues</title>
      <link>https://connect.hyland.com/t5/alfresco-archive/solr-performance-issues/m-p/265367#M218497</link>
      <description>I realize this is an overly-broad topic, but maybe if I open a thread we can get some good collaboration going on the situation… We have about 2.5mm nodes in our repo (virtually all of them are indexed by solr).&amp;nbsp; Many of these are actually folders, so they don't have any true-content per se.&amp;nbsp; The num</description>
      <pubDate>Wed, 09 Jan 2013 18:22:40 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-archive/solr-performance-issues/m-p/265367#M218497</guid>
      <dc:creator>aweber1nj</dc:creator>
      <dc:date>2013-01-09T18:22:40Z</dc:date>
    </item>
    <item>
      <title>Re: solr performance issues</title>
      <link>https://connect.hyland.com/t5/alfresco-archive/solr-performance-issues/m-p/265368#M218498</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;SPAN&gt;Hi&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;What version of Alfresco are you using?&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;Are all queries the same?&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;Have you checked that you are really using SOLR? (Does the SOLR admin statistics page show any calls made to the query handlers?)&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;What search examples cause the problem?&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;Is it the same for site-specific, all-sites, and repository-scoped queries?&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;Andy&lt;/SPAN&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Thu, 10 Jan 2013 14:35:56 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-archive/solr-performance-issues/m-p/265368#M218498</guid>
      <dc:creator>andy</dc:creator>
      <dc:date>2013-01-10T14:35:56Z</dc:date>
    </item>
    <item>
      <title>Re: solr performance issues</title>
      <link>https://connect.hyland.com/t5/alfresco-archive/solr-performance-issues/m-p/265369#M218499</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;SPAN&gt;Thanks for the reply, Andy…&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;Alfresco 4.0.e on CentOS6 (64bit), 4-cores, 12GB RAM (and that host is only running Solr, so it can technically use anything it wants).&amp;nbsp; The Alfresco host has the same hw/sw specs.&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;Not all queries are the same.&amp;nbsp; I try to search for a single word that I know is in roughly 100-200 nodes (folders + documents).&amp;nbsp; However, re-running the same query (over time, not back-to-back) yields almost identical times.&amp;nbsp; We are getting our timing from:&lt;/SPAN&gt;&lt;BR /&gt;&lt;PRE class="language-none line-numbers"&gt;&lt;CODE&gt;log4j.logger.org.alfresco.repo.search.impl.solr.SolrQueryHTTPClient=debug&lt;/CODE&gt;&lt;/PRE&gt;&lt;SPAN&gt;And we are testing these when no one else is on the system (and one query at a time).&amp;nbsp; Thus, we're almost positive these queries are going to Solr, and the Solr Admin Page does show cache statistics being updated.&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;As I think I mentioned, searches against a single, custom property that we have indexed return very fast.&amp;nbsp; It's when we try putting a value in the "Keywords" box in Advanced Search (even with additional properties), or when we run a "simple search" from the document library view of Share, that it completely drags.&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;We only have one Site in use at this time, and we're using Share to run the tests (except when we try to copy the alfresco-fts text to the Node Browser).&amp;nbsp; I honestly don't know how to re-scope the queries.&amp;nbsp; I believe Share is always defaulting to search the Site, but that's only a SWAG.&amp;nbsp; We never explicitly specify either way (but could, if you let me know how and you think it'll help troubleshoot).&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;Thanks again for your reply.&amp;nbsp; I hope this is something we can figure out and help everyone with.&amp;nbsp; Solr should be able to run these kinds of queries against this "library" in sub-second speed, but I'm not even looking for that.&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;Again, I will be happy to provide any further details and statistics if you think you can help!&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;-AJ&lt;/SPAN&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Thu, 10 Jan 2013 14:56:09 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-archive/solr-performance-issues/m-p/265369#M218499</guid>
      <dc:creator>aweber1nj</dc:creator>
      <dc:date>2013-01-10T14:56:09Z</dc:date>
    </item>
    <item>
      <title>Re: solr performance issues</title>
      <link>https://connect.hyland.com/t5/alfresco-archive/solr-performance-issues/m-p/265370#M218500</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;SPAN&gt;Hi&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;What is the query you enter into the search box?&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;Have you customized the Share search string?&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;Have you added any custom models?&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;My initial guess is an issue, since fixed, in which cross-language search strings of the kind used by the old Lucene implementation were generated against SOLR.&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;This built a large wildcard expression for the locale part of the token, with a scan across all terms of content.&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;You could upgrade to 4.2 to fix this (or use Enterprise).&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;There is no way to avoid this expansion in the config.&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;You could confirm this is the cause of your problems with a couple of stack dumps, if they show the query in wildcard expansion/term enumeration.&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;See ALF-15491: SOLR is generating queries for lucene style cross-language support&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;Andy&lt;/SPAN&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Thu, 10 Jan 2013 19:32:55 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-archive/solr-performance-issues/m-p/265370#M218500</guid>
      <dc:creator>andy</dc:creator>
      <dc:date>2013-01-10T19:32:55Z</dc:date>
    </item>
    <item>
      <title>Re: solr performance issues</title>
      <link>https://connect.hyland.com/t5/alfresco-archive/solr-performance-issues/m-p/265371#M218501</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;SPAN&gt;I PM'ed you earlier with some log entries that should indicate whether or not the query is of the "wrong flavour".&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;Yes, we have a custom model, and we did add three (I think) of the indexed properties to the search-box-string.&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;Is it possible for you to take a peek at the PM I sent and see if you recognize whether we are running into ALF-15491 as you theorize?&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;Thanks again,&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;AJ&lt;/SPAN&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Thu, 10 Jan 2013 20:04:23 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-archive/solr-performance-issues/m-p/265371#M218501</guid>
      <dc:creator>aweber1nj</dc:creator>
      <dc:date>2013-01-10T20:04:23Z</dc:date>
    </item>
    <item>
      <title>Re: solr performance issues</title>
      <link>https://connect.hyland.com/t5/alfresco-archive/solr-performance-issues/m-p/265372#M218502</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;SPAN&gt;Andy,&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;We tried some additional "self help" and plunged into some of the code (of which it appears you are the author in many cases).&amp;nbsp; We tried installing the 4.2 Solr standalone stack on our Solr host, but it would not run against the 4.0.e Alfresco install.&amp;nbsp; We also tried copying the 4.2 webscripts to the 4.0 Alfresco installation (in hopes that the 4.2 Solr install would then be able to track against the 4.0 install), but this didn't work either.&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;We are "snookered" at this point, as we've exhausted all our ideas to work around this bug.&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;Do you have any ideas that we could test to upgrade just the Solr piece of the pie?&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;Thanks again for the help,&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;AJ&lt;/SPAN&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Tue, 15 Jan 2013 18:56:13 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-archive/solr-performance-issues/m-p/265372#M218502</guid>
      <dc:creator>aweber1nj</dc:creator>
      <dc:date>2013-01-15T18:56:13Z</dc:date>
    </item>
    <item>
      <title>Re: solr performance issues</title>
      <link>https://connect.hyland.com/t5/alfresco-archive/solr-performance-issues/m-p/265373#M218503</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;SPAN&gt;Hi&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;I believe you are hitting the issue I mentioned.&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;The only option you have is to upgrade to 4.2 or go through support for the fix.&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;SOLR and Alfresco releases are normally in sync: bug fixes can affect both sides, even if they do not affect the API the two use to communicate.&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;Andy&lt;/SPAN&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Tue, 22 Jan 2013 14:54:03 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-archive/solr-performance-issues/m-p/265373#M218503</guid>
      <dc:creator>andy</dc:creator>
      <dc:date>2013-01-22T14:54:03Z</dc:date>
    </item>
  </channel>
</rss>

