<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic 201702 solr6 not indexing all documents in Alfresco Forum</title>
    <link>https://connect.hyland.com/t5/alfresco-forum/201702-solr6-not-indexing-all-documents/m-p/38415#M16155</link>
    <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;I've just upgraded my previous alfresco 50d instance to 201702, and also switched to solr6.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Left the new instance indexing over the weekend, but on morning morning when I checked (after almost 36 hours), it had only indexed perhaps 12% of documents, and from previous experience: (1) the CPU usage is not high enough to indicate it is continuing the indexing (it is indexing new documents, but no the existing/old ones), and (2) 36 hours was more than enough time for solr4 to reindex 90+% of the documents when I upgraded from alfresco 4.2 to 502d.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;As example, on the previous alfresco 50d instance, my solr4 indexes were taking up 235GB of disk space, alfresco core had 3880462 current documents, and archive core had 2036546 documents.&amp;nbsp; Currently, the new solr6 index is only taking up 3.5gB of disk space, whilst alfresco core has only 475244 current documents, and archive 432393.&amp;nbsp; Unless solr6 is super-efficient to almost 2 orders of magnitude, I think I have a serious index issue!&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;I don't see any errors in the solr6.log nor solr-8983-console.log, related to this.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Anybody encountered similar issues?&amp;nbsp; If by the end of the week the indexing hasn't substancially increased in document count, then I will probably have to switch back to solr4.&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
    <pubDate>Mon, 10 Apr 2017 04:27:02 GMT</pubDate>
    <dc:creator>xarope</dc:creator>
    <dc:date>2017-04-10T04:27:02Z</dc:date>
    <item>
      <title>201702 solr6 not indexing all documents</title>
      <link>https://connect.hyland.com/t5/alfresco-forum/201702-solr6-not-indexing-all-documents/m-p/38415#M16155</link>
      <description>I've just upgraded my previous alfresco 50d instance to 201702, and also switched to solr6.Left the new instance indexing over the weekend, but on morning morning when I checked (after almost 36 hours), it had only indexed perhaps 12% of documents, and from previous experience: (1) the CPU usage is</description>
      <pubDate>Mon, 10 Apr 2017 04:27:02 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-forum/201702-solr6-not-indexing-all-documents/m-p/38415#M16155</guid>
      <dc:creator>xarope</dc:creator>
      <dc:date>2017-04-10T04:27:02Z</dc:date>
    </item>
    <item>
      <title>Re: 201702 solr6 not indexing all documents</title>
      <link>https://connect.hyland.com/t5/alfresco-forum/201702-solr6-not-indexing-all-documents/m-p/38416#M16156</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;I also ran a solr6 report (i.e. &lt;A class="link-titled" href="http://sgsolr03:8983/solr/admin/cores?action=REPORT&amp;amp;amp;wt=xml" title="http://sgsolr03:8983/solr/admin/cores?action=REPORT&amp;amp;amp;wt=xml" rel="nofollow noopener noreferrer"&gt;http://&amp;lt;solrserver&amp;gt;:8983/solr/admin/cores?action=REPORT&amp;amp;amp;wt=xml&lt;/A&gt;&amp;nbsp;), here is the alfresco core section:&lt;/P&gt;&lt;BLOCKQUOTE class="jive_macro_quote jive-quote jive_text_macro"&gt;&lt;P&gt;&amp;lt;long name="DB acl transaction count"&amp;gt;8229&amp;lt;/long&amp;gt;&lt;BR /&gt;&amp;lt;long name="Count of duplicated acl transactions in the index"&amp;gt;0&amp;lt;/long&amp;gt;&lt;BR /&gt;&amp;lt;long name="Count of acl transactions in the index but not the DB"&amp;gt;0&amp;lt;/long&amp;gt;&lt;BR /&gt;&amp;lt;long name="Count of missing acl transactions from the Index"&amp;gt;0&amp;lt;/long&amp;gt;&lt;BR /&gt;&amp;lt;long name="Index acl transaction count"&amp;gt;8231&amp;lt;/long&amp;gt;&lt;BR /&gt;&amp;lt;long name="Index unique acl transaction count"&amp;gt;8231&amp;lt;/long&amp;gt;&lt;BR /&gt;&amp;lt;long name="Last indexed change set commit time"&amp;gt;1491820680654&amp;lt;/long&amp;gt;&lt;BR /&gt;&amp;lt;str name="Last indexed change set commit date"&amp;gt;2017-04-10T18:38:00&amp;lt;/str&amp;gt;&lt;BR /&gt;&amp;lt;long name="Last changeset id before holes"&amp;gt;-1&amp;lt;/long&amp;gt;&lt;BR /&gt;&amp;lt;long name="DB transaction count"&amp;gt;1372926&amp;lt;/long&amp;gt;&lt;BR /&gt;&amp;lt;long name="Count of duplicated transactions in the index"&amp;gt;0&amp;lt;/long&amp;gt;&lt;BR /&gt;&amp;lt;long name="Count of transactions in the index but not the DB"&amp;gt;134&amp;lt;/long&amp;gt;&lt;BR /&gt;&amp;lt;long name="First transaction in the index but not the DB"&amp;gt;815448&amp;lt;/long&amp;gt;&lt;BR /&gt;&amp;lt;long name="Count of missing transactions from the Index"&amp;gt;1054&amp;lt;/long&amp;gt;&lt;BR /&gt;&amp;lt;long name="First transaction missing from the Index"&amp;gt;4920501&amp;lt;/long&amp;gt;&lt;BR /&gt;&amp;lt;long name="Index transaction count"&amp;gt;179183&amp;lt;/long&amp;gt;&lt;BR /&gt;&amp;lt;long name="Index unique transaction count"&amp;gt;179183&amp;lt;/long&amp;gt;&lt;BR /&gt;&amp;lt;long name="Index node count"&amp;gt;243985&amp;lt;/long&amp;gt;&lt;BR /&gt;&amp;lt;long name="Count of duplicate nodes in the index"&amp;gt;71&amp;lt;/long&amp;gt;&lt;BR /&gt;&amp;lt;long name="First duplicate node id in the index"&amp;gt;4772300&amp;lt;/long&amp;gt;&lt;BR /&gt;&amp;lt;long name="Index error count"&amp;gt;4&amp;lt;/long&amp;gt;&lt;BR /&gt;&amp;lt;long name="Count of duplicate error docs in the index"&amp;gt;0&amp;lt;/long&amp;gt;&lt;BR /&gt;&amp;lt;long name="Index unindexed count"&amp;gt;7236&amp;lt;/long&amp;gt;&lt;BR /&gt;&amp;lt;long name="Count of duplicate unindexed docs in the index"&amp;gt;0&amp;lt;/long&amp;gt;&lt;BR /&gt;&amp;lt;long name="Last indexed transaction commit time"&amp;gt;1491877238167&amp;lt;/long&amp;gt;&lt;BR /&gt;&amp;lt;str name="Last indexed transaction commit date"&amp;gt;2017-04-11T10:20:38&amp;lt;/str&amp;gt;&lt;/P&gt;&lt;/BLOCKQUOTE&gt;&lt;P&gt;Based on the documentation in &lt;A class="link-titled" href="http://docs.alfresco.com/5.2/concepts/solr-unindex.html" title="http://docs.alfresco.com/5.2/concepts/solr-unindex.html" rel="nofollow noopener noreferrer"&gt;Unindexed Solr Transactions | Alfresco Documentation&lt;/A&gt;, seems like there are index errors, e.g. transaction 815448 ("First transaction in the index but not the DB") and 4920501 ("First transaction missing from the Index").&amp;nbsp; More digging around solr6 documentation to see if I can understand why!&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Tue, 11 Apr 2017 04:59:56 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-forum/201702-solr6-not-indexing-all-documents/m-p/38416#M16156</guid>
      <dc:creator>xarope</dc:creator>
      <dc:date>2017-04-11T04:59:56Z</dc:date>
    </item>
    <item>
      <title>Re: 201702 solr6 not indexing all documents</title>
      <link>https://connect.hyland.com/t5/alfresco-forum/201702-solr6-not-indexing-all-documents/m-p/38417#M16157</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Since I couldn't figure this out, I snapshot'd and created a new database and alfresco+solr instance, and installed solr4.&amp;nbsp; In less than 24 hours, solr4 has indexed 1259637 documents in the alfresco core, and 1158783 documents in the archive core.&lt;/P&gt;&lt;P&gt;If I use a psql statement (I'm using postgresql) to get the number of "content" objects, "SELECT count(*) FROM alf_node AS a, alf_qname AS q WHERE a.type_qname_id=q.id AND q.local_name='content';", I get 1443451.&amp;nbsp; So that's pretty close.&amp;nbsp; But I still can't understand why solr6 is stuck, now more than 4 days, with the same ~400K of documents (and not 1.4M).&lt;/P&gt;&lt;P&gt;Looks like I will be switching back to solr4 over this weekend.&amp;nbsp; I hope others have better luck with solr6.&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Wed, 12 Apr 2017 06:48:31 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-forum/201702-solr6-not-indexing-all-documents/m-p/38417#M16157</guid>
      <dc:creator>xarope</dc:creator>
      <dc:date>2017-04-12T06:48:31Z</dc:date>
    </item>
  </channel>
</rss>

