<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Problem with Content Indexing (full text search) in Alfresco Archive</title>
    <link>https://connect.hyland.com/t5/alfresco-archive/problem-with-content-indexing-full-text-search/m-p/291344#M244474</link>
    <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;SPAN&gt;Hi Mits,&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;thank you very much for your response.&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;In the past in this repository we have imported thousands of documents using a bulk upload but this is not the case.&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;The document I am talking about was imported with a process that involved only 10 documents, so this transaction was relatively small and all the others 9 documents has the content indexed correctly).&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;With this query FTS: TYPE:"my:customBaseType" AND NOT TEXT:"*" I can find out all the documents without content in the index but this is not useful at all because it give out for example all pdf containing images, and I have thousands in the repository.&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
    <pubDate>Mon, 20 Jan 2014 15:26:44 GMT</pubDate>
    <dc:creator>castgroupteam</dc:creator>
    <dc:date>2014-01-20T15:26:44Z</dc:date>
    <item>
      <title>Problem with Content Indexing (full text search)</title>
      <link>https://connect.hyland.com/t5/alfresco-archive/problem-with-content-indexing-full-text-search/m-p/291342#M244472</link>
      <description>Hi all,I have an Alfresco CE 4.0.e (using solr) installed on a production environment.The problem is that I found out some pdf documents which has no content indexed with the consequence that Full text search, for those documents, is not working.Im sure it's not a problem of transformation from pdf</description>
      <pubDate>Thu, 16 Jan 2014 10:34:28 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-archive/problem-with-content-indexing-full-text-search/m-p/291342#M244472</guid>
      <dc:creator>castgroupteam</dc:creator>
      <dc:date>2014-01-16T10:34:28Z</dc:date>
    </item>
    <item>
      <title>Re: Problem with Content Indexing (full text search)</title>
      <link>https://connect.hyland.com/t5/alfresco-archive/problem-with-content-indexing-full-text-search/m-p/291343#M244473</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;SPAN&gt;This kind of issue generally occures when you have imported large amount of data in alfresco using bulk upload in that case solr take some time for sync up and during that time interval all those documents are non searchable. As you are using CE you will not be able to figure out which are transactions failed during indexing you need to go for re-indexing for solr.&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;Did you find any other error related to indexing in your solr logs or alfresco logs?&lt;/SPAN&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Mon, 20 Jan 2014 09:27:05 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-archive/problem-with-content-indexing-full-text-search/m-p/291343#M244473</guid>
      <dc:creator>mitpatoliya</dc:creator>
      <dc:date>2014-01-20T09:27:05Z</dc:date>
    </item>
    <item>
      <title>Re: Problem with Content Indexing (full text search)</title>
      <link>https://connect.hyland.com/t5/alfresco-archive/problem-with-content-indexing-full-text-search/m-p/291344#M244474</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;SPAN&gt;Hi Mits,&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;thank you very much for your response.&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;In the past in this repository we have imported thousands of documents using a bulk upload but this is not the case.&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;The document I am talking about was imported with a process that involved only 10 documents, so this transaction was relatively small and all the others 9 documents has the content indexed correctly).&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;With this query FTS: TYPE:"my:customBaseType" AND NOT TEXT:"*" I can find out all the documents without content in the index but this is not useful at all because it give out for example all pdf containing images, and I have thousands in the repository.&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Mon, 20 Jan 2014 15:26:44 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-archive/problem-with-content-indexing-full-text-search/m-p/291344#M244474</guid>
      <dc:creator>castgroupteam</dc:creator>
      <dc:date>2014-01-20T15:26:44Z</dc:date>
    </item>
  </channel>
</rss>

