<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic upgrade from 5.0.d to 5.1.e: pdfbox error in Alfresco Archive</title>
    <link>https://connect.hyland.com/t5/alfresco-archive/upgrade-from-5-0-d-to-5-1-e-pdfbox-error/m-p/288868#M241998</link>
    <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;SPAN&gt;Hi,&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;After an Alfresco update from 5.0.d to 5.1.e, everything seems to work fine except&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;that a pdfbox error prevent from indexing my pdf files:&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;PRE class="language-none line-numbers"&gt;&lt;CODE&gt;&lt;BR /&gt; ERROR [pdfbox.filter.FlateFilter] [http-bio-8443-exec-2] FlateFilter: stop reading corrupt stream due to a DataFormatException&lt;BR /&gt;&lt;SPAN class="line-numbers-rows"&gt;&lt;SPAN&gt;‍&lt;/SPAN&gt;&lt;SPAN&gt;‍&lt;/SPAN&gt;&lt;SPAN&gt;‍&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/CODE&gt;&lt;/PRE&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;Did somebody faced something similar ?&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;(I've seen that pdfbox fires this kind of message in case of 'out of memory', but server memory is not overloaded in my case)&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;Thanks for your advise,&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;Vincent&lt;/SPAN&gt;&lt;BR /&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
    <pubDate>Mon, 07 Mar 2016 10:37:36 GMT</pubDate>
    <dc:creator>vincent-kali</dc:creator>
    <dc:date>2016-03-07T10:37:36Z</dc:date>
    <item>
      <title>upgrade from 5.0.d to 5.1.e: pdfbox error</title>
      <link>https://connect.hyland.com/t5/alfresco-archive/upgrade-from-5-0-d-to-5-1-e-pdfbox-error/m-p/288868#M241998</link>
      <description>Hi,After an Alfresco update from 5.0.d to 5.1.e, everything seems to work fine exceptthat a pdfbox error prevent from indexing my pdf files: ERROR [pdfbox.filter.FlateFilter] [http-bio-8443-exec-2] FlateFilter: stop reading corrupt stream due to a DataFormatException‍‍‍Did somebody faced something s</description>
      <pubDate>Mon, 07 Mar 2016 10:37:36 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-archive/upgrade-from-5-0-d-to-5-1-e-pdfbox-error/m-p/288868#M241998</guid>
      <dc:creator>vincent-kali</dc:creator>
      <dc:date>2016-03-07T10:37:36Z</dc:date>
    </item>
    <item>
      <title>Re: upgrade from 5.0.d to 5.1.e: pdfbox error</title>
      <link>https://connect.hyland.com/t5/alfresco-archive/upgrade-from-5-0-d-to-5-1-e-pdfbox-error/m-p/288869#M241999</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;SPAN&gt;I have installed the latest version today and tried to migrate from 5.0d. I get a lot of this kind of messages and can't preview lots of documents too…&lt;/SPAN&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Wed, 18 May 2016 13:59:05 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-archive/upgrade-from-5-0-d-to-5-1-e-pdfbox-error/m-p/288869#M241999</guid>
      <dc:creator>talleyrand</dc:creator>
      <dc:date>2016-05-18T13:59:05Z</dc:date>
    </item>
    <item>
      <title>Re: upgrade from 5.0.d to 5.1.e: pdfbox error</title>
      <link>https://connect.hyland.com/t5/alfresco-archive/upgrade-from-5-0-d-to-5-1-e-pdfbox-error/m-p/288870#M242000</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;I'm hanting for an indexing issue that is causing OutOfMemory errors and I'm starting to suspect that the culprit is PDFBox . My instalaltion is alfresco 5.1g. It uses PDFBox-1.8.10 and I found an issue in Tika that suggests that this version might not be a very good one:&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;A class="link-titled" href="https://issues.apache.org/jira/browse/TIKA-1737" title="https://issues.apache.org/jira/browse/TIKA-1737" rel="nofollow noopener noreferrer"&gt;[TIKA-1737] PDFBox 1.8.10 is still a basket case - ASF JIRA&lt;/A&gt;.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;I made a memory dump and I'm trying to analyze it with Eclipse MAT, the "Leak Suspects" report suggests that 75% of the heap is&amp;nbsp;full of PdfBox's COSObjects that are being retained by a classloader.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Not sure how to interpret this but PDFBox seems to be in the middle.&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Thu, 08 Jun 2017 17:04:07 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-archive/upgrade-from-5-0-d-to-5-1-e-pdfbox-error/m-p/288870#M242000</guid>
      <dc:creator>iblanco</dc:creator>
      <dc:date>2017-06-08T17:04:07Z</dc:date>
    </item>
    <item>
      <title>Re: upgrade from 5.0.d to 5.1.e: pdfbox error</title>
      <link>https://connect.hyland.com/t5/alfresco-archive/upgrade-from-5-0-d-to-5-1-e-pdfbox-error/m-p/288871#M242001</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Yes, I also have the same issue.&lt;/P&gt;&lt;P&gt;I had this problem in 5.0 and I continue having it in 5.1.e.&lt;/P&gt;&lt;P&gt;However I don't think&amp;nbsp;there is a pdfbox&amp;nbsp;version (alfresco-patched)&amp;nbsp;more recent than 1.8.10. Meaning we have to continue using this version, right?&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;I'll try to dig a bit more&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Wed, 14 Jun 2017 08:05:57 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-archive/upgrade-from-5-0-d-to-5-1-e-pdfbox-error/m-p/288871#M242001</guid>
      <dc:creator>mauro1855</dc:creator>
      <dc:date>2017-06-14T08:05:57Z</dc:date>
    </item>
  </channel>
</rss>

