<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Alf 32r2 - Pdfbox - Stop reading corrupt stream in Alfresco Archive</title>
    <link>https://connect.hyland.com/t5/alfresco-archive/alf-32r2-pdfbox-stop-reading-corrupt-stream/m-p/221128#M174258</link>
    <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;SPAN&gt;I'm also seeing this error, on Community Head revision 18722, and if Apache PDFbox gives the same problem, guess this needs some upstream help.&lt;/SPAN&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
    <pubDate>Fri, 19 Feb 2010 12:21:05 GMT</pubDate>
    <dc:creator>deepestblue</dc:creator>
    <dc:date>2010-02-19T12:21:05Z</dc:date>
    <item>
      <title>Alf 32r2 - Pdfbox - Stop reading corrupt stream</title>
      <link>https://connect.hyland.com/t5/alfresco-archive/alf-32r2-pdfbox-stop-reading-corrupt-stream/m-p/221123#M174253</link>
      <description>Hello,I get a error message when I am uploading some PDF in Alfresco (with Mysql, 3.2r2)…ERROR [pdfbox.filter.FlateFilter] Stop reading corrupt stream….‍‍‍Looking in the src of pdfbox I have found :…}&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; catch (OutOfMemoryError exception) &amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; {&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;</description>
      <pubDate>Thu, 24 Dec 2009 09:28:25 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-archive/alf-32r2-pdfbox-stop-reading-corrupt-stream/m-p/221123#M174253</guid>
      <dc:creator>dranakan</dc:creator>
      <dc:date>2009-12-24T09:28:25Z</dc:date>
    </item>
    <item>
      <title>Re: Alf 32r2 - Pdfbox - Stop reading corrupt stream</title>
      <link>https://connect.hyland.com/t5/alfresco-archive/alf-32r2-pdfbox-stop-reading-corrupt-stream/m-p/221124#M174254</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;SPAN&gt;I also get the same error while edit wiki.&lt;/SPAN&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Wed, 06 Jan 2010 08:13:56 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-archive/alf-32r2-pdfbox-stop-reading-corrupt-stream/m-p/221124#M174254</guid>
      <dc:creator>neozone</dc:creator>
      <dc:date>2010-01-06T08:13:56Z</dc:date>
    </item>
    <item>
      <title>Re: Alf 32r2 - Pdfbox - Stop reading corrupt stream</title>
      <link>https://connect.hyland.com/t5/alfresco-archive/alf-32r2-pdfbox-stop-reading-corrupt-stream/m-p/221125#M174255</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;SPAN&gt;I get this message as well.&amp;nbsp; I have found that the error occurs when PDFs with an incompatible encoding.&amp;nbsp; Don't ask me specifically which encoding, because I haven't been able to figure that out help.&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;For example, if I scan a doc from our copier and send it to Alfresco, all is well.&amp;nbsp; The PDF is indexed and a thumbnail is created in the Share site.&amp;nbsp; If I open that PDF with Adobe Acrobat, make a change and re-save it, Alfresco throws an exception when I then move that file into the Share site.&amp;nbsp; No thumbnail is created.&amp;nbsp; In prior versions of Alfresco (&amp;lt; 3.2R2), Alfresco would eventually run out of memory if too many of these incompatible PDFs were encountered.&amp;nbsp; This doesn't happen now, but we still see those exceptions.&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;Ben&lt;/SPAN&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Wed, 06 Jan 2010 21:34:10 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-archive/alf-32r2-pdfbox-stop-reading-corrupt-stream/m-p/221125#M174255</guid>
      <dc:creator>benswitzer</dc:creator>
      <dc:date>2010-01-06T21:34:10Z</dc:date>
    </item>
    <item>
      <title>Re: Alf 32r2 - Pdfbox - Stop reading corrupt stream</title>
      <link>https://connect.hyland.com/t5/alfresco-archive/alf-32r2-pdfbox-stop-reading-corrupt-stream/m-p/221126#M174256</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;SPAN&gt;Thanks you.&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;Yes, the problem should result from the PDF File.&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;Does anyone know a way to check if a PDF is wrong ? (and indicates what is wrong)&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;Thanks&lt;/SPAN&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Mon, 11 Jan 2010 10:53:14 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-archive/alf-32r2-pdfbox-stop-reading-corrupt-stream/m-p/221126#M174256</guid>
      <dc:creator>dranakan</dc:creator>
      <dc:date>2010-01-11T10:53:14Z</dc:date>
    </item>
    <item>
      <title>Re: Alf 32r2 - Pdfbox - Stop reading corrupt stream</title>
      <link>https://connect.hyland.com/t5/alfresco-archive/alf-32r2-pdfbox-stop-reading-corrupt-stream/m-p/221127#M174257</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;SPAN&gt;Hi friends,&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;i just found that on my Alfresco setup this error eror occured on 13 of some 200 random PDF documents, so may i join the club?&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;Seriously, i consider this an major problem, for two reasons:&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;- As far as i understand PDFBOX, the decoding of the faulty PDFs is terminated at some random point WITH NO ERROR INDICATED TO THE CALLING CONVERTER, as the exceptions in org.apache.pdfbox.filter.FlateFilter are caught and converted into that innocent log message. Imagine your CxO not finding that important business report from last year for that reason… guess who gets kicked ass….&amp;nbsp; &lt;img id="smileysurprised" class="emoticon emoticon-smileysurprised" src="https://connect.hyland.com/i/smilies/16x16_smiley-surprised.png" alt="Smiley Surprised" title="Smiley Surprised" /&gt; &lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;- and when i saw that &lt;/SPAN&gt;&lt;EM&gt;OutOfMemoryException &lt;/EM&gt;&lt;SPAN&gt;caught in PDFBOX, i'd liked to bang my head against the wall! WHEN I HAVE AN OUTOFMEMORYEXCEPTION IN MY APPLICATION, I WANT TO KNOW THAT!! I really have to know that,&amp;nbsp; since the continued operation of my Alfresco is seriously in danger… arghhhh!&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;Well, i tried my luck with the current 1.0 snapshot from pdfbox.apache.org, but this was no better, so i'll propose to replace the PDFBOX converter with some external commandline tool…. i'll gonna post the configuration once it is working!&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;Cheers&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;Gyro&lt;/SPAN&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Wed, 17 Feb 2010 21:10:58 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-archive/alf-32r2-pdfbox-stop-reading-corrupt-stream/m-p/221127#M174257</guid>
      <dc:creator>gyro_gearless</dc:creator>
      <dc:date>2010-02-17T21:10:58Z</dc:date>
    </item>
    <item>
      <title>Re: Alf 32r2 - Pdfbox - Stop reading corrupt stream</title>
      <link>https://connect.hyland.com/t5/alfresco-archive/alf-32r2-pdfbox-stop-reading-corrupt-stream/m-p/221128#M174258</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;SPAN&gt;I'm also seeing this error, on Community Head revision 18722, and if Apache PDFbox gives the same problem, guess this needs some upstream help.&lt;/SPAN&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Fri, 19 Feb 2010 12:21:05 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-archive/alf-32r2-pdfbox-stop-reading-corrupt-stream/m-p/221128#M174258</guid>
      <dc:creator>deepestblue</dc:creator>
      <dc:date>2010-02-19T12:21:05Z</dc:date>
    </item>
    <item>
      <title>Re: Alf 32r2 - Pdfbox - Stop reading corrupt stream</title>
      <link>https://connect.hyland.com/t5/alfresco-archive/alf-32r2-pdfbox-stop-reading-corrupt-stream/m-p/221129#M174259</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;SPAN&gt;Hello,&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;Alfresco use my CPU to 100% from several days. I suspect a problem with this :&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;jstack (show java process)&lt;/SPAN&gt;&lt;BR /&gt;&lt;PRE class="language-none line-numbers"&gt;&lt;CODE&gt;&lt;BR /&gt;"DefaultScheduler_Worker-3" prio=10 tid=0x08f8b400 nid=0x86b runnable [0x62b82000]&lt;BR /&gt;&amp;nbsp;&amp;nbsp; java.lang.Thread.State: RUNNABLE&lt;BR /&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;at sun.nio.ch.ChannelInputStream.read(ChannelInputStream.java:92)&lt;BR /&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;at sun.nio.ch.ChannelInputStream.read(ChannelInputStream.java:86)&lt;BR /&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;- locked &amp;lt;0x71801578&amp;gt; (a sun.nio.ch.ChannelInputStream)&lt;BR /&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;at java.io.BufferedInputStream.read1(BufferedInputStream.java:256)&lt;BR /&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;at java.io.BufferedInputStream.read(BufferedInputStream.java:317)&lt;BR /&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;- locked &amp;lt;0x718055a0&amp;gt; (a java.io.BufferedInputStream)&lt;BR /&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;at java.io.BufferedInputStream.read1(BufferedInputStream.java:256)&lt;BR /&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;at java.io.BufferedInputStream.read(BufferedInputStream.java:317)&lt;BR /&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;- locked &amp;lt;0x718055c0&amp;gt; (a java.io.BufferedInputStream)&lt;BR /&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;at java.io.BufferedInputStream.fill(BufferedInputStream.java:218)&lt;BR /&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;at java.io.BufferedInputStream.read(BufferedInputStream.java:237)&lt;BR /&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;- locked &amp;lt;0x718055e0&amp;gt; (a java.io.BufferedInputStream)&lt;BR /&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;at java.io.FilterInputStream.read(FilterInputStream.java:66)&lt;BR /&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;at java.io.PushbackInputStream.read(PushbackInputStream.java:122)&lt;BR /&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;at org.apache.pdfbox.io.PushBackInputStream.read(PushBackInputStream.java:84)&lt;BR /&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;at org.apache.pdfbox.pdfparser.BaseParser.parseCOSDictionary(BaseParser.java:200)&lt;BR /&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;at org.apache.pdfbox.pdfparser.BaseParser.parseDirObject(BaseParser.java:870)&lt;BR /&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;at org.apache.pdfbox.pdfparser.BaseParser.parseCOSDictionaryValue(BaseParser.java:141)&lt;BR /&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;at org.apache.pdfbox.pdfparser.BaseParser.parseCOSDictionary(BaseParser.java:213)&lt;BR /&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;at org.apache.pdfbox.pdfparser.BaseParser.parseDirObject(BaseParser.java:870)&lt;BR /&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;at org.apache.pdfbox.pdfparser.PDFParser.parseObject(PDFParser.java:519)&lt;BR /&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;at org.apache.pdfbox.pdfparser.PDFParser.parse(PDFParser.java:179)&lt;BR /&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:841)&lt;BR /&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:808)&lt;BR /&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;at org.alfresco.repo.content.transform.PdfBoxContentTransformer.transformInternal(PdfBoxContentTransformer.java:74)&lt;BR /&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;at org.alfresco.repo.content.transform.AbstractContentTransformer2.transform(AbstractContentTransformer2.java:167)&lt;BR /&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;at org.alfresco.repo.content.transform.AbstractContentTransformer2.transform(AbstractContentTransformer2.java:143)&lt;BR /&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;at org.alfresco.repo.search.impl.lucene.ADMLuceneIndexerImpl.indexProperty(ADMLuceneIndexerImpl.java:948)&lt;BR /&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;at org.alfresco.repo.search.impl.lucene.ADMLuceneIndexerImpl.createDocumentsImpl(ADMLuceneIndexerImpl.java:625)&lt;BR /&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;at org.alfresco.repo.search.impl.lucene.ADMLuceneIndexerImpl.createDocuments(ADMLuceneIndexerImpl.java:590)&lt;BR /&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;at org.alfresco.repo.search.impl.lucene.ADMLuceneIndexerImpl.updateFullTextSearch(ADMLuceneIndexerImpl.java:1569)&lt;BR /&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;at org.alfresco.repo.search.impl.lucene.fts.FullTextSearchIndexerImpl.index(FullTextSearchIndexerImpl.java:190)&lt;BR /&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)&lt;BR /&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)&lt;BR /&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)&lt;BR /&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;at java.lang.reflect.Method.invoke(Method.java:597)&lt;BR /&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;at org.springframework.aop.support.AopUtils.invokeJoinpointUsingReflection(AopUtils.java:304)&lt;BR /&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;at org.springframework.aop.framework.ReflectiveMethodInvocation.invokeJoinpoint(ReflectiveMethodInvocation.java:182)&lt;BR /&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;at org.springframework.aop.framework.ReflectiveMethodInvocation.proceed(ReflectiveMethodInvocation.java:149)&lt;BR /&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;at org.springframework.transaction.interceptor.TransactionInterceptor.invoke(TransactionInterceptor.java:106)&lt;BR /&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;at org.springframework.aop.framework.ReflectiveMethodInvocation.proceed(ReflectiveMethodInvocation.java:171)&lt;BR /&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;at org.springframework.aop.framework.JdkDynamicAopProxy.invoke(JdkDynamicAopProxy.java:204)&lt;BR /&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;at $Proxy70.index(Unknown Source)&lt;BR /&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;at org.alfresco.repo.search.impl.lucene.fts.FTSIndexerJob.execute(FTSIndexerJob.java:52)&lt;BR /&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;at org.quartz.core.JobRunShell.run(JobRunShell.java:202)&lt;BR /&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;at org.quartz.simpl.SimpleThreadPool$WorkerThread.run(SimpleThreadPool.java:529)&lt;BR /&gt;&lt;SPAN class="line-numbers-rows"&gt;&lt;SPAN&gt;‍&lt;/SPAN&gt;&lt;SPAN&gt;‍&lt;/SPAN&gt;&lt;SPAN&gt;‍&lt;/SPAN&gt;&lt;SPAN&gt;‍&lt;/SPAN&gt;&lt;SPAN&gt;‍&lt;/SPAN&gt;&lt;SPAN&gt;‍&lt;/SPAN&gt;&lt;SPAN&gt;‍&lt;/SPAN&gt;&lt;SPAN&gt;‍&lt;/SPAN&gt;&lt;SPAN&gt;‍&lt;/SPAN&gt;&lt;SPAN&gt;‍&lt;/SPAN&gt;&lt;SPAN&gt;‍&lt;/SPAN&gt;&lt;SPAN&gt;‍&lt;/SPAN&gt;&lt;SPAN&gt;‍&lt;/SPAN&gt;&lt;SPAN&gt;‍&lt;/SPAN&gt;&lt;SPAN&gt;‍&lt;/SPAN&gt;&lt;SPAN&gt;‍&lt;/SPAN&gt;&lt;SPAN&gt;‍&lt;/SPAN&gt;&lt;SPAN&gt;‍&lt;/SPAN&gt;&lt;SPAN&gt;‍&lt;/SPAN&gt;&lt;SPAN&gt;‍&lt;/SPAN&gt;&lt;SPAN&gt;‍&lt;/SPAN&gt;&lt;SPAN&gt;‍&lt;/SPAN&gt;&lt;SPAN&gt;‍&lt;/SPAN&gt;&lt;SPAN&gt;‍&lt;/SPAN&gt;&lt;SPAN&gt;‍&lt;/SPAN&gt;&lt;SPAN&gt;‍&lt;/SPAN&gt;&lt;SPAN&gt;‍&lt;/SPAN&gt;&lt;SPAN&gt;‍&lt;/SPAN&gt;&lt;SPAN&gt;‍&lt;/SPAN&gt;&lt;SPAN&gt;‍&lt;/SPAN&gt;&lt;SPAN&gt;‍&lt;/SPAN&gt;&lt;SPAN&gt;‍&lt;/SPAN&gt;&lt;SPAN&gt;‍&lt;/SPAN&gt;&lt;SPAN&gt;‍&lt;/SPAN&gt;&lt;SPAN&gt;‍&lt;/SPAN&gt;&lt;SPAN&gt;‍&lt;/SPAN&gt;&lt;SPAN&gt;‍&lt;/SPAN&gt;&lt;SPAN&gt;‍&lt;/SPAN&gt;&lt;SPAN&gt;‍&lt;/SPAN&gt;&lt;SPAN&gt;‍&lt;/SPAN&gt;&lt;SPAN&gt;‍&lt;/SPAN&gt;&lt;SPAN&gt;‍&lt;/SPAN&gt;&lt;SPAN&gt;‍&lt;/SPAN&gt;&lt;SPAN&gt;‍&lt;/SPAN&gt;&lt;SPAN&gt;‍&lt;/SPAN&gt;&lt;SPAN&gt;‍&lt;/SPAN&gt;&lt;SPAN&gt;‍&lt;/SPAN&gt;&lt;SPAN&gt;‍&lt;/SPAN&gt;&lt;SPAN&gt;‍&lt;/SPAN&gt;&lt;SPAN&gt;‍&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/CODE&gt;&lt;/PRE&gt;&lt;BR /&gt;&lt;SPAN&gt;Do you have same problem ?&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;(I have post my general problem here : &lt;/SPAN&gt;&lt;A href="http://forums.alfresco.com/en/viewtopic.php?f=8&amp;amp;t=21348#p82506" rel="nofollow noopener noreferrer"&gt;http://forums.alfresco.com/en/viewtopic.php?f=8&amp;amp;t=21348#p82506&lt;/A&gt;&lt;SPAN&gt;)&lt;/SPAN&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Tue, 02 Mar 2010 07:26:25 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-archive/alf-32r2-pdfbox-stop-reading-corrupt-stream/m-p/221129#M174259</guid>
      <dc:creator>dranakan</dc:creator>
      <dc:date>2010-03-02T07:26:25Z</dc:date>
    </item>
    <item>
      <title>Re: Alf 32r2 - Pdfbox - Stop reading corrupt stream</title>
      <link>https://connect.hyland.com/t5/alfresco-archive/alf-32r2-pdfbox-stop-reading-corrupt-stream/m-p/221130#M174260</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;SPAN&gt;Looks like an issue has been reported: &lt;/SPAN&gt;&lt;A href="https://issues.alfresco.com/jira/browse/ALF-1493" rel="nofollow noopener noreferrer"&gt;https://issues.alfresco.com/jira/browse/ALF-1493&lt;/A&gt;&lt;SPAN&gt;.&amp;nbsp; Looks like a possible fix may be to drop in the latest version of pdfbox (1.1.0).&lt;/SPAN&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Wed, 14 Apr 2010 15:56:45 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-archive/alf-32r2-pdfbox-stop-reading-corrupt-stream/m-p/221130#M174260</guid>
      <dc:creator>opoplawski</dc:creator>
      <dc:date>2010-04-14T15:56:45Z</dc:date>
    </item>
    <item>
      <title>Re: Alf 32r2 - Pdfbox - Stop reading corrupt stream</title>
      <link>https://connect.hyland.com/t5/alfresco-archive/alf-32r2-pdfbox-stop-reading-corrupt-stream/m-p/221131#M174261</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;SPAN&gt;I don't think 1.1.0 helps. I am a new user of 3.3g and encounter exactly the same problem. My install came with pdfbox-1.1.0.jar out of the box… or is it jar? - sorry&amp;nbsp; &lt;img id="smileyhappy" class="emoticon emoticon-smileyhappy" src="https://connect.hyland.com/i/smilies/16x16_smiley-happy.png" alt="Smiley Happy" title="Smiley Happy" /&gt; &lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;Most disturbing. Any help much appreciated.&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;Update:&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;I have also increased the lucene.indexer.maxfieldlength value to 1000000 and still get the problem.&amp;nbsp; :x&lt;/SPAN&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Mon, 25 Oct 2010 07:47:47 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-archive/alf-32r2-pdfbox-stop-reading-corrupt-stream/m-p/221131#M174261</guid>
      <dc:creator>slowlearner</dc:creator>
      <dc:date>2010-10-25T07:47:47Z</dc:date>
    </item>
    <item>
      <title>Re: Alf 32r2 - Pdfbox - Stop reading corrupt stream</title>
      <link>https://connect.hyland.com/t5/alfresco-archive/alf-32r2-pdfbox-stop-reading-corrupt-stream/m-p/221132#M174262</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;SPAN&gt;We had good success by replacing the original PDFBox 0.8 with a current 1.2 version.&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;Previously, we had 79 PDFs that where not indexed, after the upgrade and reindexing only 10 remained unindexed! And eventually these 10 proved to be corrupt, for example there were JPEGs saved as PDF and the like &lt;span class="lia-unicode-emoji" title=":slightly_smiling_face:"&gt;🙂&lt;/span&gt; &lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;Cheers&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;Gyro&lt;/SPAN&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Mon, 25 Oct 2010 10:20:04 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-archive/alf-32r2-pdfbox-stop-reading-corrupt-stream/m-p/221132#M174262</guid>
      <dc:creator>gyro_gearless</dc:creator>
      <dc:date>2010-10-25T10:20:04Z</dc:date>
    </item>
    <item>
      <title>Re: Alf 32r2 - Pdfbox - Stop reading corrupt stream</title>
      <link>https://connect.hyland.com/t5/alfresco-archive/alf-32r2-pdfbox-stop-reading-corrupt-stream/m-p/221133#M174263</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;SPAN&gt;Hallo, thanks for the advice and apologies for taking so long to reply. I do now have 1.2 installed and yet the problem persists. The files that don't get indexed properly aren't corrupted in sense of not being pdf files, they seem normal enough. But i don't close the door on the input being somehow implicated… will look out for patterns. It does appear (subject to confirmation) that my problem pdf files (i.e. not properly indexed) are all from one source so far. Will keep this thread posted as i find out more.&lt;/SPAN&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Sat, 30 Oct 2010 07:18:48 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-archive/alf-32r2-pdfbox-stop-reading-corrupt-stream/m-p/221133#M174263</guid>
      <dc:creator>slowlearner</dc:creator>
      <dc:date>2010-10-30T07:18:48Z</dc:date>
    </item>
    <item>
      <title>Re: Alf 32r2 - Pdfbox - Stop reading corrupt stream</title>
      <link>https://connect.hyland.com/t5/alfresco-archive/alf-32r2-pdfbox-stop-reading-corrupt-stream/m-p/221134#M174264</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;SPAN&gt;Unfortunately i am still no closer to getting this resolved. So far…&lt;/SPAN&gt;&lt;BR /&gt;&lt;UL&gt;Updated pdfbox to 1.2.1 - Check&lt;BR /&gt;Increased lucene.indexer.maxFieldLength - Check&lt;BR /&gt;Recovered index from scratch on startup - Check&lt;/UL&gt;&lt;SPAN&gt;… and yet i can index only a few of the pdf documents uploaded. The problem docs come from various sources and are in all other respects, perfectly valid pdf documents.&lt;/SPAN&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Sat, 06 Nov 2010 06:42:36 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-archive/alf-32r2-pdfbox-stop-reading-corrupt-stream/m-p/221134#M174264</guid>
      <dc:creator>slowlearner</dc:creator>
      <dc:date>2010-11-06T06:42:36Z</dc:date>
    </item>
    <item>
      <title>Re: Alf 32r2 - Pdfbox - Stop reading corrupt stream</title>
      <link>https://connect.hyland.com/t5/alfresco-archive/alf-32r2-pdfbox-stop-reading-corrupt-stream/m-p/221135#M174265</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;SPAN&gt;Has anybody come up with a solution?&amp;nbsp; I read another post where the person had success actually upgrading pdfbox.jar.&amp;nbsp; In this post, however, somebody has not had success with it.&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;If there is any Alfresco Engineer out there who is reading this, please help and advise.&lt;/SPAN&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Mon, 04 Apr 2011 20:53:53 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-archive/alf-32r2-pdfbox-stop-reading-corrupt-stream/m-p/221135#M174265</guid>
      <dc:creator>lucille_arkenst</dc:creator>
      <dc:date>2011-04-04T20:53:53Z</dc:date>
    </item>
    <item>
      <title>Re: Alf 32r2 - Pdfbox - Stop reading corrupt stream</title>
      <link>https://connect.hyland.com/t5/alfresco-archive/alf-32r2-pdfbox-stop-reading-corrupt-stream/m-p/221136#M174266</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;SPAN&gt;I am getting this in 4.0.d&lt;/SPAN&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Tue, 27 Nov 2012 10:50:57 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-archive/alf-32r2-pdfbox-stop-reading-corrupt-stream/m-p/221136#M174266</guid>
      <dc:creator>sharifu</dc:creator>
      <dc:date>2012-11-27T10:50:57Z</dc:date>
    </item>
  </channel>
</rss>

