<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Large XLS files in Alfresco Archive</title>
    <link>https://connect.hyland.com/t5/alfresco-archive/large-xls-files/m-p/9578#M3449</link>
    <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;BLOCKQUOTE class="jive-quote"&gt;(from email)&lt;BR /&gt;While I was evaluating your Alfresco ECM release candidate version 1, I encountered an error while trying to upload a large (200mb+ excel file).&amp;nbsp; You may already be aware of this, but I just wanted to let you know.&amp;nbsp; I wonder if this error is caused by the database I am using rather than your software (I am using MySQL at this point, but when our company implements an ECM solution, we will be using oracle database).&amp;nbsp; &lt;BR /&gt;&lt;BR /&gt;javax.faces.FacesException: Error calling action method of component with id add-content-upload-end:_id24&lt;BR /&gt;caused by:&lt;BR /&gt;javax.faces.el.EvaluationException: Exception while invoking expression #{AddContentWizard.next}&lt;BR /&gt;caused by:&lt;BR /&gt;&lt;STRONG&gt;java.lang.OutOfMemoryError: Java heap space&lt;/STRONG&gt;&lt;/BLOCKQUOTE&gt;&lt;BR /&gt;&lt;SPAN&gt;Hi,&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt; &lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;The text extraction used by the indexing is provided by the POI libraries.&amp;nbsp; There is, unfortunately, no way to stream-convert the XLS document: it has to be loaded into memory and manipulated in memory.&amp;nbsp; I can't really say how much memory you will need based on document size.&amp;nbsp; My initial guess is that N*2 MB would be a minimum.&amp;nbsp; This would be over and above memory required for the usuals processing.&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt; &lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;If you have an alternative library available for extracting text from an XLS document, or if you wish to bypass indexing of the XLS documents, then feel free to recommend it to us.&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;The Open Office converter could also be used for XLS files larger than a given size, but this is slower than the POI libraries.&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;Regards&lt;/SPAN&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
    <pubDate>Mon, 31 Oct 2005 11:03:48 GMT</pubDate>
    <dc:creator>derek</dc:creator>
    <dc:date>2005-10-31T11:03:48Z</dc:date>
    <item>
      <title>Large XLS files</title>
      <link>https://connect.hyland.com/t5/alfresco-archive/large-xls-files/m-p/9578#M3449</link>
      <description>(from email)While I was evaluating your Alfresco ECM release candidate version 1, I encountered an error while trying to upload a large (200mb+ excel file).&amp;nbsp; You may already be aware of this, but I just wanted to let you know.&amp;nbsp; I wonder if this error is caused by the database I am using rather than</description>
      <pubDate>Mon, 31 Oct 2005 11:03:48 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-archive/large-xls-files/m-p/9578#M3449</guid>
      <dc:creator>derek</dc:creator>
      <dc:date>2005-10-31T11:03:48Z</dc:date>
    </item>
    <item>
      <title>Re: Large XLS files</title>
      <link>https://connect.hyland.com/t5/alfresco-archive/large-xls-files/m-p/9579#M3450</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;SPAN&gt;This has been raised for implementation: &lt;/SPAN&gt;&lt;A href="http://www.alfresco.org/jira/browse/AR-205" rel="nofollow noopener noreferrer"&gt;http://www.alfresco.org/jira/browse/AR-205&lt;/A&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Mon, 31 Oct 2005 12:10:51 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-archive/large-xls-files/m-p/9579#M3450</guid>
      <dc:creator>derek</dc:creator>
      <dc:date>2005-10-31T12:10:51Z</dc:date>
    </item>
  </channel>
</rss>

