<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Setup for import 1 mio files in Alfresco Archive</title>
    <link>https://connect.hyland.com/t5/alfresco-archive/setup-for-import-1-mio-files/m-p/307386#M260516</link>
    <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;SPAN&gt;Hi,&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;I want to import several million files into Alfresco. The file names are numerical values like 1000001, 1000002, etc. The files are later accessed by their file name only. The directory in which they are stored in Alfresco is irrelevant for later access from outside Alfresco. Alfresco is accessed via CMIS.&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;Is there a performance difference between the following two configurations?&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;(1) all files are stored in one directory, e.g. /base_dir/1000001, /base_dir/1000002, …&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;(2) the files are stored with an additional intermediate subdirectory derived from the last digit: &lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp; /base_dir/00/1000000, /base_dir/00/1000010, /base_dir/00/1000020, …&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp; /base_dir/01/1000001, /base_dir/01/1000011, /base_dir/01/1000021, …&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp; /base_dir/02/1000002, /base_dir/02/1000012, /base_dir/02/1000022, …&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp; …&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;Does the creation of files in an Alfresco directory (like /base_dir) work sequentially or concurrently?&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;If it is sequential with respect to one directory, would the creation in different directories (e.g. /base_dir/00 and /base_dir/01) work concurrently?&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;Would configuration (2) be faster than configuration (1) when filled by multiple threads?&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;Regards&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;U.Straub&lt;/SPAN&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
    <pubDate>Mon, 12 Aug 2013 10:18:07 GMT</pubDate>
    <dc:creator>ustraub</dc:creator>
    <dc:date>2013-08-12T10:18:07Z</dc:date>
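    <!--
A minimal sketch of configuration (2) from the question above: each numeric file name is mapped to a subfolder named after its last digit, and one task per subfolder is handed to a thread pool. The names sharded_path, group_by_shard and upload_all, and the upload callable, are hypothetical and only illustrate the layout; no Alfresco or CMIS API is called here, and whether this layout is actually faster would have to be measured against the repository.

```python
from collections import defaultdict
from concurrent.futures import ThreadPoolExecutor

BASE_DIR = "/base_dir"  # taken from the example paths in the question


def sharded_path(file_name, base_dir=BASE_DIR):
    """Map a numeric file name to base_dir/0D/name, where D is its last digit."""
    if not file_name.isdigit():
        raise ValueError(f"expected a numeric file name, got {file_name!r}")
    shard = file_name[-1].zfill(2)  # "1000001" gives "01"
    return f"{base_dir}/{shard}/{file_name}"


def group_by_shard(file_names):
    """Group target paths by subfolder so that each task owns one folder."""
    groups = defaultdict(list)
    for name in file_names:
        path = sharded_path(name)
        folder = path.rsplit("/", 1)[0]
        groups[folder].append(path)
    return dict(groups)


def upload_all(file_names, upload, workers=10):
    """Call the hypothetical upload(path) for every file, one folder per task.

    Returns the number of paths passed to upload.
    """
    groups = group_by_shard(file_names)

    def run(paths):
        for path in paths:
            upload(path)
        return len(paths)

    with ThreadPoolExecutor(max_workers=workers) as pool:
        return sum(pool.map(run, groups.values()))
```

With ten possible last digits there are at most ten folders, so workers=10 matches the "10 threads" discussed in the replies; upload would be replaced by whatever client call creates the document.
    -->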
    <item>
      <title>Setup for import 1 mio files</title>
      <link>https://connect.hyland.com/t5/alfresco-archive/setup-for-import-1-mio-files/m-p/307386#M260516</link>
      <description>Hi, I want to import several million files into Alfresco. The file names are numerical values like 1000001, 1000002, etc. The files are later accessed by their file name only. The directory in which they are stored in Alfresco is irrelevant for later access from outside Alfresco. Alfresco i</description>
      <pubDate>Mon, 12 Aug 2013 10:18:07 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-archive/setup-for-import-1-mio-files/m-p/307386#M260516</guid>
      <dc:creator>ustraub</dc:creator>
      <dc:date>2013-08-12T10:18:07Z</dc:date>
    </item>
    <item>
      <title>Re: Setup for import 1 mio files</title>
      <link>https://connect.hyland.com/t5/alfresco-archive/setup-for-import-1-mio-files/m-p/307387#M260517</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;SPAN&gt;Some answers.&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;Content ingestion is concurrent with retries.&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;Yes, multiple threads will help.&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;You may also want to look at the bulk upload tool.&lt;/SPAN&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Mon, 12 Aug 2013 10:43:56 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-archive/setup-for-import-1-mio-files/m-p/307387#M260517</guid>
      <dc:creator>mrogers</dc:creator>
      <dc:date>2013-08-12T10:43:56Z</dc:date>
    </item>
    <item>
      <title>Re: Setup for import 1 mio files</title>
      <link>https://connect.hyland.com/t5/alfresco-archive/setup-for-import-1-mio-files/m-p/307388#M260518</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;SPAN&gt;Hi,&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;What do you mean by "Content ingestion is concurrent with retries."?&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;If I understand you correctly, your recommendation is to use configuration (2) with 10 threads? Theoretically it should be at best 10 times faster than configuration (1) with 10 threads (in practice a factor of 5 would be great), is that correct? &lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;Configuration (1) with 1 thread would be as fast as with 10 threads, correct?&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;What performance do you expect with your bulk upload tool? &lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;Currently we need 0.3 sec per uploaded file with configuration (1) and 1 thread (and it gets worse with more threads).&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;Regards&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;U.Straub&lt;/SPAN&gt;&lt;BR /&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Mon, 12 Aug 2013 11:22:05 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-archive/setup-for-import-1-mio-files/m-p/307388#M260518</guid>
      <dc:creator>ustraub</dc:creator>
      <dc:date>2013-08-12T11:22:05Z</dc:date>
    </item>
    <item>
      <title>Re: Setup for import 1 mio files</title>
      <link>https://connect.hyland.com/t5/alfresco-archive/setup-for-import-1-mio-files/m-p/307389#M260519</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;SPAN&gt;Hi&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;In addition to the bulk upload tool, there is the Alfresco ETL Connector for Talend, which uses an ETL-optimized version of the native bulk import services (per-file transaction &amp;amp; error code…):&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;A href="http://knowledge.openwide.fr/Main/AlfrescoETLConnector" rel="nofollow noopener noreferrer"&gt;http://knowledge.openwide.fr/Main/AlfrescoETLConnector&lt;/A&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;Regards&lt;/SPAN&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Wed, 09 Oct 2013 11:16:50 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-archive/setup-for-import-1-mio-files/m-p/307389#M260519</guid>
      <dc:creator>mdutoo</dc:creator>
      <dc:date>2013-10-09T11:16:50Z</dc:date>
    </item>
    <item>
      <title>Re: Setup for import 1 mio files</title>
      <link>https://connect.hyland.com/t5/alfresco-archive/setup-for-import-1-mio-files/m-p/307390#M260520</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;SPAN&gt;There is also migration-center, which can import files from the filesystem into Alfresco. You can download a free version here:&lt;/SPAN&gt;&lt;BR /&gt;&lt;A href="http://www.migration-center.com/free-versions/" rel="nofollow noopener noreferrer"&gt;http://www.migration-center.com/free-versions/&lt;/A&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;Thanks,&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;Jens&lt;/SPAN&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Fri, 11 Oct 2013 07:28:19 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-archive/setup-for-import-1-mio-files/m-p/307390#M260520</guid>
      <dc:creator>jego</dc:creator>
      <dc:date>2013-10-11T07:28:19Z</dc:date>
    </item>
    <item>
      <title>Re: Setup for import 1 mio files</title>
      <link>https://connect.hyland.com/t5/alfresco-archive/setup-for-import-1-mio-files/m-p/307391#M260521</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;SPAN&gt;The second approach is mandatory, no question about it.&lt;/SPAN&gt;&lt;BR /&gt;&lt;A href="http://forums.alfresco.com/forum/developer-discussions/technical-architecture-discussion/folder-structure-alfresco-10112013" rel="nofollow noopener noreferrer"&gt;http://forums.alfresco.com/forum/developer-discussions/technical-architecture-discussion/folder-structure-alfresco-10112013&lt;/A&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Tue, 22 Oct 2013 17:35:00 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-archive/setup-for-import-1-mio-files/m-p/307391#M260521</guid>
      <dc:creator>lista</dc:creator>
      <dc:date>2013-10-22T17:35:00Z</dc:date>
    </item>
  </channel>
</rss>

