
How to migrate millions of images into Alfresco?

dynamolalit
Champ on-the-rise
Hi,

I have a legacy system holding millions of images (JPEG, GIF, bitmap, etc.), each around 250 KB, which I need to migrate to an Alfresco repository. I have the following questions:
1. What is the best strategy to perform this kind of big migration?
2. Can I use the Alfresco bulk import tool (http://code.google.com/p/alfresco-bulk-filesystem-import/) for this? Does it scale to this magnitude, or should I use some other tool?
3. My main concern is performance at this volume. Once the content is migrated to Alfresco, how should I classify these images so that performance does not suffer? I am broadly assuming that, for optimum performance, one Alfresco repository folder should contain around 2000 content items.

Please help.

Regards,
Lalit.
2 REPLIES

mrogers
Star Contributor
Bulk import is probably the way to go.   

You need to be clearer about what your requirements and concerns are regarding performance. I don't think this rule of thumb for folder content applies to more recent versions of Alfresco.
What's far more important is how the content is organised and accessed, and whether you expect your users to page through thousands of documents.
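
To make the point about organisation concrete, here is a minimal sketch of one common approach: spreading files across a folder tree derived from a hash of the file name, so that no single folder grows without bound. This is not part of the bulk import tool. The function name `bucketed_path`, the choice of SHA-1, and the two-level layout are all assumptions made for illustration, and it presumes the files are staged on disk in this layout before import and that the import preserves the source directory structure.

```python
import hashlib
from pathlib import PurePosixPath


def bucketed_path(filename: str, levels: int = 2, width: int = 2) -> PurePosixPath:
    """Return a relative path that places `filename` under hash-derived folders.

    Hypothetical helper: with levels=2 and width=2 there are
    256 * 256 = 65,536 possible leaf folders, and files spread
    roughly evenly across them.
    """
    # Hash only the name, so the same file always maps to the same folder.
    digest = hashlib.sha1(filename.encode("utf-8")).hexdigest()
    # Take consecutive slices of the hex digest as folder names.
    parts = [digest[i * width:(i + 1) * width] for i in range(levels)]
    return PurePosixPath(*parts, filename)


if __name__ == "__main__":
    # Prints something of the form "xx/yy/img_000123.jpg".
    print(bucketed_path("img_000123.jpg"))
```

A hash-based layout is easy to compute but meaningless to people browsing the repository. If users navigate by date, customer, or some other business key, building the tree from that key may fit the access pattern better.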

dynamolalit
Champ on-the-rise
Thanks Rogers,

But is the bulk import tool supported or recommended by Alfresco? As far as I can see it is a community-driven tool, so can I get support for it if I need it?

My concerns about performance have two parts. First, can the Alfresco repository handle this many content items, and how should I tune Alfresco for this kind of load (millions of images) with respect to hardware, RAM, file system I/O, etc.? Also, can SOLR make a difference?

Second, if the rule of thumb (about 2000 content items per folder) no longer holds, what are the current guidelines? How should I classify content items then? What is the best way to organize and access content in this case? I expect users to browse through these records at times.

P.S. Could you please point me to the DevCon2012 presentations, now that the event has taken place?

Regards.
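
On the folder-sizing question above, the arithmetic can be sketched as follows. The figure of 5,000,000 images is an assumption (the thread only says "millions"), the fan-out of 256 subfolders per level is an arbitrary illustrative choice, and `levels_needed` is a hypothetical helper. It also assumes items are spread evenly across leaf folders. The 2000-item cap is the rule of thumb from the question, which may not apply to recent Alfresco versions.

```python
import math


def levels_needed(total_items: int, max_per_folder: int, fanout: int) -> int:
    """Number of folder levels so that each leaf holds <= max_per_folder items.

    Assumes items are distributed evenly across leaf folders.
    """
    # Minimum number of leaf folders required to respect the cap.
    leaf_folders = math.ceil(total_items / max_per_folder)
    levels = 0
    capacity = 1  # number of leaf folders available at the current depth
    while capacity < leaf_folders:
        capacity *= fanout
        levels += 1
    return levels


if __name__ == "__main__":
    total = 5_000_000  # assumed; the thread only says "millions"
    depth = levels_needed(total, max_per_folder=2000, fanout=256)
    leaves = 256 ** depth
    print(depth, leaves, round(total / leaves, 1))
```

Under these assumptions, 2,500 leaf folders are required, one level of 256 is not enough, and two levels give 65,536 leaf folders with about 76 items each, well under the cap.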