data migration alfresco community3.2

robobot
Champ in-the-making
hi,
Maybe this problem was already handled somewhere here; sorry, but I couldn't find anything.
Does anybody have a hint as to the best way to migrate a large amount of data from Alfresco version 2 to Alfresco Community 3.2?
The only idea I have at the moment is to do this via the web service, as follows:

- retrieve all files in the repository
- download all files to the local file system
- re-upload the files to the new destination

there are no further requirements.

thanks for any suggestion!
steve
8 REPLIES 8

derek
Star Contributor
I presume that a simple upgrade of the WAR file and fix-up of any customizations didn't work …

robobot
Champ in-the-making
Yes, replacing the WAR file did not work. :(

robobot
Champ in-the-making
I'm having problems when trying to get a large amount of data.
I use the following query to fetch all documents in the folder "Invoices":

PATH:"//app:company_home/app:user_homes/cm:myProjects/cm:Invoices//." AND TYPE:\{http\://www.alfresco.org/model/content/1.0\}content

Node [] nodes = WebServiceFactory.getRepositoryService().get(new Predicate(null,STORE,query));

If there are 100 or 200 documents in the Invoices folder, everything works fine. If there are 5500 documents in the folder, the query does not work: I get just one document back. I can't find an explanation for this.

What goes wrong here, and what can I do?

thanks in advance!

robobot
Champ in-the-making
Here is a piece of the code I use. The problem still isn't solved: if there are many documents in the folder, the web service seems to have a problem with the path query.



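import java.rmi.RemoteException;
import java.util.LinkedList;
import java.util.List;

import org.alfresco.webservice.repository.RepositoryFault;
import org.alfresco.webservice.types.NamedValue;
import org.alfresco.webservice.types.Node;
import org.alfresco.webservice.types.Predicate;
import org.alfresco.webservice.types.Query;
import org.alfresco.webservice.util.Constants;
import org.alfresco.webservice.util.WebServiceFactory;

// STORE and DMSData are defined elsewhere in my code.
public List<DMSData> getDocuments(final String queryString) {
    final List<DMSData> results = new LinkedList<DMSData>();
    final Query query = new Query(Constants.QUERY_LANG_LUCENE, queryString);

    Node[] nodes = null;
    try {
        // Resolve every node matching the Lucene query in a single call.
        nodes = WebServiceFactory.getRepositoryService().get(new Predicate(null, STORE, query));
    } catch (RepositoryFault e) {
        e.printStackTrace();
        return results;
    } catch (RemoteException e) {
        e.printStackTrace();
        return results;
    }

    // No results found.
    if (nodes == null) {
        return results;
    }

    for (final Node row : nodes) {
        // One DMSData per node, filled from the node's properties.
        final DMSData dmsData = new DMSData();

        for (final NamedValue namedValue : row.getProperties()) {
            final String name = namedValue.getName();
            final String value = namedValue.getValue();
            dmsData.setValue(value);
            ….
        }
        results.add(dmsData);
    }
    return results;
}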
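
Nothing in this thread explains why the big query comes back with a single node, so the following is only a possible workaround, not a confirmed fix: fetch the documents in smaller batches instead of in one call. This is a minimal sketch of the batching loop only. `BatchSource` and `BatchedFetch` are hypothetical names, not Alfresco types; how one batch maps onto the web service (for example one query per subfolder or per date range) is an assumption left open here.

```java
import java.util.ArrayList;
import java.util.List;

/** Hypothetical abstraction for "give me one batch of results"; not an Alfresco type. */
interface BatchSource<T> {
    /** Returns up to {@code limit} items starting at {@code offset}; empty or null means no more results. */
    List<T> fetch(int offset, int limit);
}

final class BatchedFetch {
    private BatchedFetch() {}

    /** Collects all items from the source by requesting them batch by batch. */
    static <T> List<T> fetchAll(BatchSource<T> source, int batchSize) {
        if (batchSize <= 0) {
            throw new IllegalArgumentException("batchSize must be positive");
        }
        final List<T> all = new ArrayList<>();
        int offset = 0;
        while (true) {
            final List<T> batch = source.fetch(offset, batchSize);
            // Stop only on an empty batch; a short batch does not prove we reached the end.
            if (batch == null || batch.isEmpty()) {
                break;
            }
            all.addAll(batch);
            offset += batch.size();
        }
        return all;
    }
}
```

With something like this, `getDocuments` would be called once per batch and the partial lists merged, so that no single web service call has to return thousands of nodes.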

robobot
Champ in-the-making
..help yourself or nobody helps you.

Luckily I have just discovered the export/import functions in the web administration console. I hope this is the way to a nice closing time.

ra74
Champ in-the-making
Hi, I have a similar problem.

1. Using web services is not a good idea, as there's no guarantee that the query returns all the documents, and there's no way to detect when it doesn't.
Besides, what about old versions, if you're using versioning? And sooner or later errors like socket timeouts occur.

2. Export/import: officially, export/import is not supported between releases. Because of the large amount of data I tried to export the folder directly to the filesystem, but it didn't work. Besides, I suppose export of old versions is not supported.

3. Upgrading to 3.2 should work. It takes a few days on my computer, mainly because the versioning implementation is new and the version history has to be migrated.
In the end it failed due to a strange Hibernate error, but I no longer have time to work on it.
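
On point 1, a silently incomplete result can at least be noticed after the fact if a listing of the expected documents is available from somewhere other than the search itself. That availability is an assumption; nothing in this thread says where such a listing would come from. A minimal sketch of the comparison, with the hypothetical helper `MigrationCheck` operating on plain path strings:

```java
import java.util.Collection;
import java.util.HashSet;
import java.util.Set;
import java.util.TreeSet;

final class MigrationCheck {
    private MigrationCheck() {}

    /** Returns the source paths that have no counterpart in the target, in sorted order. */
    static Set<String> missingInTarget(Collection<String> sourcePaths, Collection<String> targetPaths) {
        // Hash the target paths once so each lookup is constant time.
        final Set<String> target = new HashSet<>(targetPaths);
        final Set<String> missing = new TreeSet<>();
        for (final String path : sourcePaths) {
            if (!target.contains(path)) {
                missing.add(path);
            }
        }
        return missing;
    }
}
```

An empty result only shows that every listed path arrived; it says nothing about old versions or metadata.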

robobot
Champ in-the-making
hmpf,
Exporting the data (select the folder to export -> administration console -> export)
did not work. After waiting over 1 h, the process failed with unknown errors.
But hope against hope.

Let's see if we get on with this:
http://wiki.alfresco.com/wiki/Backup_and_Restore

groutal
Champ in-the-making
Maybe using a specialized/specific tool would work.

What you can use is an ETL tool. There are open source tools available for free. Talend Open Studio is an open source ETL tool for data integration and migration experts. It's easy to learn for a non-technical user. What distinguishes Talend, when it comes to business users, is the tMap component. It allows the user to get a graphical and functional view of integration processes.

For more information: http://www.talend.com/