Other File Systems with Alfresco - HadoopFS?

ashpal19
Champ in-the-making
Hello,

As I understand it, Alfresco currently uses NTFS to store files. My questions are:
1) Can we integrate Alfresco with the Hadoop File System (HDFS) to store the actual documents?

2) If not HDFS, can Alfresco integrate with any other storage system?

Thank you for your help. Please feel free to provide additional suggestions.

2 REPLIES

mrogers
Star Contributor
No, it's not specific to NTFS; Alfresco uses the OS filesystem. You can use anything that mounts as, or looks like, a filesystem.

It's also possible to integrate with other content stores. For example, I think Alfresco Enterprise comes with an Amazon S3 store.

I can't see a Hadoop store in the Alfresco add-ons, sorry.

nickburch
Confirmed Champ
Writing your own Content Store implementation isn't too hard, so producing an HDFS-backed one wouldn't take that much time.
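
To give a rough idea of the shape of such a store, here is a minimal sketch. Everything in it is an assumption made for illustration: SimpleContentStore is a hypothetical, cut-down interface and not Alfresco's actual ContentStore API, the store:// prefix and the .bin suffix are made up, and a local directory stands in for HDFS. An HDFS-backed version would swap the java.nio.file calls for the Hadoop client equivalents.

```java
import java.io.IOException;
import java.io.UncheckedIOException;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.UUID;

// Hypothetical, simplified interface for illustration only.
// This is NOT Alfresco's real ContentStore API.
interface SimpleContentStore {
    String write(byte[] content);       // stores content, returns its URL
    byte[] read(String contentUrl);     // loads content by URL
    boolean exists(String contentUrl);
    boolean delete(String contentUrl);  // true if something was removed
}

// Local-directory implementation, standing in for an HDFS-backed one.
class DirectoryContentStore implements SimpleContentStore {
    private static final String PROTOCOL = "store://"; // made-up URL prefix
    private final Path root;

    DirectoryContentStore(Path root) {
        this.root = root;
    }

    @Override
    public String write(byte[] content) {
        // A random name keeps URLs unique without any coordination.
        String name = UUID.randomUUID() + ".bin";
        try {
            Files.createDirectories(root);
            Files.write(root.resolve(name), content);
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return PROTOCOL + name;
    }

    @Override
    public byte[] read(String contentUrl) {
        try {
            return Files.readAllBytes(toPath(contentUrl));
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    @Override
    public boolean exists(String contentUrl) {
        return Files.exists(toPath(contentUrl));
    }

    @Override
    public boolean delete(String contentUrl) {
        try {
            return Files.deleteIfExists(toPath(contentUrl));
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    // Maps a content URL back to the file that holds its bytes.
    private Path toPath(String contentUrl) {
        if (!contentUrl.startsWith(PROTOCOL)) {
            throw new IllegalArgumentException("Not a store URL: " + contentUrl);
        }
        return root.resolve(contentUrl.substring(PROTOCOL.length()));
    }
}
```

The point of the sketch is that the repository only ever sees opaque content URLs, so the bytes behind them can live on whatever backend the store chooses.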

Your big issue is that HDFS normally uses quite a large block size by default (IIRC 64 MB), so if you're storing small files you'll either want to implement your own way of putting multiple files into one HDFS block, or accept lots of wasted space. Dropping the block size isn't generally recommended, as that puts too much pressure on the name node. If you do store multiple small files in one HDFS block, think carefully about how you do it, so that other tools you might run across the files (e.g. MapReduce jobs) can still work with them easily.
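
One possible way to put several small files into one container is to append them back to back and keep an index of offsets and lengths. The sketch below is only an illustration under stated assumptions: SmallFilePacker and its methods are hypothetical names, and an in-memory buffer stands in for the HDFS file. The blocksNeeded helper shows the arithmetic behind the packing argument, assuming each stored file would otherwise occupy at least one block of its own.

```java
import java.io.ByteArrayOutputStream;
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical helper: packs many small files into one container and
// remembers where each one starts. The in-memory buffer is a stand-in
// for a single large file on HDFS.
class SmallFilePacker {

    // Location of one packed file inside the container.
    private static final class Entry {
        final int offset;
        final int length;

        Entry(int offset, int length) {
            this.offset = offset;
            this.length = length;
        }
    }

    private final ByteArrayOutputStream container = new ByteArrayOutputStream();
    private final Map<String, Entry> index = new LinkedHashMap<>();

    // Appends the content and records its offset and length under the name.
    void add(String name, byte[] content) {
        if (index.containsKey(name)) {
            throw new IllegalArgumentException("Duplicate name: " + name);
        }
        index.put(name, new Entry(container.size(), content.length));
        container.write(content, 0, content.length);
    }

    // Returns a copy of the bytes stored under the given name.
    byte[] read(String name) {
        Entry e = index.get(name);
        if (e == null) {
            throw new IllegalArgumentException("Unknown name: " + name);
        }
        byte[] all = container.toByteArray();
        return Arrays.copyOfRange(all, e.offset, e.offset + e.length);
    }

    // Total number of bytes packed so far.
    int size() {
        return container.size();
    }

    // Ceiling division: blocks needed to hold totalBytes of packed data.
    static long blocksNeeded(long totalBytes, long blockSize) {
        if (totalBytes < 0 || blockSize <= 0) {
            throw new IllegalArgumentException("Invalid sizes");
        }
        return (totalBytes + blockSize - 1) / blockSize;
    }
}
```

As a worked example, one million files of 10 KB each would mean one million separate blocks for the name node to track if stored individually, whereas blocksNeeded(10_240L * 1_000_000L, 64L * 1024 * 1024) gives 153 blocks when the same data is packed. A plain offset index like this is easy to build, but anything else that reads the container, such as a MapReduce job, would need to understand the same index.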