07-18-2008 06:08 AM
07-18-2008 05:39 PM
07-18-2008 09:06 PM
07-24-2008 03:22 AM
Alfresco tends to be I/O bound, so it's best to have a high bandwidth, low latency disk subsystem for Alfresco to use (eg. RAID or a SAN) as well as a high bandwidth, low latency connection to the relational database that Alfresco is configured to use. Larger RAM can help here, to give the OS more room for caches, but most modern CPUs (dual core, 2+Ghz etc.) are more than enough for Alfresco (Alfresco doesn't perform very many computationally expensive operations - most operations are disk and database I/O intensive instead).
Generally Alfresco is much heavier on the CPU than the DB - adding a faster CPU/more cores will generally give you a lot better performance increase than changing the DB.
07-25-2008 11:59 PM
I have assumed that Alfresco performance comes down to hibernate + database performanceThat is my experience, but as I mentioned earlier performance is intricately tied to any number of factors.
1. is there anything special about the way Alfresco uses hibernate which affects general principles applicable to hibernate based apps?Alfresco does not do anything particularly special with Hibernate, other than programmatically forcing flushes during certain large operations (to avoid Hibernate running out of memory in the session cache - Hibernate places no upper bound on the session cache, so for large transactions it can consume all heap then fail with out of memory errors). I believe this primarily occurs in some of the WCM related operations (eg. promote-to-staging), but may also happen within some large DM operations as well (although I'm not 100% sure of that - it should be fairly evident from the code exactly where this flushing occurs).
2. which database performs best (MySQL, Postgres etc) with Alfresco and/or hibernate?I don't know if anyone has performed an empirical comparison of performance with Alfresco running on different database servers, but such an exercise would be extremely valuable. What I can say is that MySQL is probably the most widely deployed database for Alfresco, so it's likely to be the most highly tuned. FWIW Alfresco engineering went through a performance tuning exercise for the non-MySQL databases in the 2.0 timeframe, so it's not like they haven't been tuned at all.
3. if you start with an affordable system (eg quad core Xeon, 4 GB RAM, a couple of 15K SAS drives), where is the bottleneck likely to be?To echo my earlier disclaimer, it's impossible to predict with any real certainty what the performance characteristics of any given configuration / system architecture will be. In other words I would suggest empirically determining the bottleneck, rather than trying to guess what it is and risk architecting a solution based on a false assumption.
Tags
Find what you came for
We want to make your experience in Hyland Connect as valuable as possible, so we put together some helpful links.