10-21-2016 06:46 AM
My content store size is continuously increasing , I uploaded 60MB of around 240 pdf files , but my present content store folder size is 4.5GB
10-21-2016 08:26 AM
Which Alfresco version and edition are you using?
Could you please share your infrastructure architecture and configuration?
(File system, DBMS, alfresco-global.properties)
10-24-2016 02:46 AM
Hai openpj,
Thanks for the reply.
I am using Alfresco-community-5.1.f,
I installed Alfresco in Redhat 6.3 OS,
1) I configured OCR with that alfresco
2) i did Some customization in that alfresco
Below is the alfresco-global.properties
###############################
## Common Alfresco Properties #
###############################
dir.root=/contentmgt/alfresco/alf_data
alfresco.context=alfresco
alfresco.host=127.0.0.1
alfresco.port=9090
alfresco.protocol=http
share.context=share
share.host=127.0.0.1
share.port=9090
share.protocol=http
### database connection properties ###
db.driver=org.postgresql.Driver
db.username=alfresco
db.password=admin
db.name=alfresco
db.url=jdbcostgresql://localhost:5432/${db.name}
# Note: your database must also be able to accept at least this many connections. Please see your database documentation for instructions on how to configure this.
db.pool.max=275
db.pool.validate.query=SELECT 1
# The server mode. Set value here
# UNKNOWN | TEST | BACKUP | PRODUCTION
system.serverMode=UNKNOWN
### FTP Server Configuration ###
### RMI registry port for JMX ###
alfresco.rmi.services.port=50500
### External executable locations ###
ooo.exe=/contentmgt/alfresco/libreoffice/program/soffice.bin
ooo.enabled=true
ooo.port=9100
img.root=/contentmgt/alfresco/common
img.dyn=${img.root}/lib
img.exe=${img.root}/bin/convert
jodconverter.enabled=false
jodconverter.officeHome=/contentmgt/alfresco/libreoffice
jodconverter.portNumbers=9100
### Initial admin password ###
alfresco_user_store.adminpassword=209c6174da490caeb422f3fa5a7ae634
### E-mail site invitation setting ###
notification.email.siteinvite=false
### License location ###
dir.license.external=/contentmgt/alfresco
### Solr indexing ###
index.subsystem.name=solr4
dir.keystore=${dir.root}/keystore
solr.host=localhost
solr.port.ssl=9443
### Allow extended ResultSet processing
security.anyDenyDenies=false
### Smart Folders Config Properties ###
smart.folders.enabled=false
### Remote JMX (Default: disabled) ###
alfresco.jmx.connector.enabled=false
### Audit Configuration
audit.enabled=true
audit.alfresco-access.enabled=true
### FTP Server Configuration ###
ftp.enabled=true
ftp.port=1121
###properties of OCR ####
# Define a default priority for this transformer
content.transformer.ocr.tiff.available=true
content.transformer.ocr.tiff.priority=10
# List the transformations that are supported
content.transformer.ocr.tiff.extensions.tiff.txt.supported=true
content.transformer.ocr.tiff.extensions.tiff.txt.priority=10
content.transformer.ocr.tiff.extensions.jpg.txt.supported=true
content.transformer.ocr.tiff.extensions.jpg.txt.priority=10
content.transformer.ocr.tiff.extensions.png.txt.supported=true
content.transformer.ocr.tiff.extensions.png.txt.priority=10
content.transformer.ocr.tiff.extensions.gif.txt.supported=true
content.transformer.ocr.tiff.extensions.gif.txt.priority=10
# Define a default priority for this transformer
content.transformer.pdf.tiff.available=true
content.transformer.pdf.tiff.priority=100
# List the transformations that are supported
content.transformer.pdf.tiff.extensions.pdf.txt.supported=true
content.transformer.pdf.tiff.extensions.pdf.txt.priority=100
# Disable the OOTB transformers
content.transformer.double.ImageMagick.extensions.pdf.tiff.supported=false
content.transformer.double.ImageMagick.extensions.pdf.png.supported=false
content.transformer.complex.PDF.Image.extensions.pdf.tiff.supported=false
content.transformer.complex.PDF.Image.extensions.pdf.png.supported=false
content.transformer.ImageMagick.extensions.pdf.tiff.supported=false
content.transformer.ImageMagick.extensions.pdf.png.supported=false
content.transformer.PdfBox.extensions.pdf.txt.supported=false
content.transformer.TikaAuto.extensions.pdf.txt.supported=false
###Ldap####
authentication.chain=ldap-ad1:ldap-ad,alfrescoNtlm1:alfrescoNtlm
ntlm.authentication.sso.enabled=false
ldap.authentication.active=true
ldap.authentication.userNameFormat=%s@netaddress.co.in
ldap.authentication.java.naming.factory.initial=com.sun.jndi.ldap.LdapCtxFactory
ldap.authentication.java.naming.provider.url=ldap://IPADDRESS
ldap.authentication.java.naming.security.authentication=simple
ldap.authentication.escapeCommasInBind=false
ldap.authentication.escapeCommasInUid=false
ldap.authentication.defaultAdministratorUserNames=contentmgt
ldap.synchronization.active=true
ldap.synchronization.java.naming.security.principal=contentmgt@netaddress.co.in
ldap.synchronization.java.naming.security.credentials=contentmgt123
ldap.synchronization.queryBatchSize=10000
ldap.synchronization.attributeBatchSize=10000
ldap.synchronization.groupQuery=(objectclass\=group)
ldap.synchronization.personQuery=objectClass=user
ldap.synchronization.groupDifferentialQuery=(&(objectclass\=group)(!(modifyTimestamp<\={0})))
ldap.synchronization.personDifferentialQuery=(&(objectclass\=user)(userAccountControl\:1.2.840.113556.1.4.803\:\=512)(!(modifyTimestamp<\={0})))
ldap.synchronization.groupSearchBase=ou=RDGROUP\,dc=netaddress,dc=co,dc=in
ldap.synchronization.userSearchBase=ou=RDGROUP,dc=netaddress,dc=co,dc=in
cifs.domain=netaddress.co.in
ldap.synchronization.modifyTimestampAttributeName=whenChanged
ldap.synchronization.timestampFormat=yyyyMMddHHmmss'.0Z'
ldap.synchronization.userIdAttributeName=sAMAccountName
###ldap.synchronization.userFirstNameAttributeName=givenName
ldap.synchronization.userFirstNameAttributeName=displayName
ldap.synchronization.userLastNameAttributeName=sn
ldap.synchronization.userEmailAttributeName=mail
ldap.synchronization.groupIdAttributeName=cn
ldap.synchronization.defaultHomeFolderProvider=userHomesHomeFolderProvider
ldap.synchronization.groupType=group
ldap.synchronization.personType=user
ldap.synchronization.groupMemberAttributeName=member
synchronization.autoCreatePeopleOnLogin=true
synchronization.synchronizeChangesOnly=true
# to sync on each alfresco startup
synchronization.syncOnStartup=true
synchronization.syncWhenMissingPeopleLogIn=true
### DONT USE UNIX CRON EXPRESSION- USE QUARTZ CRON EXPRESSIONS!!!
### look here http://www.quartz-scheduler.org/docs/tutorials/crontrigger.html
### synchronisation starts every 15 minutes!
synchronization.import.cron=0 0/15 * * * ?
ldap.authentication.allowGuestLogin=true
###############################
## Common Alfresco Properties #
###############################
dir.root=/contentmgt/alfresco/alf_data
alfresco.context=alfresco
alfresco.host=127.0.0.1
alfresco.port=9090
alfresco.protocol=http
share.context=share
share.host=127.0.0.1
share.port=9090
share.protocol=http
### database connection properties ###
db.driver=org.postgresql.Driver
db.username=alfresco
db.password=admin
db.name=alfresco
db.url=jdbcostgresql://localhost:5432/${db.name}
# Note: your database must also be able to accept at least this many connections. Please see your database documentation for instructions on how to configure this.
db.pool.max=275
db.pool.validate.query=SELECT 1
# The server mode. Set value here
# UNKNOWN | TEST | BACKUP | PRODUCTION
system.serverMode=UNKNOWN
### FTP Server Configuration ###
### RMI registry port for JMX ###
alfresco.rmi.services.port=50500
### External executable locations ###
ooo.exe=/contentmgt/alfresco/libreoffice/program/soffice.bin
ooo.enabled=true
ooo.port=9100
img.root=/contentmgt/alfresco/common
img.dyn=${img.root}/lib
img.exe=${img.root}/bin/convert
jodconverter.enabled=false
jodconverter.officeHome=/contentmgt/alfresco/libreoffice
jodconverter.portNumbers=9100
### Initial admin password ###
alfresco_user_store.adminpassword=209c6174da490caeb422f3fa5a7ae634
### E-mail site invitation setting ###
notification.email.siteinvite=false
### License location ###
dir.license.external=/contentmgt/alfresco
### Solr indexing ###
index.subsystem.name=solr4
dir.keystore=${dir.root}/keystore
solr.host=localhost
solr.port.ssl=9443
### Allow extended ResultSet processing
security.anyDenyDenies=false
### Smart Folders Config Properties ###
smart.folders.enabled=false
### Remote JMX (Default: disabled) ###
alfresco.jmx.connector.enabled=false
### Audit Configuration
audit.enabled=true
audit.alfresco-access.enabled=true
### FTP Server Configuration ###
ftp.enabled=true
ftp.port=1121
###properties of OCR ####
# Define a default priority for this transformer
content.transformer.ocr.tiff.available=true
content.transformer.ocr.tiff.priority=10
# List the transformations that are supported
content.transformer.ocr.tiff.extensions.tiff.txt.supported=true
content.transformer.ocr.tiff.extensions.tiff.txt.priority=10
content.transformer.ocr.tiff.extensions.jpg.txt.supported=true
content.transformer.ocr.tiff.extensions.jpg.txt.priority=10
content.transformer.ocr.tiff.extensions.png.txt.supported=true
content.transformer.ocr.tiff.extensions.png.txt.priority=10
content.transformer.ocr.tiff.extensions.gif.txt.supported=true
content.transformer.ocr.tiff.extensions.gif.txt.priority=10
# Define a default priority for this transformer
content.transformer.pdf.tiff.available=true
content.transformer.pdf.tiff.priority=100
# List the transformations that are supported
content.transformer.pdf.tiff.extensions.pdf.txt.supported=true
content.transformer.pdf.tiff.extensions.pdf.txt.priority=100
# Disable the OOTB transformers
content.transformer.double.ImageMagick.extensions.pdf.tiff.supported=false
content.transformer.double.ImageMagick.extensions.pdf.png.supported=false
content.transformer.complex.PDF.Image.extensions.pdf.tiff.supported=false
content.transformer.complex.PDF.Image.extensions.pdf.png.supported=false
content.transformer.ImageMagick.extensions.pdf.tiff.supported=false
content.transformer.ImageMagick.extensions.pdf.png.supported=false
content.transformer.PdfBox.extensions.pdf.txt.supported=false
content.transformer.TikaAuto.extensions.pdf.txt.supported=false
###Ldap####
authentication.chain=ldap-ad1:ldap-ad,alfrescoNtlm1:alfrescoNtlm
ntlm.authentication.sso.enabled=false
ldap.authentication.active=true
ldap.authentication.userNameFormat=%s@netaddress.co.in
ldap.authentication.java.naming.factory.initial=com.sun.jndi.ldap.LdapCtxFactory
ldap.authentication.java.naming.provider.url=ldap://IPADDRESS
ldap.authentication.java.naming.security.authentication=simple
ldap.authentication.escapeCommasInBind=false
ldap.authentication.escapeCommasInUid=false
ldap.authentication.defaultAdministratorUserNames=contentmgt
ldap.synchronization.active=true
ldap.synchronization.java.naming.security.principal=contentmgt@netaddress.co.in
ldap.synchronization.java.naming.security.credentials=contentmgt123
ldap.synchronization.queryBatchSize=10000
ldap.synchronization.attributeBatchSize=10000
ldap.synchronization.groupQuery=(objectclass\=group)
ldap.synchronization.personQuery=objectClass=user
ldap.synchronization.groupDifferentialQuery=(&(objectclass\=group)(!(modifyTimestamp<\={0})))
ldap.synchronization.personDifferentialQuery=(&(objectclass\=user)(userAccountControl\:1.2.840.113556.1.4.803\:\=512)(!(modifyTimestamp<\={0})))
ldap.synchronization.groupSearchBase=ou=RDGROUP\,dc=netaddress,dc=co,dc=in
ldap.synchronization.userSearchBase=ou=RDGROUP,dc=netaddress,dc=co,dc=in
cifs.domain=netaddress.co.in
ldap.synchronization.modifyTimestampAttributeName=whenChanged
ldap.synchronization.timestampFormat=yyyyMMddHHmmss'.0Z'
ldap.synchronization.userIdAttributeName=sAMAccountName
###ldap.synchronization.userFirstNameAttributeName=givenName
ldap.synchronization.userFirstNameAttributeName=displayName
ldap.synchronization.userLastNameAttributeName=sn
ldap.synchronization.userEmailAttributeName=mail
ldap.synchronization.groupIdAttributeName=cn
ldap.synchronization.defaultHomeFolderProvider=userHomesHomeFolderProvider
ldap.synchronization.groupType=group
ldap.synchronization.personType=user
ldap.synchronization.groupMemberAttributeName=member
synchronization.autoCreatePeopleOnLogin=true
synchronization.synchronizeChangesOnly=true
# to sync on each alfresco startup
synchronization.syncOnStartup=true
synchronization.syncWhenMissingPeopleLogIn=true
### DONT USE UNIX CRON EXPRESSION- USE QUARTZ CRON EXPRESSIONS!!!
### look here http://www.quartz-scheduler.org/docs/tutorials/crontrigger.html
### synchronisation starts every 15 minutes!
synchronization.import.cron=0 0/15 * * * ?
ldap.authentication.allowGuestLogin=true
10-26-2016 02:06 PM
How have you enable the OCR?
Have you checked if it's not the cause of your problem?
Maybe the problem is that the your OCR generator is executing again for each PDF it creates. That would explain this repository grow.
Tags
Find what you came for
We want to make your experience in Hyland Connect as valuable as possible, so we put together some helpful links.