cancel
Showing results for 
Search instead for 
Did you mean: 

Hai, For Every minute my content store folder size is increasing , I uploaded 60MB of totall 240 pdf files , but the content store folder in alf_data is 4.5GB , continuously increasing the size of content store. can u plz suggest me is there any process

sandeepreddy1
Star Contributor
Star Contributor

My content store size is continuously increasing , I uploaded 60MB of around 240 pdf files , but my present content store folder size is 4.5GB

3 REPLIES 3

openpj
Elite Collaborator
Elite Collaborator

Which Alfresco version and edition are you using?

Could you please share your infrastructure architecture and configuration?

(File system, DBMS, alfresco-global.properties)

Hai openpj,

Thanks for the reply.

I am using Alfresco-community-5.1.f,

I installed Alfresco in Redhat 6.3 OS,

1) I configured OCR with that alfresco

2) i did Some customization  in that alfresco

Below is the alfresco-global.properties

###############################

## Common Alfresco Properties #

###############################

dir.root=/contentmgt/alfresco/alf_data

alfresco.context=alfresco

alfresco.host=127.0.0.1

alfresco.port=9090

alfresco.protocol=http

share.context=share

share.host=127.0.0.1

share.port=9090

share.protocol=http

### database connection properties ###

db.driver=org.postgresql.Driver

db.username=alfresco

db.password=admin

db.name=alfresco

db.url=jdbcSmiley Tongueostgresql://localhost:5432/${db.name}

# Note: your database must also be able to accept at least this many connections.  Please see your database documentation for instructions on how to configure this.

db.pool.max=275

db.pool.validate.query=SELECT 1

# The server mode. Set value here

# UNKNOWN | TEST | BACKUP | PRODUCTION

system.serverMode=UNKNOWN

### FTP Server Configuration ###

### RMI registry port for JMX ###

alfresco.rmi.services.port=50500

### External executable locations ###

ooo.exe=/contentmgt/alfresco/libreoffice/program/soffice.bin

ooo.enabled=true

ooo.port=9100

img.root=/contentmgt/alfresco/common

img.dyn=${img.root}/lib

img.exe=${img.root}/bin/convert

jodconverter.enabled=false

jodconverter.officeHome=/contentmgt/alfresco/libreoffice

jodconverter.portNumbers=9100

### Initial admin password ###

alfresco_user_store.adminpassword=209c6174da490caeb422f3fa5a7ae634

### E-mail site invitation setting ###

notification.email.siteinvite=false

### License location ###

dir.license.external=/contentmgt/alfresco

### Solr indexing ###

index.subsystem.name=solr4

dir.keystore=${dir.root}/keystore

solr.host=localhost

solr.port.ssl=9443

### Allow extended ResultSet processing

security.anyDenyDenies=false

### Smart Folders Config Properties ###

smart.folders.enabled=false

### Remote JMX (Default: disabled) ###

alfresco.jmx.connector.enabled=false

### Audit Configuration

  audit.enabled=true

  audit.alfresco-access.enabled=true

  ### FTP Server Configuration ###

  ftp.enabled=true

  ftp.port=1121

###properties of OCR ####

# Define a default priority for this transformer

content.transformer.ocr.tiff.available=true

content.transformer.ocr.tiff.priority=10

# List the transformations that are supported

content.transformer.ocr.tiff.extensions.tiff.txt.supported=true

content.transformer.ocr.tiff.extensions.tiff.txt.priority=10

content.transformer.ocr.tiff.extensions.jpg.txt.supported=true

content.transformer.ocr.tiff.extensions.jpg.txt.priority=10

content.transformer.ocr.tiff.extensions.png.txt.supported=true

content.transformer.ocr.tiff.extensions.png.txt.priority=10

content.transformer.ocr.tiff.extensions.gif.txt.supported=true

content.transformer.ocr.tiff.extensions.gif.txt.priority=10

# Define a default priority for this transformer

content.transformer.pdf.tiff.available=true

content.transformer.pdf.tiff.priority=100

# List the transformations that are supported

content.transformer.pdf.tiff.extensions.pdf.txt.supported=true

content.transformer.pdf.tiff.extensions.pdf.txt.priority=100

# Disable the OOTB transformers

content.transformer.double.ImageMagick.extensions.pdf.tiff.supported=false

content.transformer.double.ImageMagick.extensions.pdf.png.supported=false

content.transformer.complex.PDF.Image.extensions.pdf.tiff.supported=false

content.transformer.complex.PDF.Image.extensions.pdf.png.supported=false

content.transformer.ImageMagick.extensions.pdf.tiff.supported=false

content.transformer.ImageMagick.extensions.pdf.png.supported=false

content.transformer.PdfBox.extensions.pdf.txt.supported=false

content.transformer.TikaAuto.extensions.pdf.txt.supported=false

###Ldap####

authentication.chain=ldap-ad1:ldap-ad,alfrescoNtlm1:alfrescoNtlm

ntlm.authentication.sso.enabled=false

ldap.authentication.active=true

ldap.authentication.userNameFormat=%s@netaddress.co.in

ldap.authentication.java.naming.factory.initial=com.sun.jndi.ldap.LdapCtxFactory

ldap.authentication.java.naming.provider.url=ldap://IPADDRESS

ldap.authentication.java.naming.security.authentication=simple

ldap.authentication.escapeCommasInBind=false

ldap.authentication.escapeCommasInUid=false

ldap.authentication.defaultAdministratorUserNames=contentmgt

ldap.synchronization.active=true

ldap.synchronization.java.naming.security.principal=contentmgt@netaddress.co.in

ldap.synchronization.java.naming.security.credentials=contentmgt123

ldap.synchronization.queryBatchSize=10000

ldap.synchronization.attributeBatchSize=10000

ldap.synchronization.groupQuery=(objectclass\=group)

ldap.synchronization.personQuery=objectClass=user

ldap.synchronization.groupDifferentialQuery=(&(objectclass\=group)(!(modifyTimestamp<\={0})))

ldap.synchronization.personDifferentialQuery=(&(objectclass\=user)(userAccountControl\:1.2.840.113556.1.4.803\:\=512)(!(modifyTimestamp<\={0})))

ldap.synchronization.groupSearchBase=ou=RDGROUP\,dc=netaddress,dc=co,dc=in

ldap.synchronization.userSearchBase=ou=RDGROUP,dc=netaddress,dc=co,dc=in

cifs.domain=netaddress.co.in

ldap.synchronization.modifyTimestampAttributeName=whenChanged

ldap.synchronization.timestampFormat=yyyyMMddHHmmss'.0Z'

ldap.synchronization.userIdAttributeName=sAMAccountName

###ldap.synchronization.userFirstNameAttributeName=givenName

ldap.synchronization.userFirstNameAttributeName=displayName

ldap.synchronization.userLastNameAttributeName=sn

ldap.synchronization.userEmailAttributeName=mail

ldap.synchronization.groupIdAttributeName=cn

ldap.synchronization.defaultHomeFolderProvider=userHomesHomeFolderProvider

ldap.synchronization.groupType=group

ldap.synchronization.personType=user

ldap.synchronization.groupMemberAttributeName=member

synchronization.autoCreatePeopleOnLogin=true

synchronization.synchronizeChangesOnly=true

# to sync on each alfresco startup

synchronization.syncOnStartup=true

synchronization.syncWhenMissingPeopleLogIn=true

### DONT USE UNIX CRON EXPRESSION- USE QUARTZ CRON EXPRESSIONS!!!

### look here http://www.quartz-scheduler.org/docs/tutorials/crontrigger.html

### synchronisation starts every 15 minutes!

synchronization.import.cron=0 0/15 * * * ?

ldap.authentication.allowGuestLogin=true

###############################

## Common Alfresco Properties #

###############################

dir.root=/contentmgt/alfresco/alf_data

alfresco.context=alfresco

alfresco.host=127.0.0.1

alfresco.port=9090

alfresco.protocol=http

share.context=share

share.host=127.0.0.1

share.port=9090

share.protocol=http

### database connection properties ###

db.driver=org.postgresql.Driver

db.username=alfresco

db.password=admin

db.name=alfresco

db.url=jdbcSmiley Tongueostgresql://localhost:5432/${db.name}

# Note: your database must also be able to accept at least this many connections.  Please see your database documentation for instructions on how to configure this.

db.pool.max=275

db.pool.validate.query=SELECT 1

# The server mode. Set value here

# UNKNOWN | TEST | BACKUP | PRODUCTION

system.serverMode=UNKNOWN

### FTP Server Configuration ###

### RMI registry port for JMX ###

alfresco.rmi.services.port=50500

### External executable locations ###

ooo.exe=/contentmgt/alfresco/libreoffice/program/soffice.bin

ooo.enabled=true

ooo.port=9100

img.root=/contentmgt/alfresco/common

img.dyn=${img.root}/lib

img.exe=${img.root}/bin/convert

jodconverter.enabled=false

jodconverter.officeHome=/contentmgt/alfresco/libreoffice

jodconverter.portNumbers=9100

### Initial admin password ###

alfresco_user_store.adminpassword=209c6174da490caeb422f3fa5a7ae634

### E-mail site invitation setting ###

notification.email.siteinvite=false

### License location ###

dir.license.external=/contentmgt/alfresco

### Solr indexing ###

index.subsystem.name=solr4

dir.keystore=${dir.root}/keystore

solr.host=localhost

solr.port.ssl=9443

### Allow extended ResultSet processing

security.anyDenyDenies=false

### Smart Folders Config Properties ###

smart.folders.enabled=false

### Remote JMX (Default: disabled) ###

alfresco.jmx.connector.enabled=false

### Audit Configuration

  audit.enabled=true

  audit.alfresco-access.enabled=true

  ### FTP Server Configuration ###

  ftp.enabled=true

  ftp.port=1121

###properties of OCR ####

# Define a default priority for this transformer

content.transformer.ocr.tiff.available=true

content.transformer.ocr.tiff.priority=10

# List the transformations that are supported

content.transformer.ocr.tiff.extensions.tiff.txt.supported=true

content.transformer.ocr.tiff.extensions.tiff.txt.priority=10

content.transformer.ocr.tiff.extensions.jpg.txt.supported=true

content.transformer.ocr.tiff.extensions.jpg.txt.priority=10

content.transformer.ocr.tiff.extensions.png.txt.supported=true

content.transformer.ocr.tiff.extensions.png.txt.priority=10

content.transformer.ocr.tiff.extensions.gif.txt.supported=true

content.transformer.ocr.tiff.extensions.gif.txt.priority=10

# Define a default priority for this transformer

content.transformer.pdf.tiff.available=true

content.transformer.pdf.tiff.priority=100

# List the transformations that are supported

content.transformer.pdf.tiff.extensions.pdf.txt.supported=true

content.transformer.pdf.tiff.extensions.pdf.txt.priority=100

# Disable the OOTB transformers

content.transformer.double.ImageMagick.extensions.pdf.tiff.supported=false

content.transformer.double.ImageMagick.extensions.pdf.png.supported=false

content.transformer.complex.PDF.Image.extensions.pdf.tiff.supported=false

content.transformer.complex.PDF.Image.extensions.pdf.png.supported=false

content.transformer.ImageMagick.extensions.pdf.tiff.supported=false

content.transformer.ImageMagick.extensions.pdf.png.supported=false

content.transformer.PdfBox.extensions.pdf.txt.supported=false

content.transformer.TikaAuto.extensions.pdf.txt.supported=false

###Ldap####

authentication.chain=ldap-ad1:ldap-ad,alfrescoNtlm1:alfrescoNtlm

ntlm.authentication.sso.enabled=false

ldap.authentication.active=true

ldap.authentication.userNameFormat=%s@netaddress.co.in

ldap.authentication.java.naming.factory.initial=com.sun.jndi.ldap.LdapCtxFactory

ldap.authentication.java.naming.provider.url=ldap://IPADDRESS

ldap.authentication.java.naming.security.authentication=simple

ldap.authentication.escapeCommasInBind=false

ldap.authentication.escapeCommasInUid=false

ldap.authentication.defaultAdministratorUserNames=contentmgt

ldap.synchronization.active=true

ldap.synchronization.java.naming.security.principal=contentmgt@netaddress.co.in

ldap.synchronization.java.naming.security.credentials=contentmgt123

ldap.synchronization.queryBatchSize=10000

ldap.synchronization.attributeBatchSize=10000

ldap.synchronization.groupQuery=(objectclass\=group)

ldap.synchronization.personQuery=objectClass=user

ldap.synchronization.groupDifferentialQuery=(&(objectclass\=group)(!(modifyTimestamp<\={0})))

ldap.synchronization.personDifferentialQuery=(&(objectclass\=user)(userAccountControl\:1.2.840.113556.1.4.803\:\=512)(!(modifyTimestamp<\={0})))

ldap.synchronization.groupSearchBase=ou=RDGROUP\,dc=netaddress,dc=co,dc=in

ldap.synchronization.userSearchBase=ou=RDGROUP,dc=netaddress,dc=co,dc=in

cifs.domain=netaddress.co.in

ldap.synchronization.modifyTimestampAttributeName=whenChanged

ldap.synchronization.timestampFormat=yyyyMMddHHmmss'.0Z'

ldap.synchronization.userIdAttributeName=sAMAccountName

###ldap.synchronization.userFirstNameAttributeName=givenName

ldap.synchronization.userFirstNameAttributeName=displayName

ldap.synchronization.userLastNameAttributeName=sn

ldap.synchronization.userEmailAttributeName=mail

ldap.synchronization.groupIdAttributeName=cn

ldap.synchronization.defaultHomeFolderProvider=userHomesHomeFolderProvider

ldap.synchronization.groupType=group

ldap.synchronization.personType=user

ldap.synchronization.groupMemberAttributeName=member

synchronization.autoCreatePeopleOnLogin=true

synchronization.synchronizeChangesOnly=true

# to sync on each alfresco startup

synchronization.syncOnStartup=true

synchronization.syncWhenMissingPeopleLogIn=true

### DONT USE UNIX CRON EXPRESSION- USE QUARTZ CRON EXPRESSIONS!!!

### look here http://www.quartz-scheduler.org/docs/tutorials/crontrigger.html

### synchronisation starts every 15 minutes!

synchronization.import.cron=0 0/15 * * * ?

ldap.authentication.allowGuestLogin=true

How have you enable the OCR?

Have you checked if it's not the cause of your problem?

Maybe the problem is that the your OCR generator is executing again for each PDF it creates. That would explain this repository grow.