Disable auto extract metadata from pdf

sanjaybandhaniya
Elite Collaborator

I am exploring the Docker-based Alfresco version 7.2.0.

I found a project that generates the setup for Alfresco:

https://github.com/Alfresco/alfresco-docker-installer

I want to disable automatic metadata extraction from PDFs. How can I disable it?

I tried the properties below, but they do not work.

# pdfs
extracter.PDFBox.enabled=false
# office types
extracter.Office.doc.enabled=false
extracter.Office.xls.enabled=false
extracter.Office.ppt.enabled=false
# images (no exif aspect)
extracter.TikaAuto.enabled=false
# docx,xlsx,pptx
extracter.Poi.enabled=false

I think the extraction functionality has been moved to the Transform Service.

2 REPLIES

signshah
Champ in-the-making

Hi Sanjay,

Any luck on this?

I have an issue with my Alfresco 7.3.0: metadata extraction is not happening, and thumbnails are not being generated either.

openpj
Elite Collaborator

I think you now need to override engine_config.json, either inside the Transform Core AIO or in the specific transform service that you deploy in your Alfresco installation.

Here you have an example of how you can override the default configuration provided by the Alfresco Transform Service:

https://github.com/aborroy/alfresco-transform-service-3.0.0
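
To give a rough idea of what such an override could look like, here is a minimal Python sketch that removes the PDF metadata extraction entry from a config before you mount the patched file into the container. Everything in it is an assumption to check against the real engine_config.json of your transform engine: the "transformers" / "supportedSourceAndTargetList" layout, the "alfresco-metadata-extract" target media type, the hypothetical transformer name "ExampleTransformer", and the helper name drop_pdf_metadata_extract.

```python
import copy
import json

# Assumption: metadata extraction is declared as a source/target pair whose
# target media type is "alfresco-metadata-extract". Verify in your own file.
METADATA_TARGET = "alfresco-metadata-extract"


def drop_pdf_metadata_extract(config, source="application/pdf", target=METADATA_TARGET):
    """Return a copy of config without the given source/target pairs."""
    result = copy.deepcopy(config)
    for transformer in result.get("transformers", []):
        pairs = transformer.get("supportedSourceAndTargetList", [])
        # Keep every pair except the one that maps PDF to metadata extraction.
        transformer["supportedSourceAndTargetList"] = [
            pair
            for pair in pairs
            if not (
                pair.get("sourceMediaType") == source
                and pair.get("targetMediaType") == target
            )
        ]
    return result


# Hypothetical, heavily trimmed sample; a real engine_config.json is much larger.
sample = {
    "transformers": [
        {
            "transformerName": "ExampleTransformer",
            "supportedSourceAndTargetList": [
                {"sourceMediaType": "application/pdf", "targetMediaType": "text/plain"},
                {"sourceMediaType": "application/pdf", "targetMediaType": METADATA_TARGET},
            ],
        }
    ]
}

patched = drop_pdf_metadata_extract(sample)
print(json.dumps(patched, indent=2))
```

In practice you would load the real file with json.load, write the patched result back with json.dump, and then make the transform service pick up that file the way the linked project describes.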

Hope this helps.