05-21-2019 08:45 PM
Greetings. I am evaluating Alfresco to see whether it is possible to set up a public-facing website hosting 20,000 PDF files (an old newspaper archive) with full-text OCR and SOLR integration. Would Alfresco be a platform that I could build this application on? If so, any suggestions on templates etc. would be greatly appreciated. Thanks.
05-26-2019 03:11 PM
Hi:
Yes, but OCR features are not available out of the box (not even in the EE version). That said, you can use a Community addon that relies on a non-commercial OCR tool (tesseract, pdfsandwich...). For the implementation, you should use a dedicated SOLR server, because your final SOLR indices will almost certainly grow to a significant percentage of the size of your contentstore.
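To give an idea of what the OCR step looks like outside Alfresco, here is a minimal sketch that prepares one pdfsandwich call per PDF in a folder. It only assumes that pdfsandwich is installed and on the PATH; the function names, the folder layout and the "eng" language default are made up for illustration, and this is not how any particular Alfresco addon works internally.

```python
import subprocess
from pathlib import Path
from typing import List


def build_ocr_command(src: Path, dst: Path, lang: str = "eng") -> List[str]:
    """Return a pdfsandwich command line that writes a searchable copy of src to dst."""
    # -lang selects the tesseract language, -o sets the output file.
    return ["pdfsandwich", "-lang", lang, "-o", str(dst), str(src)]


def ocr_archive(src_dir: Path, out_dir: Path, lang: str = "eng",
                dry_run: bool = True) -> List[List[str]]:
    """Build (and optionally run) one OCR command per PDF found in src_dir.

    Hypothetical helper: PDFs that already have an output file are skipped,
    so an interrupted batch can be restarted.
    """
    out_dir.mkdir(parents=True, exist_ok=True)
    commands = []
    for src in sorted(src_dir.glob("*.pdf")):
        dst = out_dir / src.name
        if dst.exists():
            continue  # already processed in an earlier run
        cmd = build_ocr_command(src, dst, lang)
        commands.append(cmd)
        if not dry_run:
            # Requires pdfsandwich to be installed; raises if the tool fails.
            subprocess.run(cmd, check=True)
    return commands
```

With dry_run=True (the default here) nothing is executed and you just get the list of commands back, which is handy for checking a 20,000-file batch before starting it. The resulting searchable PDFs would then be uploaded to the repository so that SOLR can index their text layer.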
Kind regards.
--C.
P.S.: This question is not related to ADF; it should probably be in a more general forum (Content Services, for example).