Problem with PDF document 10MB 140 pages scanned OCR

wmay
Champ in-the-making
I uploaded a scanned, OCR'ed PDF document (10 MB file size, 140 pages, b&w scan, PDF with text below the image) via the Alfresco Explorer and also via CIFS.

The document uploads correctly and can be downloaded and displayed, but it is not searchable via full-text search. When I try to display the text preview from the "i" icon in the Alfresco Explorer, it does not work: when I click the icon, the icon image starts to "rotate", and after a while the error message "500: Internal server error" comes up.

I ran the same test on different 3.2 CE installations, always with the same result. I also repeated the tests with EE 3.0: same result.

It seems that the server is not able to extract the text from the PDF and add it to the full-text index, because the document cannot be found via full-text search and the extracted text cannot be viewed in the Alfresco Explorer.

I split the same PDF document into 6 chapter documents with max. 3.7 MB and max. 50 pages each, and everything works fine: no problem with the search and no problem with the text extraction. The hidden text of the PDF can be selected with CTRL+A, copied from the PDF and pasted into e.g. Notepad without a problem.

My questions

- Has anybody had the same problem with "bigger" PDF documents?
- Is there a size restriction for documents (e.g. PDF) that are uploaded and indexed (text extraction)?
- Are there any known problems with the PDF text extraction related to file size?
- Are there any settings or configuration options which control this process (timeout, size restrictions…)?
- What does the error message "500: Internal server error" mean?

If needed, I can post a download link for the PDF so that you can run tests.

I hope that this is not a "basic" problem or restriction in Alfresco, because PDF text extraction and indexing of "bigger" PDF files should work without a problem. From my understanding, a scanned and OCR'ed 140-page document is not a "big" document but a normal one.

Could it be that the PDF text extractor used is a weak point of Alfresco?

Has anybody tested and implemented other text extractors which work better and more reliably?

Thank you for your answers and help.
6 REPLIES

ofrxnz
Champ in-the-making
- Has anybody had the same problem with "bigger" PDF documents?
I have dropped multiple 25+ MB PDFs in simultaneously with no issue… sometimes it just takes forever to generate previews and whatnot.
- Is there a size restriction for documents (e.g. PDF) that are uploaded and indexed (text extraction)?
Don't think so.
- Are there any known problems with the PDF text extraction related to file size?
I have had some memory issues before when I did a 500-page image PDF… it took 3 hours and slammed the memory… didn't crash, but sure locked up the server.

- Are there any settings or configuration options which control this process (timeout, size restrictions…)?
Not that I know of.

- What does the error message "500: Internal server error" mean?
Usually means the server exploded… it's kind of a generic error that means something didn't fail gracefully.

what is your system configuration?
operating system?
open office version?
alfresco version?
pdf2swf version?
image magick version?

wmay
Champ in-the-making
what is your system configuration?
I tested it on different installations; most of them are based on Debian Linux running on a Xen server or on a physical server, with 1 to 3 GB RAM.
operating system?
Debian Linux, MySQL
alfresco version?
CE 3.2, but I also tested with EE 3.0: same result, same error.

Because I tested it on different installations and with different Alfresco versions (CE 3.2 and EE 3.0), always with the same result and the same error message, I think it must be a basic problem or a problem with the PDF, not an installation problem.

Here you can download my test file; maybe you get the same or a different result on your server.

http://www.may.co.at/webrr/respfile.php?file=MAYComp/Current/RO/Alfresco_PDF_Testfile.zip

wmay
Champ in-the-making
Here is the missing installation information:

configuration:
Alfresco-Community-3.2-Linux-x86 using a mysql backend
operating system: Debian squeeze/sid
openoffice version: 1:3.1.0-5
swftools: 0.8.1-2.1
imagemagick: 7:6.5.1.0-1.1

ofrxnz
Champ in-the-making
The preview issue is related to your pdf2swf version. You will want to use version 0.9.0; I had to build it from source.

The notes are found on this wiki page http://wiki.alfresco.com/wiki/Installing_Alfresco_components

Alfresco Share leverages the features provided in the Development snapshots of the tools. Install a version post 0.8.1. Use the last development snapshots, because version 0.8.1 from 2007-02-28 does not support some functionalities Alfresco needs to render the preview.

The download is here:
http://www.swftools.org/download.html

As for the searchability, it seems to be a bit hit or miss with your test document… I'm going to give it a little while, see if there are any log entries, and if I find anything I'll let you know.

dranakan
Champ on-the-rise
Hello,

I tried adding your file to an Alfresco 3.0 / Windows XP SP3 / MySQL installation.

I can upload but cannot see the preview ("500: Internal server error").

I can confirm that there is a problem… OK, it would be better to have the solution…

Here is the error log:

14:54:47,602 DEBUG [org.alfresco.repo.transaction.RetryingTransactionHelper]
Transaction commit failed:
   Thread: http-8080-1
   Txn:    UserTransaction[object=org.alfresco.util.transaction.SpringAwareUserTransaction@13fd850, status=0]
   Iteration: 0
   Exception follows:
java.lang.reflect.InvocationTargetException
   at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
   at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
   at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
   at java.lang.reflect.Method.invoke(Method.java:597)
   at org.alfresco.web.app.servlet.ajax.InvokeCommand$1.execute(InvokeCommand.java:163)
   at org.alfresco.repo.transaction.RetryingTransactionHelper.doInTransaction(RetryingTransactionHelper.java:320)
   at org.alfresco.repo.transaction.RetryingTransactionHelper.doInTransaction(RetryingTransactionHelper.java:227)
   at org.alfresco.web.app.servlet.ajax.InvokeCommand.execute(InvokeCommand.java:167)
   at org.alfresco.web.app.servlet.ajax.AjaxServlet.service(AjaxServlet.java:148)
   at javax.servlet.http.HttpServlet.service(HttpServlet.java:717)
   at org.apache.catalina.core.ApplicationFilterChain.internalDoFilter(ApplicationFilterChain.java:290)
   at org.apache.catalina.core.ApplicationFilterChain.doFilter(ApplicationFilterChain.java:206)
   at org.apache.catalina.core.StandardWrapperValve.invoke(StandardWrapperValve.java:233)
   at org.apache.catalina.core.StandardContextValve.invoke(StandardContextValve.java:191)
   at org.apache.catalina.core.StandardHostValve.invoke(StandardHostValve.java:128)
   at org.apache.catalina.valves.ErrorReportValve.invoke(ErrorReportValve.java:102)
   at org.apache.catalina.core.StandardEngineValve.invoke(StandardEngineValve.java:109)
   at org.apache.catalina.connector.CoyoteAdapter.service(CoyoteAdapter.java:286)
   at org.apache.coyote.http11.Http11Processor.process(Http11Processor.java:845)
   at org.apache.coyote.http11.Http11Protocol$Http11ConnectionHandler.process(Http11Protocol.java:583)
   at org.apache.tomcat.util.net.JIoEndpoint$Worker.run(JIoEndpoint.java:447)
   at java.lang.Thread.run(Thread.java:619)
Caused by: org.alfresco.service.cmr.repository.TemplateException: Error during processing of the template 'Content conversion failed:
   reader: ContentAccessor[ contentUrl=store://2009/8/6/14/47/525c3c11-7328-4b0a-b8e4-5c56e2266120.bin, mimetype=application/pdf, size=10229463, encoding=UTF-8, locale=fr_FR]
   writer: ContentAccessor[ contentUrl=store://2009/8/6/14/54/12ad16e3-25c5-4160-9eef-dc83668aa61c.bin, mimetype=text/plain, size=0, encoding=UTF-8, locale=fr_FR]
   options: org.alfresco.service.cmr.repository.TransformationOptions@1ce8a52'. Veuillez contacter votre administrateur système.
   at org.alfresco.repo.template.FreeMarkerProcessor.process(FreeMarkerProcessor.java:205)
   at org.alfresco.repo.processor.TemplateServiceImpl.processTemplate(TemplateServiceImpl.java:181)
   at org.alfresco.repo.processor.TemplateServiceImpl.processTemplate(TemplateServiceImpl.java:111)
   at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
   at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
   at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
   at java.lang.reflect.Method.invoke(Method.java:597)
   at org.springframework.aop.support.AopUtils.invokeJoinpointUsingReflection(AopUtils.java:296)
   at org.springframework.aop.framework.ReflectiveMethodInvocation.invokeJoinpoint(ReflectiveMethodInvocation.java:177)
   at org.springframework.aop.framework.ReflectiveMethodInvocation.proceed(ReflectiveMethodInvocation.java:144)
   at org.alfresco.repo.security.permissions.impl.AlwaysProceedMethodInterceptor.invoke(AlwaysProceedMethodInterceptor.java:40)
   at org.springframework.aop.framework.ReflectiveMethodInvocation.proceed(ReflectiveMethodInvocation.java:166)
   at org.alfresco.repo.security.permissions.impl.ExceptionTranslatorMethodInterceptor.invoke(ExceptionTranslatorMethodInterceptor.java:49)
   at org.springframework.aop.framework.ReflectiveMethodInvocation.proceed(ReflectiveMethodInvocation.java:166)
   at org.alfresco.repo.audit.AuditComponentImpl.audit(AuditComponentImpl.java:275)
   at org.alfresco.repo.audit.AuditMethodInterceptor.invoke(AuditMethodInterceptor.java:69)
   at org.springframework.aop.framework.ReflectiveMethodInvocation.proceed(ReflectiveMethodInvocation.java:166)
   at org.springframework.transaction.interceptor.TransactionInterceptor.invoke(TransactionInterceptor.java:107)
   at org.springframework.aop.framework.ReflectiveMethodInvocation.proceed(ReflectiveMethodInvocation.java:166)
   at org.springframework.aop.framework.JdkDynamicAopProxy.invoke(JdkDynamicAopProxy.java:204)
   at $Proxy187.processTemplate(Unknown Source)
   at org.alfresco.web.bean.ajax.NodeInfoBean.sendNodeInfo(NodeInfoBean.java:96)
   … 22 more
Caused by: org.alfresco.service.cmr.repository.ContentIOException: Content conversion failed:
   reader: ContentAccessor[ contentUrl=store://2009/8/6/14/47/525c3c11-7328-4b0a-b8e4-5c56e2266120.bin, mimetype=application/pdf, size=10229463, encoding=UTF-8, locale=fr_FR]
   writer: ContentAccessor[ contentUrl=store://2009/8/6/14/54/12ad16e3-25c5-4160-9eef-dc83668aa61c.bin, mimetype=text/plain, size=0, encoding=UTF-8, locale=fr_FR]
   options: org.alfresco.service.cmr.repository.TransformationOptions@1ce8a52
   at org.alfresco.repo.content.transform.AbstractContentTransformer2.transform(AbstractContentTransformer2.java:225)
   at org.alfresco.repo.content.RoutingContentService.transform(RoutingContentService.java:505)
   at org.alfresco.repo.content.RoutingContentService.transform(RoutingContentService.java:465)
   at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
   at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
   at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
   at java.lang.reflect.Method.invoke(Method.java:597)
   at org.springframework.aop.support.AopUtils.invokeJoinpointUsingReflection(AopUtils.java:296)
   at org.springframework.aop.framework.ReflectiveMethodInvocation.invokeJoinpoint(ReflectiveMethodInvocation.java:177)
   at org.springframework.aop.framework.ReflectiveMethodInvocation.proceed(ReflectiveMethodInvocation.java:144)
   at net.sf.acegisecurity.intercept.method.aopalliance.MethodSecurityInterceptor.invoke(MethodSecurityInterceptor.java:80)
   at org.springframework.aop.framework.ReflectiveMethodInvocation.proceed(ReflectiveMethodInvocation.java:166)
   at org.alfresco.repo.model.ml.MLContentInterceptor.invoke(MLContentInterceptor.java:131)
   at org.springframework.aop.framework.ReflectiveMethodInvocation.proceed(ReflectiveMethodInvocation.java:166)
   at org.alfresco.repo.security.permissions.impl.ExceptionTranslatorMethodInterceptor.invoke(ExceptionTranslatorMethodInterceptor.java:49)
   at org.springframework.aop.framework.ReflectiveMethodInvocation.proceed(ReflectiveMethodInvocation.java:166)
   at org.alfresco.repo.audit.AuditComponentImpl.audit(AuditComponentImpl.java:275)
   at org.alfresco.repo.audit.AuditMethodInterceptor.invoke(AuditMethodInterceptor.java:69)
   at org.springframework.aop.framework.ReflectiveMethodInvocation.proceed(ReflectiveMethodInvocation.java:166)
   at org.springframework.transaction.interceptor.TransactionInterceptor.invoke(TransactionInterceptor.java:107)
   at org.springframework.aop.framework.ReflectiveMethodInvocation.proceed(ReflectiveMethodInvocation.java:166)
   at org.springframework.aop.framework.JdkDynamicAopProxy.invoke(JdkDynamicAopProxy.java:204)
   at $Proxy31.transform(Unknown Source)
   at org.alfresco.repo.template.BaseContentNode$TemplateContentData.getContentAsText(BaseContentNode.java:557)
   at org.alfresco.repo.template.CropContentMethod.exec(CropContentMethod.java:65)
   at freemarker.core.MethodCall._getAsTemplateModel(MethodCall.java:93)
   at freemarker.core.Expression.getAsTemplateModel(Expression.java:89)
   at freemarker.core.Assignment.accept(Assignment.java:90)
   at freemarker.core.Environment.visit(Environment.java:208)
   at freemarker.core.MixedContent.accept(MixedContent.java:92)
   at freemarker.core.Environment.visit(Environment.java:208)
   at freemarker.core.ConditionalBlock.accept(ConditionalBlock.java:79)
   at freemarker.core.Environment.visit(Environment.java:208)
   at freemarker.core.MixedContent.accept(MixedContent.java:92)
   at freemarker.core.Environment.visit(Environment.java:208)
   at freemarker.core.Environment.process(Environment.java:188)
   at freemarker.template.Template.process(Template.java:237)
   at org.alfresco.repo.template.FreeMarkerProcessor.process(FreeMarkerProcessor.java:201)
   … 43 more
Caused by: java.lang.ClassCastException: org.fontbox.cmap.CMapParser$Operator cannot be cast to [B
   at org.fontbox.cmap.CMapParser.parse(CMapParser.java:104)
   at org.pdfbox.pdmodel.font.PDFont.parseCmap(PDFont.java:535)
   at org.pdfbox.pdmodel.font.PDFont.encode(PDFont.java:387)
   at org.pdfbox.util.PDFStreamEngine.showString(PDFStreamEngine.java:325)
   at org.pdfbox.util.operator.ShowText.process(ShowText.java:64)
   at org.pdfbox.util.PDFStreamEngine.processOperator(PDFStreamEngine.java:452)
   at org.pdfbox.util.PDFStreamEngine.processOperator(PDFStreamEngine.java:435)
   at org.pdfbox.util.operator.MoveAndShow.process(MoveAndShow.java:63)
   at org.pdfbox.util.PDFStreamEngine.processOperator(PDFStreamEngine.java:452)
   at org.pdfbox.util.PDFStreamEngine.processSubStream(PDFStreamEngine.java:215)
   at org.pdfbox.util.PDFStreamEngine.processStream(PDFStreamEngine.java:174)
   at org.pdfbox.util.PDFTextStripper.processPage(PDFTextStripper.java:336)
   at org.pdfbox.util.PDFTextStripper.processPages(PDFTextStripper.java:259)
   at org.pdfbox.util.PDFTextStripper.writeText(PDFTextStripper.java:216)
   at org.pdfbox.util.PDFTextStripper.getText(PDFTextStripper.java:149)
   at org.alfresco.repo.content.transform.PdfBoxContentTransformer.transformInternal(PdfBoxContentTransformer.java:77)
   at org.alfresco.repo.content.transform.AbstractContentTransformer2.transform(AbstractContentTransformer2.java:216)
   … 80 more
14:54:47,695 ERROR [alfresco.ajax] Failed to execute method NodeInfoBean.sendNodeInfo: Exception in Transaction.
org.alfresco.error.AlfrescoRuntimeException: Exception in Transaction.
   at org.alfresco.repo.transaction.RetryingTransactionHelper.doInTransaction(RetryingTransactionHelper.java:404)
   at org.alfresco.repo.transaction.RetryingTransactionHelper.doInTransaction(RetryingTransactionHelper.java:227)
   at org.alfresco.web.app.servlet.ajax.InvokeCommand.execute(InvokeCommand.java:167)
   at org.alfresco.web.app.servlet.ajax.AjaxServlet.service(AjaxServlet.java:148)
   at javax.servlet.http.HttpServlet.service(HttpServlet.java:717)
   at org.apache.catalina.core.ApplicationFilterChain.internalDoFilter(ApplicationFilterChain.java:290)
   at org.apache.catalina.core.ApplicationFilterChain.doFilter(ApplicationFilterChain.java:206)
   at org.apache.catalina.core.StandardWrapperValve.invoke(StandardWrapperValve.java:233)
   at org.apache.catalina.core.StandardContextValve.invoke(StandardContextValve.java:191)
   at org.apache.catalina.core.StandardHostValve.invoke(StandardHostValve.java:128)
   at org.apache.catalina.valves.ErrorReportValve.invoke(ErrorReportValve.java:102)
   at org.apache.catalina.core.StandardEngineValve.invoke(StandardEngineValve.java:109)
   at org.apache.catalina.connector.CoyoteAdapter.service(CoyoteAdapter.java:286)
   at org.apache.coyote.http11.Http11Processor.process(Http11Processor.java:845)
   at org.apache.coyote.http11.Http11Protocol$Http11ConnectionHandler.process(Http11Protocol.java:583)
   at org.apache.tomcat.util.net.JIoEndpoint$Worker.run(JIoEndpoint.java:447)
   at java.lang.Thread.run(Thread.java:619)
Caused by: java.lang.reflect.InvocationTargetException
   at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
   at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
   at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
   at java.lang.reflect.Method.invoke(Method.java:597)
   at org.alfresco.web.app.servlet.ajax.InvokeCommand$1.execute(InvokeCommand.java:163)
   at org.alfresco.repo.transaction.RetryingTransactionHelper.doInTransaction(RetryingTransactionHelper.java:320)
   … 16 more
Caused by: org.alfresco.service.cmr.repository.TemplateException: Error during processing of the template 'Content conversion failed:
   reader: ContentAccessor[ contentUrl=store://2009/8/6/14/47/525c3c11-7328-4b0a-b8e4-5c56e2266120.bin, mimetype=application/pdf, size=10229463, encoding=UTF-8, locale=fr_FR]
   writer: ContentAccessor[ contentUrl=store://2009/8/6/14/54/12ad16e3-25c5-4160-9eef-dc83668aa61c.bin, mimetype=text/plain, size=0, encoding=UTF-8, locale=fr_FR]
   options: org.alfresco.service.cmr.repository.TransformationOptions@1ce8a52'. Veuillez contacter votre administrateur système.
   at org.alfresco.repo.template.FreeMarkerProcessor.process(FreeMarkerProcessor.java:205)
   at org.alfresco.repo.processor.TemplateServiceImpl.processTemplate(TemplateServiceImpl.java:181)
   at org.alfresco.repo.processor.TemplateServiceImpl.processTemplate(TemplateServiceImpl.java:111)
   at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
   at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
   at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
   at java.lang.reflect.Method.invoke(Method.java:597)
   at org.springframework.aop.support.AopUtils.invokeJoinpointUsingReflection(AopUtils.java:296)
   at org.springframework.aop.framework.ReflectiveMethodInvocation.invokeJoinpoint(ReflectiveMethodInvocation.java:177)
   at org.springframework.aop.framework.ReflectiveMethodInvocation.proceed(ReflectiveMethodInvocation.java:144)
   at org.alfresco.repo.security.permissions.impl.AlwaysProceedMethodInterceptor.invoke(AlwaysProceedMethodInterceptor.java:40)
   at org.springframework.aop.framework.ReflectiveMethodInvocation.proceed(ReflectiveMethodInvocation.java:166)
   at org.alfresco.repo.security.permissions.impl.ExceptionTranslatorMethodInterceptor.invoke(ExceptionTranslatorMethodInterceptor.java:49)
   at org.springframework.aop.framework.ReflectiveMethodInvocation.proceed(ReflectiveMethodInvocation.java:166)
   at org.alfresco.repo.audit.AuditComponentImpl.audit(AuditComponentImpl.java:275)
   at org.alfresco.repo.audit.AuditMethodInterceptor.invoke(AuditMethodInterceptor.java:69)
   at org.springframework.aop.framework.ReflectiveMethodInvocation.proceed(ReflectiveMethodInvocation.java:166)
   at org.springframework.transaction.interceptor.TransactionInterceptor.invoke(TransactionInterceptor.java:107)
   at org.springframework.aop.framework.ReflectiveMethodInvocation.proceed(ReflectiveMethodInvocation.java:166)
   at org.springframework.aop.framework.JdkDynamicAopProxy.invoke(JdkDynamicAopProxy.java:204)
   at $Proxy187.processTemplate(Unknown Source)
   at org.alfresco.web.bean.ajax.NodeInfoBean.sendNodeInfo(NodeInfoBean.java:96)
   … 22 more
Caused by: org.alfresco.service.cmr.repository.ContentIOException: Content conversion failed:
   reader: ContentAccessor[ contentUrl=store://2009/8/6/14/47/525c3c11-7328-4b0a-b8e4-5c56e2266120.bin, mimetype=application/pdf, size=10229463, encoding=UTF-8, locale=fr_FR]
   writer: ContentAccessor[ contentUrl=store://2009/8/6/14/54/12ad16e3-25c5-4160-9eef-dc83668aa61c.bin, mimetype=text/plain, size=0, encoding=UTF-8, locale=fr_FR]
   options: org.alfresco.service.cmr.repository.TransformationOptions@1ce8a52
   at org.alfresco.repo.content.transform.AbstractContentTransformer2.transform(AbstractContentTransformer2.java:225)
   at org.alfresco.repo.content.RoutingContentService.transform(RoutingContentService.java:505)
   at org.alfresco.repo.content.RoutingContentService.transform(RoutingContentService.java:465)
   at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
   at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
   at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
   at java.lang.reflect.Method.invoke(Method.java:597)
   at org.springframework.aop.support.AopUtils.invokeJoinpointUsingReflection(AopUtils.java:296)
   at org.springframework.aop.framework.ReflectiveMethodInvocation.invokeJoinpoint(ReflectiveMethodInvocation.java:177)
   at org.springframework.aop.framework.ReflectiveMethodInvocation.proceed(ReflectiveMethodInvocation.java:144)
   at net.sf.acegisecurity.intercept.method.aopalliance.MethodSecurityInterceptor.invoke(MethodSecurityInterceptor.java:80)
   at org.springframework.aop.framework.ReflectiveMethodInvocation.proceed(ReflectiveMethodInvocation.java:166)
   at org.alfresco.repo.model.ml.MLContentInterceptor.invoke(MLContentInterceptor.java:131)
   at org.springframework.aop.framework.ReflectiveMethodInvocation.proceed(ReflectiveMethodInvocation.java:166)
   at org.alfresco.repo.security.permissions.impl.ExceptionTranslatorMethodInterceptor.invoke(ExceptionTranslatorMethodInterceptor.java:49)
   at org.springframework.aop.framework.ReflectiveMethodInvocation.proceed(ReflectiveMethodInvocation.java:166)
   at org.alfresco.repo.audit.AuditComponentImpl.audit(AuditComponentImpl.java:275)
   at org.alfresco.repo.audit.AuditMethodInterceptor.invoke(AuditMethodInterceptor.java:69)
   at org.springframework.aop.framework.ReflectiveMethodInvocation.proceed(ReflectiveMethodInvocation.java:166)
   at org.springframework.transaction.interceptor.TransactionInterceptor.invoke(TransactionInterceptor.java:107)
   at org.springframework.aop.framework.ReflectiveMethodInvocation.proceed(ReflectiveMethodInvocation.java:166)
   at org.springframework.aop.framework.JdkDynamicAopProxy.invoke(JdkDynamicAopProxy.java:204)
   at $Proxy31.transform(Unknown Source)
   at org.alfresco.repo.template.BaseContentNode$TemplateContentData.getContentAsText(BaseContentNode.java:557)
   at org.alfresco.repo.template.CropContentMethod.exec(CropContentMethod.java:65)
   at freemarker.core.MethodCall._getAsTemplateModel(MethodCall.java:93)
   at freemarker.core.Expression.getAsTemplateModel(Expression.java:89)
   at freemarker.core.Assignment.accept(Assignment.java:90)
   at freemarker.core.Environment.visit(Environment.java:208)
   at freemarker.core.MixedContent.accept(MixedContent.java:92)
   at freemarker.core.Environment.visit(Environment.java:208)
   at freemarker.core.ConditionalBlock.accept(ConditionalBlock.java:79)
   at freemarker.core.Environment.visit(Environment.java:208)
   at freemarker.core.MixedContent.accept(MixedContent.java:92)
   at freemarker.core.Environment.visit(Environment.java:208)
   at freemarker.core.Environment.process(Environment.java:188)
   at freemarker.template.Template.process(Template.java:237)
   at org.alfresco.repo.template.FreeMarkerProcessor.process(FreeMarkerProcessor.java:201)
   … 43 more
Caused by: java.lang.ClassCastException: org.fontbox.cmap.CMapParser$Operator cannot be cast to [B
   at org.fontbox.cmap.CMapParser.parse(CMapParser.java:104)
   at org.pdfbox.pdmodel.font.PDFont.parseCmap(PDFont.java:535)
   at org.pdfbox.pdmodel.font.PDFont.encode(PDFont.java:387)
   at org.pdfbox.util.PDFStreamEngine.showString(PDFStreamEngine.java:325)
   at org.pdfbox.util.operator.ShowText.process(ShowText.java:64)
   at org.pdfbox.util.PDFStreamEngine.processOperator(PDFStreamEngine.java:452)
   at org.pdfbox.util.PDFStreamEngine.processOperator(PDFStreamEngine.java:435)
   at org.pdfbox.util.operator.MoveAndShow.process(MoveAndShow.java:63)
   at org.pdfbox.util.PDFStreamEngine.processOperator(PDFStreamEngine.java:452)
   at org.pdfbox.util.PDFStreamEngine.processSubStream(PDFStreamEngine.java:215)
   at org.pdfbox.util.PDFStreamEngine.processStream(PDFStreamEngine.java:174)
   at org.pdfbox.util.PDFTextStripper.processPage(PDFTextStripper.java:336)
   at org.pdfbox.util.PDFTextStripper.processPages(PDFTextStripper.java:259)
   at org.pdfbox.util.PDFTextStripper.writeText(PDFTextStripper.java:216)
   at org.pdfbox.util.PDFTextStripper.getText(PDFTextStripper.java:149)
   at org.alfresco.repo.content.transform.PdfBoxContentTransformer.transformInternal(PdfBoxContentTransformer.java:77)
   at org.alfresco.repo.content.transform.AbstractContentTransformer2.transform(AbstractContentTransformer2.java:216)
   … 80 more
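
Reading the log, the useful part is the innermost "Caused by:" entry: a java.lang.ClassCastException thrown in org.fontbox.cmap.CMapParser.parse and reached from PdfBoxContentTransformer ("[B" is the JVM's name for byte[]). So the failure seems to happen while PDFBox parses a font CMap in this PDF, although I have not verified whether that is the whole story. For long traces like this one, a small helper can pull out the root cause automatically. This is only a sketch; the function name root_cause and the shortened sample trace are my own and not part of Alfresco:

```python
from typing import Optional, Tuple


def root_cause(trace: str) -> Tuple[Optional[str], Optional[str]]:
    """Return (exception message, first stack frame) of the last
    'Caused by:' entry in a Java stack trace, or (None, None)."""
    cause, frame = None, None
    for line in trace.splitlines():
        text = line.strip()
        if text.startswith("Caused by:"):
            # A deeper cause replaces whatever was found before it.
            cause = text[len("Caused by:"):].strip()
            frame = None
        elif cause is not None and frame is None and text.startswith("at "):
            # First frame after the cause is where it was thrown.
            frame = text[3:]
    return cause, frame


# Shortened excerpt of the trace posted above.
sample = """\
org.alfresco.error.AlfrescoRuntimeException: Exception in Transaction.
   at org.alfresco.repo.transaction.RetryingTransactionHelper.doInTransaction(RetryingTransactionHelper.java:404)
Caused by: org.alfresco.service.cmr.repository.ContentIOException: Content conversion failed:
   at org.alfresco.repo.content.transform.AbstractContentTransformer2.transform(AbstractContentTransformer2.java:225)
Caused by: java.lang.ClassCastException: org.fontbox.cmap.CMapParser$Operator cannot be cast to [B
   at org.fontbox.cmap.CMapParser.parse(CMapParser.java:104)
"""

cause, frame = root_cause(sample)
print(cause)
print(frame)
```

Run against the full log, it should report the same ClassCastException, since both the DEBUG and the ERROR entry end in that cause.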

Good luck!

mwildam
Champ in-the-making
I can upload but cannot see the preview ("500: Internal server error").

I can confirm that there is a problem… OK, it would be better to have the solution…

Click on the details link, but not the one which changes the page: use the one that triggers the popup. Then you can see the error.