topic Alfresco Developer Guide, PdfBoxMetadataExtracter problem in Alfresco Archive

Alfresco Developer Guide, PdfBoxMetadataExtracter problem

col_edinburgh — Mon, 18 Jul 2011 09:37:49 GMT

I am following the guide in the book Alfresco Developer Guide but have encountered a something I don't understand.In chapter 4, Digging into the developer class the book refers to org.alfresco.repo.content.metadata.PdfBoxMetadataExtracter class‍and describe the code belowDDocumentInformation docIn

Re: Alfresco Developer Guide, PdfBoxMetadataExtracter problem

mrogers — Tue, 19 Jul 2011 07:06:56 GMT

Since that example was written, it looks like the "old" PDFBox metadata extractor has been replaced with a Apache Tika based extractor.
However, from the few lines you have given, it still looks like a good example, even though it does not match the current code.

Re: Alfresco Developer Guide, PdfBoxMetadataExtracter problem

col_edinburgh — Tue, 19 Jul 2011 10:12:25 GMT

i'm now totally lost.

Using the examples in the Developer guide, I have imported into Eclipse the code from

http://www.packtpub.com/files/code/3117_Code.zip‍

run Ant Build on the code from Chapter 2 example and copied to Alfresco and its works.
run Ant Build on the code from Chapter 3 example and copied to Alfresco and its works.
run Ant Build on the code from Chapter 4 example and copied to Alfresco and it fails - http 404

one step forward and two back.

Re: Alfresco Developer Guide, PdfBoxMetadataExtracter problem

col_edinburgh — Tue, 19 Jul 2011 10:37:15 GMT

Since that example was written, it looks like the "old" PDFBox metadata extractor has been replaced with a Apache Tika based extractor.

Thanks, back to thew drawing board