cancel
Showing results for 
Search instead for 
Did you mean: 

Metadata extractor

dleeuwen
Champ in-the-making
Champ in-the-making
Hi,

Does anyone know how to extract custom metadata using the metadata extractor from alfresco, or how to use the metadata extractor at all?

I searched the whole forum and the wiki but couldn't find anything on how to use the metadata extractor from alfresco.

I hope someone can help me with this problem.

Thanks,

Danny
5 REPLIES 5

dleeuwen
Champ in-the-making
Champ in-the-making
Anyone? Or does anyone know where to find the file wich contains the java class or something, so I can modify it.

kevinr
Star Contributor
Star Contributor
The package you want to look at is:
org.alfresco.repo.content.metadata
Found in the Repository project, this contains the various meta-data extractor class implementations and the MetadataExtractorRegistry.

The configuration for the meta-data extractors (you can add your own) is found in config/alfresco/content-services-context.xml in this section:

   <!– Content Metadata Extracters –>
   <bean class="org.alfresco.repo.content.metadata.PdfBoxMetadataExtracter" parent="baseMetadataExtracter" />
   <bean class="org.alfresco.repo.content.metadata.OfficeMetadataExtracter" parent="baseMetadataExtracter" />
   <bean class="org.alfresco.repo.content.metadata.MailMetadataExtracter" parent="baseMetadataExtracter" />
   <bean class="org.alfresco.repo.content.metadata.HtmlMetadataExtracter" parent="baseMetadataExtracter" />
   <bean class="org.alfresco.repo.content.metadata.MP3MetadataExtracter" parent="baseMetadataExtracter" />
   <bean class="org.alfresco.repo.content.metadata.OpenDocumentMetadataExtracter" parent="baseMetadataExtracter" />
   <bean class="org.alfresco.repo.content.metadata.OpenOfficeMetadataExtracter" parent="baseMetadataExtracter" init-method="init" >
      <property name="connection">
         <ref bean="openOfficeConnection" />
      </property>
   </bean>

Hope this helps,

Kevin

dleeuwen
Champ in-the-making
Champ in-the-making
Yes this helps, now I know where too look at. Thanks a lot,

Danny

dleeuwen
Champ in-the-making
Champ in-the-making
Maybe a stupid question but I have tried to find the directory but I couldn't find it. Is this because I use the installer from the enterprise edition? Anyway, do you know where I could find the package org.alfresco.repo.content.metadata? Is it in a JAR file or something?

kevinr
Star Contributor
Star Contributor
You need to download the SDK to access the source - the package structure mentioned will be in the 'repository' project source/java dir.

Thanks,

Kevin