cancel
Showing results for 
Search instead for 
Did you mean: 

Extract common metadata fields

ashwini
Champ in-the-making
Champ in-the-making
Hello ,

Would like to know how I can extract the metadata of Email (.MSG or .eml file). The metadata is nothing but a receivers address /senders address /date/subject and all .

I tried doing Extract common metadata fields for the Mimetype in cm:content is 'EMail' using Business Rule but whenever I uploaded .MSG file it shows null values in Originator,Addressee,Addressees,sent date and Subject.

I have configured IMAP as below

imap.server.enabled=true
imap.server.port=143
imap.server.host=172.16.1.130

where imap.server.host is nothing but a IP address where my Alfresco server is running which is on windows environment.

Do I have to configure anything else ?

Please help me to workout on this.

Thanks in advance!!!

Regards,
Ashwini
3 REPLIES 3

mrogers
Star Contributor
Star Contributor
Imap uses RFC822 (eml files).  You can configure which properties are extracted and where they go by overriding the RFC822MetadataExtractor.properties file which is basically a mapping from email header field to alfresco property name.

Office messages (msg files) are handled in a similar way by the MailMetadataExtractor.    The list of properties is more constrained by this extractor and does not support arbitary header fields.

ashwini
Champ in-the-making
Champ in-the-making
Imap uses RFC822 (eml files).  You can configure which properties are extracted and where they go by overriding the RFC822MetadataExtractor.properties file which is basically a mapping from email header field to alfresco property name.

Office messages (msg files) are handled in a similar way by the MailMetadataExtractor.    The list of properties is more constrained by this extractor and does not support arbitary header fields.

Thanks mrogers for your quick reply.

But I have not found both the files( RFC822MetadataExtractor.properties /MailMetadataExtractor) . Where  can I get these files or I have to create it ? If yes, how ?

Do I have to configure SMTP protocol /Email server for this ??

Thank you so much again!!

ashwini
Champ in-the-making
Champ in-the-making
Hello ,
Now i am able to extract the metadata of .eml file. But when I upload the .MSG file the mime-type of file shows as "unknown". So as per the Jira issue http://issues.alfresco.com/jira/browse/ALF-3376 I have updated the mimetype-map.xml. But even it doesn't work. S finally I have updated mimetype.ftl file with below mentioned line :

       <@mimetypeOption mt="application/vnd.ms-outlook" />

      <#elseif mt=="application/vnd.ms-outlook">
        <#return "Outlook MSG">


now it start showing mime-type as Outlook MSG for .MSG file but still metadata is not getting extracted for .MSG file.

How I can achieve this?

For .eml file only originator, sent date and subject line is getting extracted. What I have to do if I want to extract TO/CC and BCC field.


Any kind of help is appreciated..