cancel
Showing results for 
Search instead for 
Did you mean: 

Alfresco Auto Tagging

jboss
Champ in-the-making
Champ in-the-making
I was wondering if Alfresco supported some kind of auto-tagging service out of the box.   I know it allows for tagging, and you can use the API to add tags to uploads, but is there any kind of built in service to handle auto-tagging?

If not would the OpenCalais integration I've read about on the forums be my best choice for some kind of auto-tag service?
6 REPLIES 6

mrogers
Star Contributor
Star Contributor
How do you imagine that "auto tagging" would work?   What would your buisness rules look like.

There is already the ability to extract metadata from documents and the ability to run code when documents are added so it shouldn't be too difficult to apply some sort of tag.

jboss
Champ in-the-making
Champ in-the-making
I guess what I had in mind was parsing the document and tagging based on commonly used words, or possibly trying to correlate those with similar words from previous searches.  But really I just needed an answer, so thanks  Smiley Happy .  I'll look into other options/writing our own service.

stevereiner
Champ in-the-making
Champ in-the-making
The OpenCalais Integration Alfresco extension available in the Alfresco Forge can be used to auto tag your documents based on their content. http://forge.alfresco.com/projects/calais/  This extension also has Share UI with it http://integratedsemantics.org/2011/03/08/alfresco-opencalais-integration-share-ui/ in addition to UI for it being available in FlexSpaces.   Calais / OpenCalais is not open src, its an"open api" web service that is free for low volumes.

IKS / Apache Stanbol is a completely open src / free stack  (when it uses OpenNLP instead of the optional use of OpenCalais).   Zaizi is working on an Alfresco integration to IKS / Apache Stanbol.    Integrated Semantics will be doing additional UI for Apache Stanbol  + Alfresco   http://code.google.com/p/semantics4alfresco/

jboss
Champ in-the-making
Champ in-the-making
I added OpenCalais to my Alfresco installation, I'm still testing its semantic tagging, but what happens when I exceed these 'low volumes'?  And do you have an idea of where that break might be, would it be a certain number of documents/kb per month?

jboss
Champ in-the-making
Champ in-the-making
Just saw this: http://www.opencalais.com/documentation/calais-web-service-api/usage-quotas
If a number of documents are submitted at the same time, is the processing rate automatically capped at 4 per second then?

stevereiner
Champ in-the-making
Champ in-the-making
You would have to check with OpenCalais support for more info on the limits. The Calais integration has been ported to Apache Stanbol, which is open source and won't have these limits  http://code.google.com/p/semantics4alfresco/