I'm looking for a way to index the contents of .mm files on Alfresco 4.1 for Windows 7 so I can search within each mind map. So far I've found a promising method below but I can't seem to get it to work:
The best way seems to be extracting each .mm file's text by getting the values of each node's text attribute. An XSLT in the method posted renders all the text clearly in a web browser after renaming the mind map to an xml, then adding the xml prologue and stylesheet tags. Can't seem to figure out a more elegant solution so Alfresco searches each mind map as if it were an XML, yet still returns an .mm file. The method I've already found suggests using xmlstarlet from within Alfresco's RuntimeExecutableContentTransformer bean. However this resulted in a server missing resource error when I tried it on Windows.
Does anyone have a step by step process for implementing full text searching of mind maps through content transformers?
You can configure an analysis bundle for any model, type or property (as well as a default). You also need a transformer to turn your file into a text version. With these two parts you can keep your original file and be able to search it. What you need to do to get an indexable text file and how to tokenise this is up to you!
add an mm to txt content transformer to alfresco. in the transformer script transform the mm file with xsltproc into a text file using the xslt-stylesheet mm2text.xsl included with freemind.