I've come across an issue in the community Alfresco Lab 3b Linux version. In the Web Client, when I search for a word that is contained in an MS Office 2007 Word document (.docx), the search results do not return the document. It can find the titles of .docx files, just not their content. I've tried with an older MS Office Word document (.doc) and it can search the contents of that document successfully. Does anyone else have this issue with the search function?
I tried a slightly earlier version and had no luck either. OpenOffice 3.0, which was only released a few weeks ago, has read-only support for .docx files, so you could try upgrading to the latest version of that and hope it works. The reason I say this is that (I believe) most documents are converted to plain text using the OpenOffice converter before they are indexed. You can't index a .docx file directly as it's a zip file: try unzipping it and you'll find all the appropriate XML files in there. If all else fails, you might want to try converting the file to something that can be indexed before uploading it to Alfresco.
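For what it's worth, here is a rough Python sketch of the "unzip it and convert it to something indexable" idea. It is not how Alfresco does anything internally; it only assumes the usual .docx layout, where the main body lives in word/document.xml and the visible text sits in w:t elements. The function name docx_to_text is made up for this example.

```python
import zipfile
import xml.etree.ElementTree as ET

# WordprocessingML namespace used by the main document part of a .docx file.
W_NS = "http://schemas.openxmlformats.org/wordprocessingml/2006/main"


def docx_to_text(path):
    """Return the plain text of a .docx file, one line per paragraph.

    Hypothetical helper: assumes the body is stored in word/document.xml,
    which is the usual layout but is not checked here.
    """
    # A .docx file is an ordinary zip archive, so zipfile can open it.
    with zipfile.ZipFile(path) as archive:
        xml_bytes = archive.read("word/document.xml")

    root = ET.fromstring(xml_bytes)
    paragraphs = []
    # Each w:p element is a paragraph; its text is split across w:t runs.
    for para in root.iter("{%s}p" % W_NS):
        runs = [node.text or "" for node in para.iter("{%s}t" % W_NS)]
        paragraphs.append("".join(runs))
    return "\n".join(paragraphs)
```

You could write the returned string to a .txt file and upload that alongside the original, since plain text should be searchable. The sketch ignores headers, footers, footnotes and anything else stored outside word/document.xml, and paragraphs nested inside text boxes may show up twice.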
Just to report back, I did the upgrade to OpenOffice 3 … and no luck. I still can't search the contents of a .docx file :cry: I guess we'll just have to deal with it.