Unable to search contents of MSOffice2007 word document

aliciac
Champ in-the-making
Hi All,

I've come across an issue in the Linux version of community Alfresco Lab 3b. In the Web Client, when I search for a word that is contained in an MS Office 2007 Word document (.docx), the search results do not include the document. Search can find the titles of .docx files, just not their content.
I've tried the same thing with an older MS Office Word document (.doc), and searching its content works.
Does anyone else have this issue with the search function?

Cheers
4 REPLIES

sbuckle
Champ in-the-making
I tried on a slightly earlier version and had no luck either. OpenOffice 3.0, which was released only a few weeks ago, has read-only support for .docx files, so you could try upgrading to that and hope it works. The reason I say this is that (I believe) most documents are converted to plain text using the OpenOffice converter before they are indexed. You can't index a .docx file directly because it's a zip file: try unzipping it and you'll find all the relevant XML files in there. If all else fails, you might want to convert the file to a format that can be indexed before uploading it to Alfresco.
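
To illustrate the point about .docx being a zip of XML files, here is a minimal Python sketch that pulls the plain text out of one. It assumes the usual Office Open XML layout, where the main body is stored in word/document.xml. The function name docx_to_text is made up for this example, and this is not how Alfresco does its indexing; it is only one way to get indexable text out of a file yourself.

```python
import zipfile
import xml.etree.ElementTree as ET

# WordprocessingML namespace used by the elements in word/document.xml
W_NS = "http://schemas.openxmlformats.org/wordprocessingml/2006/main"


def docx_to_text(source):
    """Return the plain text of a .docx, given a path or a file-like object.

    Hypothetical helper: assumes the main body is at word/document.xml.
    """
    # A .docx is an ordinary zip archive, so zipfile can open it directly.
    with zipfile.ZipFile(source) as archive:
        xml_bytes = archive.read("word/document.xml")

    root = ET.fromstring(xml_bytes)
    paragraphs = []
    # Each <w:p> is a paragraph; its visible text sits in <w:t> elements.
    for para in root.iter(f"{{{W_NS}}}p"):
        runs = [node.text or "" for node in para.iter(f"{{{W_NS}}}t")]
        paragraphs.append("".join(runs))
    return "\n".join(paragraphs)
```

Calling docx_to_text("report.docx") (the file name is just a placeholder) returns the paragraphs joined by newlines, which could then be saved as a .txt file and uploaded alongside the original.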

aliciac
Champ in-the-making
Thanks for the tip. I will try upgrading to the new OpenOffice. *fingers crossed*

aliciac
Champ in-the-making
Just to report back: I did the upgrade to OpenOffice 3 … and no luck. I still can't search the contents of a .docx file  :cry:
I guess we'll just have to deal with it.

sbuckle
Champ in-the-making
I had a look at the issue tracker and came across this:

https://issues.alfresco.com/jira/browse/ETWOTWO-767

Seems to be a known problem.