cancel
Showing results for 
Search instead for 
Did you mean: 

How Alfresco full text search and Lucene indexing

user01
Champ in-the-making
Champ in-the-making
Hi,

I have a question related to the way the full text search works.

As far as I read, when a file is uploaded in the repository, in order to be indexed, Lucene needs to read the document as plain/text. Regardless of the mimetype of the document, a content transformation will be applied.

If so, how can I see what Lucene indexes? How can I see the output of the content transformation? (I have used Luke, but I cannot see the output of my transformed content).

Is this transformation affecting the document content which is stored on the filesystem?

Thank you in advance for your answer!
2 REPLIES 2

mitpatoliya
Star Collaborator
Star Collaborator
Yes, content will be transformed first in to plain text then will be indexed in the lucene.
Also, It will just remove all the formatting but I do not thing there will be any loss of information will happen during content transformation.
Why exactly you want to check it?

user01
Champ in-the-making
Champ in-the-making
Hi,

Thank you for the answer!