I was wondering if I could specify which file types content should be indexed and searchable. I have a number of documents that I will be placing in Alfresco that include Word Document, PDF, HTML pages etc. I do not want the HTML pages to be searchable. Where do I configure this?
This is controlled by which content types can be converted to text. If there is no trasnaform from html to text then the content of html docs wil not be indexed.