I am generating a large number of PDF invoices (an estimated 20,000 to 200,000 in one bulk run) using DOC1 from Group1, and I do not want any indexing performed when I upload these files to Alfresco. By default, cm:content gets indexed. How can I solve this problem? Any suggestions are welcome. Thanks in advance.
Alternatively, you can create a custom version of org.alfresco.repo.content.transform.PdfBoxContentTransformer and wire it in. The stock transformer currently converts PDF to text for indexing. Your custom one can check the node's aspects or type and return null for your invoices, so that they will not be indexed.
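The check described above might look roughly like the sketch below. It is plain Java and does not use the Alfresco API: the class name, the method, and the aspect name `acme:invoice` are all made up for illustration, and the real transformer's method signature depends on your Alfresco version, so treat this only as the shape of the decision logic you would put inside your custom transformer.

```java
import java.util.Set;
import java.util.function.Supplier;

// Hypothetical sketch of the "skip indexing for invoices" decision.
// Nothing here is an Alfresco class; in a real custom transformer the
// aspects would have to be looked up for the node being transformed.
class InvoiceIndexingSketch {

    // Assumed marker aspect applied to invoices on upload (made-up name).
    static final String NO_INDEX_ASPECT = "acme:invoice";

    /**
     * Returns the text to hand to the indexer, or null when the node
     * carries the marker aspect. The extractor is only invoked for
     * nodes that should be indexed, so no PDF-to-text work is done
     * for invoices.
     */
    static String textForIndexing(Set<String> aspects, Supplier<String> pdfTextExtractor) {
        if (aspects.contains(NO_INDEX_ASPECT)) {
            return null; // invoice: skip extraction entirely
        }
        return pdfTextExtractor.get(); // any other PDF: extract as usual
    }
}
```

Calling `textForIndexing` with a set containing `acme:invoice` returns null without running the extractor, while any other set of aspects returns whatever the extractor produces. Passing the extractor as a `Supplier` is a design choice for the sketch: with up to 200,000 files in a run, you want to avoid the text extraction itself, not only discard its result.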