cancel
Showing results for 
Search instead for 
Did you mean: 

OCR pdf have some shift of selection bug

djarty
Star Contributor
Star Contributor

Hello!

Looks like Alfresco (Community 201707) have some trouble with searchable word selection into OCRed pdf files?

See picture.

See some shift of selection, not really under the real words and letters. Its unusable if need select and copy/paste some word from OCR pdf.

Its a bug (please confirm somebody) or can be tuned for normal look?

On picture also Chrome and Adobe Reader window with same file and normal selection directly under letters. So its looks like not OCR engine problem but Alfresco rendering.

10 REPLIES 10

djarty
Star Contributor
Star Contributor

Hello!

Ok, make some changes..  Now possible to use tesseract 4.00.00alpha under ocrmypdf or pdfsandwich

And selection in Alfresco now is little bit normal.

But, selection lost the spaces between the words (in Alfresco, in other viewers ok).

What can do with this thing?