<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: OCR pdf have some shift of selection bug in Alfresco Forum</title>
    <link>https://connect.hyland.com/t5/alfresco-forum/ocr-pdf-have-some-shift-of-selection-bug/m-p/55934#M20373</link>
    <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;As your screenshots show, this seems to be a problem with the PDF viewing component in Alfresco. The browser PDF components seem to align the selection better.&lt;BR /&gt;What OS and version of Alfresco are you running? Maybe it's a problem with the installed fonts...&lt;BR /&gt;&lt;BR /&gt;&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
    <pubDate>Fri, 26 Jan 2018 13:35:47 GMT</pubDate>
    <dc:creator>mehe</dc:creator>
    <dc:date>2018-01-26T13:35:47Z</dc:date>
    <item>
      <title>OCR pdf have some shift of selection bug</title>
      <link>https://connect.hyland.com/t5/alfresco-forum/ocr-pdf-have-some-shift-of-selection-bug/m-p/55932#M20371</link>
      <description>Hello! It looks like Alfresco (Community 201707) has some trouble with searchable word selection in OCRed PDF files. See the picture: the selection is shifted and does not sit under the actual words and letters. This makes it unusable when you need to select and copy/paste a word from an OCRed PDF. It's a bug (please confirm somebod</description>
      <pubDate>Fri, 26 Jan 2018 12:16:19 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-forum/ocr-pdf-have-some-shift-of-selection-bug/m-p/55932#M20371</guid>
      <dc:creator>djarty</dc:creator>
      <dc:date>2018-01-26T12:16:19Z</dc:date>
    </item>
    <item>
      <title>Re: OCR pdf have some shift of selection bug</title>
      <link>https://connect.hyland.com/t5/alfresco-forum/ocr-pdf-have-some-shift-of-selection-bug/m-p/55933#M20372</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;It's a bug in the tool you are using, not in Alfresco. For example, if you are using pdfsandwich to perform OCR, then it's a pdfsandwich bug. I'm not sure which tool you are using.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Basically, when you perform OCR, every tool extracts text from the PDF/image and places it behind what we see visually. So it often happens that the coordinates are slightly off.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;OCR also depends on the quality of the image. It depends entirely on the capability of the OCR tool.&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Fri, 26 Jan 2018 12:55:51 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-forum/ocr-pdf-have-some-shift-of-selection-bug/m-p/55933#M20372</guid>
      <dc:creator>krutik_jayswal</dc:creator>
      <dc:date>2018-01-26T12:55:51Z</dc:date>
    </item>
    <item>
      <title>Re: OCR pdf have some shift of selection bug</title>
      <link>https://connect.hyland.com/t5/alfresco-forum/ocr-pdf-have-some-shift-of-selection-bug/m-p/55934#M20373</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;As your screenshots show, this seems to be a problem with the PDF viewing component in Alfresco. The browser PDF components seem to align the selection better.&lt;BR /&gt;What OS and version of Alfresco are you running? Maybe it's a problem with the installed fonts...&lt;BR /&gt;&lt;BR /&gt;&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Fri, 26 Jan 2018 13:35:47 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-forum/ocr-pdf-have-some-shift-of-selection-bug/m-p/55934#M20373</guid>
      <dc:creator>mehe</dc:creator>
      <dc:date>2018-01-26T13:35:47Z</dc:date>
    </item>
    <item>
      <title>Re: OCR pdf have some shift of selection bug</title>
      <link>https://connect.hyland.com/t5/alfresco-forum/ocr-pdf-have-some-shift-of-selection-bug/m-p/55935#M20374</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;My first thought was also an OCR tool problem (yes, it's pdfsandwich). But I deliberately opened the same PDF in two other windows, one with the Chrome PDF rendering engine and one with Adobe Reader. As you can see, the shift appears only in the Alfresco Document Details view.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;I will try adding a clean PDF made with another OCR tool (if I can find one) or produced directly by a PDF maker. But I don't think the problem will go away.&amp;nbsp;&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Fri, 26 Jan 2018 14:06:30 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-forum/ocr-pdf-have-some-shift-of-selection-bug/m-p/55935#M20374</guid>
      <dc:creator>djarty</dc:creator>
      <dc:date>2018-01-26T14:06:30Z</dc:date>
    </item>
    <item>
      <title>Re: OCR pdf have some shift of selection bug</title>
      <link>https://connect.hyland.com/t5/alfresco-forum/ocr-pdf-have-some-shift-of-selection-bug/m-p/55936#M20375</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Can you upload the same PDF document to the forum so that I can check it on my system? Thank you.&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Fri, 26 Jan 2018 14:13:01 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-forum/ocr-pdf-have-some-shift-of-selection-bug/m-p/55936#M20375</guid>
      <dc:creator>krutik_jayswal</dc:creator>
      <dc:date>2018-01-26T14:13:01Z</dc:date>
    </item>
    <item>
      <title>Re: OCR pdf have some shift of selection bug</title>
      <link>https://connect.hyland.com/t5/alfresco-forum/ocr-pdf-have-some-shift-of-selection-bug/m-p/55937#M20376</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;&lt;SPAN style="color: #333333; background-color: #ffffff; font-size: 14px;"&gt;Community Edition 201707 GA&amp;nbsp; &amp;nbsp; Ubuntu 16.04 (server and client)&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN style="color: #333333; background-color: #ffffff; font-size: 14px;"&gt;Yes, the browser shows everything correctly.&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN style="color: #333333; background-color: #ffffff; font-size: 14px;"&gt;Adobe Reader: OK.&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN style="color: #333333; background-color: #ffffff; font-size: 14px;"&gt;The selection is hard to use: you visually select one word but actually get a word from the line above if the lines are closely spaced.&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;I have attached the OCRed PDF; please try it on another system, viewer, or Alfresco installation.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;P.S. I forgot: the same bad view appears in Opera (also the Chrome engine) in Alfresco on a Windows client.&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Fri, 26 Jan 2018 14:17:48 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-forum/ocr-pdf-have-some-shift-of-selection-bug/m-p/55937#M20376</guid>
      <dc:creator>djarty</dc:creator>
      <dc:date>2018-01-26T14:17:48Z</dc:date>
    </item>
    <item>
      <title>Re: OCR pdf have some shift of selection bug</title>
      <link>https://connect.hyland.com/t5/alfresco-forum/ocr-pdf-have-some-shift-of-selection-bug/m-p/55938#M20377</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Done. Thanks&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Fri, 26 Jan 2018 14:19:51 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-forum/ocr-pdf-have-some-shift-of-selection-bug/m-p/55938#M20377</guid>
      <dc:creator>djarty</dc:creator>
      <dc:date>2018-01-26T14:19:51Z</dc:date>
    </item>
    <item>
      <title>Re: OCR pdf have some shift of selection bug</title>
      <link>https://connect.hyland.com/t5/alfresco-forum/ocr-pdf-have-some-shift-of-selection-bug/m-p/55939#M20378</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;So, can anybody confirm the selection problem in the OCRed PDF? (TestOCR.pdf in the first message)&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Mon, 29 Jan 2018 06:00:45 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-forum/ocr-pdf-have-some-shift-of-selection-bug/m-p/55939#M20378</guid>
      <dc:creator>djarty</dc:creator>
      <dc:date>2018-01-29T06:00:45Z</dc:date>
    </item>
    <item>
      <title>Re: OCR pdf have some shift of selection bug</title>
      <link>https://connect.hyland.com/t5/alfresco-forum/ocr-pdf-have-some-shift-of-selection-bug/m-p/55940#M20379</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Yes,&lt;/P&gt;&lt;P&gt;I also see the "offset" problem when selecting text in your PDF.&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Mon, 29 Jan 2018 06:06:06 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-forum/ocr-pdf-have-some-shift-of-selection-bug/m-p/55940#M20379</guid>
      <dc:creator>mehe</dc:creator>
      <dc:date>2018-01-29T06:06:06Z</dc:date>
    </item>
    <item>
      <title>Re: OCR pdf have some shift of selection bug</title>
      <link>https://connect.hyland.com/t5/alfresco-forum/ocr-pdf-have-some-shift-of-selection-bug/m-p/55941#M20380</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;OK, thank you, that covers "my PDF". What about another, "correct" PDF? Maybe you have one?&amp;nbsp;&lt;/P&gt;&lt;P&gt;I just need to establish whether it's a global or a local problem.&amp;nbsp;&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Mon, 29 Jan 2018 09:33:47 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-forum/ocr-pdf-have-some-shift-of-selection-bug/m-p/55941#M20380</guid>
      <dc:creator>djarty</dc:creator>
      <dc:date>2018-01-29T09:33:47Z</dc:date>
    </item>
    <item>
      <title>Re: OCR pdf have some shift of selection bug</title>
      <link>https://connect.hyland.com/t5/alfresco-forum/ocr-pdf-have-some-shift-of-selection-bug/m-p/55942#M20381</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Hello!&lt;/P&gt;&lt;P&gt;OK, I made some changes.&amp;nbsp; It is now possible to use&amp;nbsp;&lt;STRONG&gt;tesseract 4.00.00alpha&lt;/STRONG&gt; under ocrmypdf or pdfsandwich.&lt;/P&gt;&lt;P&gt;And selection in Alfresco is now closer to normal.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;But the selection loses the spaces between words&amp;nbsp;&lt;SPAN&gt;(in Alfresco; in other viewers it is OK)&lt;/SPAN&gt;.&lt;/P&gt;&lt;P&gt;What can be done about this?&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Wed, 07 Feb 2018 13:29:21 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-forum/ocr-pdf-have-some-shift-of-selection-bug/m-p/55942#M20381</guid>
      <dc:creator>djarty</dc:creator>
      <dc:date>2018-02-07T13:29:21Z</dc:date>
    </item>
  </channel>
</rss>

