<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Does Alfresco provide checksum for managed content to aid detecting duplicates? in Alfresco Forum</title>
    <link>https://connect.hyland.com/t5/alfresco-forum/does-alfresco-provide-checksum-for-managed-content-to-aid/m-p/50205#M19234</link>
    <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;I am not finding much documentation on fingerprinting of images and other media content. Any idea whether it was designed mainly for text content?&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
    <pubDate>Mon, 15 Oct 2018 21:09:03 GMT</pubDate>
    <dc:creator>kbala</dc:creator>
    <dc:date>2018-10-15T21:09:03Z</dc:date>
    <item>
      <title>Does Alfresco provide checksum for managed content to aid detecting duplicates?</title>
      <link>https://connect.hyland.com/t5/alfresco-forum/does-alfresco-provide-checksum-for-managed-content-to-aid/m-p/50201#M19230</link>
      <description>I am looking to save a checksum for managed content. We have multiple sources that save images to Alfresco and, unfortunately, we end up storing a lot of duplicates. I am looking into ways to alleviate the problem.</description>
      <pubDate>Fri, 12 Oct 2018 18:48:48 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-forum/does-alfresco-provide-checksum-for-managed-content-to-aid/m-p/50201#M19230</guid>
      <dc:creator>kbala</dc:creator>
      <dc:date>2018-10-12T18:48:48Z</dc:date>
    </item>
    <item>
      <title>Re: Does Alfresco provide checksum for managed content to aid detecting duplicates?</title>
      <link>https://connect.hyland.com/t5/alfresco-forum/does-alfresco-provide-checksum-for-managed-content-to-aid/m-p/50202#M19231</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Not out of the box, but you can add it easily. I did something similar for another client. You can create a behavior that computes a hash on the content stream every time it is updated, and store that hash as a property on the content. Then, finding duplicates is just a matter of running a search for all documents that have that same hash value.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;I think I saw that version 6.x added something related to checksums but I have not investigated to see if it is similar to what I describe above.&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Fri, 12 Oct 2018 20:07:44 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-forum/does-alfresco-provide-checksum-for-managed-content-to-aid/m-p/50202#M19231</guid>
      <dc:creator>jpotts</dc:creator>
      <dc:date>2018-10-12T20:07:44Z</dc:date>
    </item>
    <item>
      <title>Re: Does Alfresco provide checksum for managed content to aid detecting duplicates?</title>
      <link>https://connect.hyland.com/t5/alfresco-forum/does-alfresco-provide-checksum-for-managed-content-to-aid/m-p/50203#M19232</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Jeff is right when he mentions "something related": in the newer versions you have document fingerprinting.&lt;/P&gt;&lt;P&gt;&lt;A class="link-titled" href="https://docs.alfresco.com/5.2/concepts/fingerprinting.html" title="https://docs.alfresco.com/5.2/concepts/fingerprinting.html" rel="nofollow noopener noreferrer"&gt;Document Fingerprints | Alfresco Documentation&lt;/A&gt;&lt;/P&gt;&lt;P&gt;You can also find related documents with fingerprinting.&lt;/P&gt;&lt;P&gt;I first saw it in a Tech Talk Live session, and there is, once again, an excellent article by &lt;B&gt;Andy Hind&lt;/B&gt; about document fingerprints.&lt;/P&gt;&lt;P&gt;&lt;A href="https://community.alfresco.com/people/andy1/blog/2017/05/12/document-fingerprints" rel="nofollow noopener noreferrer"&gt;https://community.alfresco.com/people/andy1/blog/2017/05/12/document-fingerprints&lt;/A&gt;&lt;/P&gt;&lt;P&gt;Maybe this helps...&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Sat, 13 Oct 2018 16:34:17 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-forum/does-alfresco-provide-checksum-for-managed-content-to-aid/m-p/50203#M19232</guid>
      <dc:creator>mehe</dc:creator>
      <dc:date>2018-10-13T16:34:17Z</dc:date>
    </item>
    <item>
      <title>Re: Does Alfresco provide checksum for managed content to aid detecting duplicates?</title>
      <link>https://connect.hyland.com/t5/alfresco-forum/does-alfresco-provide-checksum-for-managed-content-to-aid/m-p/50204#M19233</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Thank you. Before implementing something ourselves to save the hashes, I wanted to see whether Alfresco had something to offer, so that we would not reinvent the wheel. It looks like we have v5.2, and I am not sure whether an upgrade is pending, so we might not be able to use the Document Fingerprint option yet.&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Mon, 15 Oct 2018 20:42:07 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-forum/does-alfresco-provide-checksum-for-managed-content-to-aid/m-p/50204#M19233</guid>
      <dc:creator>kbala</dc:creator>
      <dc:date>2018-10-15T20:42:07Z</dc:date>
    </item>
    <item>
      <title>Re: Does Alfresco provide checksum for managed content to aid detecting duplicates?</title>
      <link>https://connect.hyland.com/t5/alfresco-forum/does-alfresco-provide-checksum-for-managed-content-to-aid/m-p/50205#M19234</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;I am not finding much documentation on fingerprinting of images and other media content. Any idea whether it was designed mainly for text content?&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Mon, 15 Oct 2018 21:09:03 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-forum/does-alfresco-provide-checksum-for-managed-content-to-aid/m-p/50205#M19234</guid>
      <dc:creator>kbala</dc:creator>
      <dc:date>2018-10-15T21:09:03Z</dc:date>
    </item>
    <item>
      <title>Re: Does Alfresco provide checksum for managed content to aid detecting duplicates?</title>
      <link>https://connect.hyland.com/t5/alfresco-forum/does-alfresco-provide-checksum-for-managed-content-to-aid/m-p/50206#M19235</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Hi&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Fingerprinting was designed for text only.&lt;/P&gt;&lt;P&gt;If you can turn your image into a text representation, then you can use it.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Andy&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Wed, 14 Nov 2018 13:55:17 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-forum/does-alfresco-provide-checksum-for-managed-content-to-aid/m-p/50206#M19235</guid>
      <dc:creator>andy1</dc:creator>
      <dc:date>2018-11-14T13:55:17Z</dc:date>
    </item>
  </channel>
</rss>

