<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: UnsupportedTranformationException in Alfresco Forum</title>
    <link>https://connect.hyland.com/t5/alfresco-forum/unsupportedtranformationexception/m-p/4285#M1851</link>
    <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;It could be considered a bug for the PDF to HTML transformer that it produces XHMTL instead, correct. Unfortunately I don't know which converter is responsible for that (no one usually ever does this type of conversion). If you have the &lt;A href="https://github.com/OrderOfTheBee/ootbee-support-tools" rel="nofollow noopener noreferrer"&gt;OOTBee Support Tools&lt;/A&gt; addon installed, you can use its Admin Console tool for transformations to find out which transformer is being used.&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
    <pubDate>Mon, 30 Jan 2017 06:49:04 GMT</pubDate>
    <dc:creator>afaust</dc:creator>
    <dc:date>2017-01-30T06:49:04Z</dc:date>
    <item>
      <title>UnsupportedTranformationException</title>
      <link>https://connect.hyland.com/t5/alfresco-forum/unsupportedtranformationexception/m-p/4278#M1844</link>
      <description>Hi,I have an issue with the transformation from PDF to HTML.Here is the Rule:https://i.imgur.com/4uFsKZr.png"Caused by: org.alfresco.repo.content.transform.UnsupportedTransformationException: 00270053 Transformation of (pdfdocname.html) has not taken place because thedeclared mimetype (text/html) do</description>
      <pubDate>Fri, 27 Jan 2017 19:21:03 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-forum/unsupportedtranformationexception/m-p/4278#M1844</guid>
      <dc:creator>ksh</dc:creator>
      <dc:date>2017-01-27T19:21:03Z</dc:date>
    </item>
    <item>
      <title>Re: UnsupportedTranformationException</title>
      <link>https://connect.hyland.com/t5/alfresco-forum/unsupportedtranformationexception/m-p/4279#M1845</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;I really am not sure that this message relates directly to the rule you have included in the screenshot. In the error message it complains about a transformation that is supposed to be from some HTML file to some target, where the source file is defined as text/html while Alfresco / TIKA detects it as being application/xhtml+xml. Unfortunately due to the various versions of (X)HTML, file naming and sloppy content structures, HTML files can easily be misclassified as XHTML and vice versa.&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Sat, 28 Jan 2017 13:59:21 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-forum/unsupportedtranformationexception/m-p/4279#M1845</guid>
      <dc:creator>afaust</dc:creator>
      <dc:date>2017-01-28T13:59:21Z</dc:date>
    </item>
    <item>
      <title>Re: UnsupportedTranformationException</title>
      <link>https://connect.hyland.com/t5/alfresco-forum/unsupportedtranformationexception/m-p/4280#M1846</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;&lt;STRONG&gt;UPDATE:&lt;/STRONG&gt; I just figured that &lt;EM&gt;some&lt;/EM&gt; PDFs are correctly converted &lt;SPAN style="text-decoration: underline;"&gt;BUT&lt;/SPAN&gt;&amp;nbsp;cannot be displayed&amp;nbsp;most likely due to the wrong mimetype detection:&lt;/P&gt;&lt;BLOCKQUOTE class="jive_macro_quote jive-quote jive_text_macro"&gt;&lt;P class=""&gt;&lt;SPAN class=""&gt;$ file 2026140_v\=1.html &lt;/SPAN&gt;&lt;/P&gt;&lt;P class=""&gt;&lt;SPAN class=""&gt;2026140_v=1.html: HTML document text, UTF-8 Unicode text&lt;/SPAN&gt;&lt;/P&gt;&lt;/BLOCKQUOTE&gt;&lt;P&gt;This file actually contains readable PDF text. However, if I do a&amp;nbsp;OCRd version conversion the HTML output is 1K, aka zero.&lt;/P&gt;&lt;P&gt;This implies another &amp;nbsp;problem.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Hi Axel,&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;this error message appears once I click on the generated HTML file from the PDF in the "Transformed" folder.&amp;nbsp;However, I have no transformation rule in that folder to further convert the HTML hence this is what should have already happened before. So the HTML is only &lt;STRONG&gt;1K&lt;/STRONG&gt; of size. I&amp;nbsp;even tried to disable the transformation check with&lt;/P&gt;&lt;BLOCKQUOTE class="jive_macro_quote jive-quote jive_text_macro"&gt;&lt;P&gt;&lt;SPAN class=""&gt;transformer.strict.mimetype.check=false&lt;/SPAN&gt;&lt;/P&gt;&lt;/BLOCKQUOTE&gt;&lt;P&gt;which didn't help either. This is what appears when trying to load along with the error message above:&lt;/P&gt;&lt;P&gt;&lt;IMG __jive_id="12212" class="image-1 jive-image" src="https://connect.hyland.com/legacyfs/online/alfresco/12212_pastedImage_2.png" /&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Is there a chance that the generated HTML from the PDF transforms into a (X)HTML und thus is misclassified?&lt;/P&gt;&lt;P&gt;Furthermore, editing doesn't&amp;nbsp;show any text&amp;nbsp;either.&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Sun, 29 Jan 2017 11:21:49 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-forum/unsupportedtranformationexception/m-p/4280#M1846</guid>
      <dc:creator>ksh</dc:creator>
      <dc:date>2017-01-29T11:21:49Z</dc:date>
    </item>
    <item>
      <title>Re: UnsupportedTranformationException</title>
      <link>https://connect.hyland.com/t5/alfresco-forum/unsupportedtranformationexception/m-p/4281#M1847</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;I would not classify 1K as zero.&lt;/P&gt;&lt;P&gt;But it looks like the conversion of PDF to HTML results in a XHTML classified file that only has the HTML mimetype associated with it. The mimetype handling that treats HTML different from XHTML might be a bit pedantic here, since most people likely aren't even aware of the differences and would throw these types into the same basket.&lt;/P&gt;&lt;P&gt;Disabling the strict mimetype check should have worked - there is a very direct check of this setting in the codebase.&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Sun, 29 Jan 2017 14:43:04 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-forum/unsupportedtranformationexception/m-p/4281#M1847</guid>
      <dc:creator>afaust</dc:creator>
      <dc:date>2017-01-29T14:43:04Z</dc:date>
    </item>
    <item>
      <title>Re: UnsupportedTranformationException</title>
      <link>https://connect.hyland.com/t5/alfresco-forum/unsupportedtranformationexception/m-p/4282#M1848</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;So i've been digging and found this:&lt;/P&gt;&lt;P&gt;&lt;IMG class="image-1 jive-image" src="https://connect.hyland.com/legacyfs/online/alfresco/12213_pastedImage_1.png" /&gt;&lt;/P&gt;&lt;P&gt;XHTML produces at its head:&lt;/P&gt;&lt;BLOCKQUOTE class="jive_macro_quote jive-quote jive_text_macro"&gt;&lt;P&gt;&lt;SPAN&gt;&amp;lt;?xml version="1.0" encoding="UTF-8"?&amp;gt;&amp;lt;html xmlns="&lt;/SPAN&gt;&lt;A class="jive-link-external-small" href="http://www.w3.org/1999/xhtml" rel="nofollow noopener noreferrer" target="_blank"&gt;http://www.w3.org/1999/xhtml&lt;/A&gt;&lt;SPAN&gt;"&amp;gt;&lt;/SPAN&gt;&lt;/P&gt;&lt;/BLOCKQUOTE&gt;&lt;P&gt;It seems that the mimetype check is broken as well.&lt;/P&gt;&lt;BLOCKQUOTE class="jive_macro_quote jive-quote jive_text_macro"&gt;&lt;P class=""&gt;&lt;SPAN class=""&gt;$ file 2026140_v\=1.*&lt;/SPAN&gt;&lt;/P&gt;&lt;P class=""&gt;&lt;SPAN class=""&gt;2026140_v=1.html:&lt;SPAN class=""&gt;&amp;nbsp; &lt;/SPAN&gt;HTML document text, UTF-8 Unicode text&lt;/SPAN&gt;&lt;/P&gt;&lt;P class=""&gt;&lt;SPAN class=""&gt;2026140_v=1.xhtml: XML 1.0 document text, UTF-8 Unicode text&lt;/SPAN&gt;&lt;/P&gt;&lt;/BLOCKQUOTE&gt;&lt;P&gt;Nevertheless, this is implies an old error that XHTML cannot be previewed, which I found here:&lt;/P&gt;&lt;P&gt;&lt;A href="https://issues.alfresco.com/jira/browse/ALF-18696" rel="nofollow noopener noreferrer"&gt;https://issues.alfresco.com/jira/browse/ALF-18696&lt;/A&gt;&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Sun, 29 Jan 2017 15:57:10 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-forum/unsupportedtranformationexception/m-p/4282#M1848</guid>
      <dc:creator>ksh</dc:creator>
      <dc:date>2017-01-29T15:57:10Z</dc:date>
    </item>
    <item>
      <title>Re: UnsupportedTranformationException</title>
      <link>https://connect.hyland.com/t5/alfresco-forum/unsupportedtranformationexception/m-p/4283#M1849</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Yeah - this was part of the issue in ALF-18696 but as far as I can see from the comments they just fixed the issue that affected the wiki display of XHTML, and not the transformation of XHTML to a previewable PDF document. So that part of ALF-18696 is still very much an "unresolved issue".&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;The mimetype check in Alfresco is slightly more elaborate than what the command line "file" utility does. It will actually check the content and not the file name. The check for XHTML files does multiple different checks, one is to check for a "xmlns" attribute on the "html" tag. In the case of your file it is present in the "&lt;SPAN class=""&gt;2026140_v=1.html&lt;/SPAN&gt;" so the mimetype check treats it as XHTML, while the original mimetype (in the content URL) was determined by the PDF-to-HTML conversion. That conversion should actually set XHTML as the mimetype (and extension) so that there is no mismatch. That would then only leave the only remaining issue: lack of XHTML preview support in Alfresco.&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Sun, 29 Jan 2017 22:23:29 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-forum/unsupportedtranformationexception/m-p/4283#M1849</guid>
      <dc:creator>afaust</dc:creator>
      <dc:date>2017-01-29T22:23:29Z</dc:date>
    </item>
    <item>
      <title>Re: UnsupportedTranformationException</title>
      <link>https://connect.hyland.com/t5/alfresco-forum/unsupportedtranformationexception/m-p/4284#M1850</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;I partially agree up to the point that I wanted PDF-to-HTML conversion and not PDF-to-XHTML. XHTML would be fine but for further processing XHTML to DOC is not supported.&amp;nbsp;Should this be filed as a bug?&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Mon, 30 Jan 2017 00:39:39 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-forum/unsupportedtranformationexception/m-p/4284#M1850</guid>
      <dc:creator>ksh</dc:creator>
      <dc:date>2017-01-30T00:39:39Z</dc:date>
    </item>
    <item>
      <title>Re: UnsupportedTranformationException</title>
      <link>https://connect.hyland.com/t5/alfresco-forum/unsupportedtranformationexception/m-p/4285#M1851</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;It could be considered a bug for the PDF to HTML transformer that it produces XHMTL instead, correct. Unfortunately I don't know which converter is responsible for that (no one usually ever does this type of conversion). If you have the &lt;A href="https://github.com/OrderOfTheBee/ootbee-support-tools" rel="nofollow noopener noreferrer"&gt;OOTBee Support Tools&lt;/A&gt; addon installed, you can use its Admin Console tool for transformations to find out which transformer is being used.&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Mon, 30 Jan 2017 06:49:04 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-forum/unsupportedtranformationexception/m-p/4285#M1851</guid>
      <dc:creator>afaust</dc:creator>
      <dc:date>2017-01-30T06:49:04Z</dc:date>
    </item>
    <item>
      <title>Re: UnsupportedTranformationException</title>
      <link>https://connect.hyland.com/t5/alfresco-forum/unsupportedtranformationexception/m-p/4286#M1852</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;ok. let me check if I find the amp file for the OOTBee tools. Thx!&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Mon, 30 Jan 2017 10:43:10 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-forum/unsupportedtranformationexception/m-p/4286#M1852</guid>
      <dc:creator>ksh</dc:creator>
      <dc:date>2017-01-30T10:43:10Z</dc:date>
    </item>
  </channel>
</rss>

