<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic XML extration fails on files with DOCTYPE line in Alfresco Archive</title>
    <link>https://connect.hyland.com/t5/alfresco-archive/xml-extration-fails-on-files-with-doctype-line/m-p/249459#M202589</link>
    <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;SPAN&gt;Hello,&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;XML extration works fine on files that do not have a DOCTYPE line like e.g.&lt;/SPAN&gt;&lt;BR /&gt;&lt;PRE class="language-none line-numbers"&gt;&lt;CODE&gt;&amp;lt;!DOCTYPE beans PUBLIC '-//SPRING//DTD BEAN//EN' '&lt;A href="http://www.springframework.org/dtd/spring-beans.dtd" rel="nofollow noopener noreferrer"&gt;http://www.springframework.org/dtd/spring-beans.dtd&lt;/A&gt;'&amp;gt;&lt;SPAN class="line-numbers-rows"&gt;&lt;SPAN&gt;‍&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/CODE&gt;&lt;/PRE&gt;&lt;BR /&gt;&lt;SPAN&gt;As soon as I trigger the metadata extration of a XML file with such a line, the extration doesn't work any longer. In the log I just get the message:&lt;/SPAN&gt;&lt;BR /&gt;&lt;BLOCKQUOTE class="jive-quote"&gt;DEBUG [org.alfresco.repo.content.metadata.MetadataExtracterRegistry] Finding extractors for text/xml&lt;BR /&gt;DEBUG [org.alfresco.repo.content.metadata.xml.XPathMetadataExtracter]&lt;BR /&gt;No working metadata extractor could be found:&lt;/BLOCKQUOTE&gt;&lt;BR /&gt;&lt;SPAN&gt;I played around with the reference to the DTD file (e.g. '&lt;/SPAN&gt;&lt;A href="http://www.springframework.org/dtd/spring-beans.dtd" rel="nofollow noopener noreferrer"&gt;http://www.springframework.org/dtd/spring-beans.dtd&lt;/A&gt;&lt;SPAN&gt;'), e.g. replacing it by just the filename and saving the DTD file in the same directory as the XML file - unfortunately nothing has helped so far&amp;nbsp; &lt;img id="smileysad" class="emoticon emoticon-smileysad" src="https://connect.hyland.com/i/smilies/16x16_smiley-sad.png" alt="Smiley Sad" title="Smiley Sad" /&gt; &lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;Any idea?&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;Is this a bug or a feature?&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;I'm running Alfresco 3.4b on a Ubuntu 8.04 server.&lt;/SPAN&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
    <pubDate>Wed, 19 Jan 2011 09:13:18 GMT</pubDate>
    <dc:creator>chriscms</dc:creator>
    <dc:date>2011-01-19T09:13:18Z</dc:date>
    <item>
      <title>XML extration fails on files with DOCTYPE line</title>
      <link>https://connect.hyland.com/t5/alfresco-archive/xml-extration-fails-on-files-with-doctype-line/m-p/249459#M202589</link>
      <description>Hello,XML extration works fine on files that do not have a DOCTYPE line like e.g.&amp;lt;!DOCTYPE beans PUBLIC '-//SPRING//DTD BEAN//EN' 'http://www.springframework.org/dtd/spring-beans.dtd'&amp;gt;‍As soon as I trigger the metadata extration of a XML file with such a line, the extration doesn't work any lo</description>
      <pubDate>Wed, 19 Jan 2011 09:13:18 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-archive/xml-extration-fails-on-files-with-doctype-line/m-p/249459#M202589</guid>
      <dc:creator>chriscms</dc:creator>
      <dc:date>2011-01-19T09:13:18Z</dc:date>
    </item>
    <item>
      <title>Re: XML extration fails on files with DOCTYPE line</title>
      <link>https://connect.hyland.com/t5/alfresco-archive/xml-extration-fails-on-files-with-doctype-line/m-p/249460#M202590</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;SPAN&gt;Is there really nobody having this problem? Or any idea what might cause the problem?&lt;/SPAN&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Tue, 25 Jan 2011 13:03:21 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-archive/xml-extration-fails-on-files-with-doctype-line/m-p/249460#M202590</guid>
      <dc:creator>chriscms</dc:creator>
      <dc:date>2011-01-25T13:03:21Z</dc:date>
    </item>
    <item>
      <title>Re: XML extration fails on files with DOCTYPE line</title>
      <link>https://connect.hyland.com/t5/alfresco-archive/xml-extration-fails-on-files-with-doctype-line/m-p/249461#M202591</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;SPAN&gt;After I opened a ticket it turned out to be really an error. It has been fixed now and will be delivered with one of the next releases.&lt;/SPAN&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Tue, 01 Mar 2011 12:30:16 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-archive/xml-extration-fails-on-files-with-doctype-line/m-p/249461#M202591</guid>
      <dc:creator>chriscms</dc:creator>
      <dc:date>2011-03-01T12:30:16Z</dc:date>
    </item>
  </channel>
</rss>

