<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Info required on Word (docx) parsing in Alfresco Archive</title>
    <link>https://connect.hyland.com/t5/alfresco-archive/info-required-on-word-docx-parsing/m-p/282661#M235791</link>
    <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;SPAN&gt;Hi&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;Can you please let me know whether Alfresco provides utility classes to parse word document (.docx). &lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;We have a requirement wherein users need to&amp;nbsp; upload content ( as per given templates). Once uploaded, the logic needs to be parse the document and display in UI ( we need to map specific fields in UI to the document).&amp;nbsp; Also the contents can be edited and needs to be written back to the document so that the master copy can be downloaded anytime.&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;Appreciate your help to know whether Alfresco provides apis for the above.&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;Ram&lt;/SPAN&gt;&lt;BR /&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
    <pubDate>Thu, 03 Apr 2014 06:56:32 GMT</pubDate>
    <dc:creator>ram</dc:creator>
    <dc:date>2014-04-03T06:56:32Z</dc:date>
    <item>
      <title>Info required on Word (docx) parsing</title>
      <link>https://connect.hyland.com/t5/alfresco-archive/info-required-on-word-docx-parsing/m-p/282661#M235791</link>
      <description>HiCan you please let me know whether Alfresco provides utility classes to parse word document (.docx). We have a requirement wherein users need to&amp;nbsp; upload content ( as per given templates). Once uploaded, the logic needs to be parse the document and display in UI ( we need to map specific fields in</description>
      <pubDate>Thu, 03 Apr 2014 06:56:32 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-archive/info-required-on-word-docx-parsing/m-p/282661#M235791</guid>
      <dc:creator>ram</dc:creator>
      <dc:date>2014-04-03T06:56:32Z</dc:date>
    </item>
    <item>
      <title>Re: Info required on Word (docx) parsing</title>
      <link>https://connect.hyland.com/t5/alfresco-archive/info-required-on-word-docx-parsing/m-p/282662#M235792</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;SPAN&gt;Hello,&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;Alfresco includes libraries such as Apache Tika, which are able to extract data from documents such as the metadata / custom XML in Office documents. As far as I know, Apache Tika also provides components to embed data back into documents although I am not sure if the necessary Tika version is already included in Alfresco or if this covers Office metadata / custom XML yet.&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;As an open platform, you can always add other libraries such as docx4j, which we use in several customer projects to read and generate Office (OOXML) documents.&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;Regards&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;Axel&lt;/SPAN&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Thu, 03 Apr 2014 08:14:38 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-archive/info-required-on-word-docx-parsing/m-p/282662#M235792</guid>
      <dc:creator>afaust</dc:creator>
      <dc:date>2014-04-03T08:14:38Z</dc:date>
    </item>
  </channel>
</rss>

