<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Need opinions on data model in Alfresco Archive</title>
    <link>https://connect.hyland.com/t5/alfresco-archive/need-opinions-on-data-model/m-p/249479#M202609</link>
    <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;SPAN&gt;Hi.&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;I'm designing a data model with following requirements:&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;1) Documents should be associated to a "person", that in Alfresco will by just a numerical ID. All other infos are stored in an external DB.&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;2) Some docs can be aassociated to more than one person.&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;3) I will need to manage 3-4 millions of docs&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;4) Documents associciated to a single person usually are below 100&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;5) I need to retireve documents just for a fixed person (ID).&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;6) Documents have other properties (like date, category, …) that I eventually use for stats, not for search.&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;I think the straight way to model it is a multi-value property on docs, but I have some concerns about performance.&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;Without requirement 2, a good trick&amp;nbsp; would have been to store contents in a folder structure where all docs of a single person are filed on same folder: in that a case a path search would have make all the work with no concerns on performance…&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;Any better solution? &lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;Thanks,&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;Cesco&lt;/SPAN&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
    <pubDate>Fri, 24 Jun 2011 17:02:50 GMT</pubDate>
    <dc:creator>cesco75</dc:creator>
    <dc:date>2011-06-24T17:02:50Z</dc:date>
    <item>
      <title>Need opinions on data model</title>
      <link>https://connect.hyland.com/t5/alfresco-archive/need-opinions-on-data-model/m-p/249479#M202609</link>
      <description>Hi.I'm designing a data model with following requirements:1) Documents should be associated to a "person", that in Alfresco will by just a numerical ID. All other infos are stored in an external DB.2) Some docs can be aassociated to more than one person.3) I will need to manage 3-4 millions of docs4</description>
      <pubDate>Fri, 24 Jun 2011 17:02:50 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-archive/need-opinions-on-data-model/m-p/249479#M202609</guid>
      <dc:creator>cesco75</dc:creator>
      <dc:date>2011-06-24T17:02:50Z</dc:date>
    </item>
    <item>
      <title>Re: Need opinions on data model</title>
      <link>https://connect.hyland.com/t5/alfresco-archive/need-opinions-on-data-model/m-p/249480#M202610</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;SPAN&gt;I see three options:&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;1. Use a multi-value property that stores the person identifier (as you propose).&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;2. Use an association between the document object and an object representing the person.&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;3. Use a multi-value property on an object representing the person that stores the node refs of the docs.&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;The multi-value property approach will be more efficient from a search standpoint. Plus, it doesn't sound like you are storing anything about a person other than their identifier, so no need to waste an object on a person, which rules out options 2 and 3.&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;You said that a give person will only have about 100 documents related to them and that a given document could be related to more than one person. But how many people might a specific document be related to?&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;Jeff&lt;/SPAN&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Tue, 28 Jun 2011 16:37:35 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-archive/need-opinions-on-data-model/m-p/249480#M202610</guid>
      <dc:creator>jpotts</dc:creator>
      <dc:date>2011-06-28T16:37:35Z</dc:date>
    </item>
    <item>
      <title>Re: Need opinions on data model</title>
      <link>https://connect.hyland.com/t5/alfresco-archive/need-opinions-on-data-model/m-p/249481#M202611</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;SPAN&gt;Sorry I did't answer before and thanks for your suggestions:&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;BLOCKQUOTE class="jive-quote"&gt;You said that a give person will only have about 100 documents related to them and that a given document could be related to more than one person. But how many people might a specific document be related to?&lt;BR /&gt;Jeff&lt;/BLOCKQUOTE&gt;&lt;BR /&gt;&lt;SPAN&gt;98% of times 1 person is related to just one document but sometimes 1 document is related to a large number of people: tens to thousands in my current understanding&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;Cesco&lt;/SPAN&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Wed, 20 Jul 2011 13:34:22 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-archive/need-opinions-on-data-model/m-p/249481#M202611</guid>
      <dc:creator>cesco75</dc:creator>
      <dc:date>2011-07-20T13:34:22Z</dc:date>
    </item>
    <item>
      <title>Re: Need opinions on data model</title>
      <link>https://connect.hyland.com/t5/alfresco-archive/need-opinions-on-data-model/m-p/249482#M202612</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;SPAN&gt;Your multi-valued field containing tens to thousands should be O.K.&amp;nbsp;&amp;nbsp; But if it goes much above that I'd question the requirements.&amp;nbsp; In addition there is a Lucene limit of 1000 docs for a search.&amp;nbsp;&amp;nbsp; I'd not like to see 10,000 or more values in a multi-valued field. &lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;Where large numbers of users are associated with a document tends to be because they have a role rather than individual responsibility.&amp;nbsp; You could perhaps model that through group membership.&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;I'd be tempted to try to model the relationship with assocs (Jeff's option 2).&amp;nbsp;&amp;nbsp; Just my preference &lt;img id="smileyhappy" class="emoticon emoticon-smileyhappy" src="https://connect.hyland.com/i/smilies/16x16_smiley-happy.png" alt="Smiley Happy" title="Smiley Happy" /&gt;&lt;/SPAN&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Wed, 20 Jul 2011 13:55:50 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-archive/need-opinions-on-data-model/m-p/249482#M202612</guid>
      <dc:creator>mrogers</dc:creator>
      <dc:date>2011-07-20T13:55:50Z</dc:date>
    </item>
    <item>
      <title>Re: Need opinions on data model</title>
      <link>https://connect.hyland.com/t5/alfresco-archive/need-opinions-on-data-model/m-p/249483#M202613</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;SPAN&gt;Hi,&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;How did you solve this finally? I have an exact same requirement. We are building a Policy Management site in Alfresco. Each policy document needs to know who all agreed to that policy, so that when a user logs in, he will only see the pending policies for him. &lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;The no&lt;img id="smileysurprised" class="emoticon emoticon-smileysurprised" src="https://connect.hyland.com/i/smilies/16x16_smiley-surprised.png" alt="Smiley Surprised" title="Smiley Surprised" /&gt;f users who agreed to a policy could be equvilaent to the no&lt;img id="smileysurprised" class="emoticon emoticon-smileysurprised" src="https://connect.hyland.com/i/smilies/16x16_smiley-surprised.png" alt="Smiley Surprised" title="Smiley Surprised" /&gt;f employees in the company which would be 5K to 10K. From a performance stand point, whats the best option multi-valued property or an association between person object? Another suggestion I got the from the forum was to use Preferences service to store the list of policies agreed by a user but it has some cons. &lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;Thanks&lt;/SPAN&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Fri, 17 Aug 2012 21:54:13 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-archive/need-opinions-on-data-model/m-p/249483#M202613</guid>
      <dc:creator>unknown-user</dc:creator>
      <dc:date>2012-08-17T21:54:13Z</dc:date>
    </item>
  </channel>
</rss>

