<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic How to automatically get data from alfresco and store in vector database like Chromadb in Alfresco Forum</title>
    <link>https://connect.hyland.com/t5/alfresco-forum/how-to-automatically-get-data-from-alfresco-and-store-in-vector/m-p/490498#M40215</link>
    <description>&lt;P&gt;Hi, I have to build a chatbot that lets users upload and search files in Alfresco using an LLM. I need some help.&lt;/P&gt;&lt;P&gt;I'm currently proposing the architecture below.&lt;/P&gt;&lt;P&gt;I want to use Angular for the frontend and Spring Boot for the backend. I would then use an MCP Server (Python SDK version) to connect to Alfresco as a data source, so I can upload a file and send it from Spring Boot through the MCP Server to Alfresco. After storing files in Alfresco, I have to vectorize them and store the vectors in a vector database like ChromaDB, which would allow me to search using the expression provided from the frontend.&lt;/P&gt;&lt;P&gt;I use Ollama to run the LLM locally. I also use LangChain with Spring Boot to implement the LLM API.&lt;/P&gt;&lt;P&gt;I don't know yet how to automatically store files from Alfresco in ChromaDB.&lt;/P&gt;&lt;P&gt;So I would like to know whether this is the right approach, because the main purpose is to search for an expression contained in a file stored in a data source like Alfresco and ChromaDB.&lt;/P&gt;&lt;P&gt;Thanks!&lt;/P&gt;</description>
    <pubDate>Sun, 08 Jun 2025 11:27:52 GMT</pubDate>
    <dc:creator>sanogo</dc:creator>
    <dc:date>2025-06-08T11:27:52Z</dc:date>
    <item>
      <title>How to automatically get data from alfresco and store in vector database like Chromadb</title>
      <link>https://connect.hyland.com/t5/alfresco-forum/how-to-automatically-get-data-from-alfresco-and-store-in-vector/m-p/490498#M40215</link>
      <description>&lt;P&gt;Hi, I have to build a chatbot that lets users upload and search files in Alfresco using an LLM. I need some help.&lt;/P&gt;&lt;P&gt;I'm currently proposing the architecture below.&lt;/P&gt;&lt;P&gt;I want to use Angular for the frontend and Spring Boot for the backend. I would then use an MCP Server (Python SDK version) to connect to Alfresco as a data source, so I can upload a file and send it from Spring Boot through the MCP Server to Alfresco. After storing files in Alfresco, I have to vectorize them and store the vectors in a vector database like ChromaDB, which would allow me to search using the expression provided from the frontend.&lt;/P&gt;&lt;P&gt;I use Ollama to run the LLM locally. I also use LangChain with Spring Boot to implement the LLM API.&lt;/P&gt;&lt;P&gt;I don't know yet how to automatically store files from Alfresco in ChromaDB.&lt;/P&gt;&lt;P&gt;So I would like to know whether this is the right approach, because the main purpose is to search for an expression contained in a file stored in a data source like Alfresco and ChromaDB.&lt;/P&gt;&lt;P&gt;Thanks!&lt;/P&gt;</description>
      <pubDate>Sun, 08 Jun 2025 11:27:52 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-forum/how-to-automatically-get-data-from-alfresco-and-store-in-vector/m-p/490498#M40215</guid>
      <dc:creator>sanogo</dc:creator>
      <dc:date>2025-06-08T11:27:52Z</dc:date>
    </item>
    <item>
      <title>Re: How to automatically get data from alfresco and store in vector database like Chromadb</title>
      <link>https://connect.hyland.com/t5/alfresco-forum/how-to-automatically-get-data-from-alfresco-and-store-in-vector/m-p/490500#M40216</link>
      <description>&lt;P&gt;There are samples available for each of the missing steps:&lt;/P&gt;
&lt;UL&gt;
&lt;LI&gt;Ingestion into a vector DB (like ChromaDB) with Spring AI:&amp;nbsp;&lt;A href="https://github.com/aborroy/alfresco-knowledge-enrichment" target="_blank"&gt;https://github.com/aborroy/alfresco-knowledge-enrichment&lt;/A&gt;&lt;/LI&gt;
&lt;LI&gt;Synchronization of Alfresco content with a vector DB:&amp;nbsp;&lt;A href="https://github.com/aborroy/alfresco-ai-framework" target="_blank"&gt;https://github.com/aborroy/alfresco-ai-framework&lt;/A&gt;&lt;/LI&gt;
&lt;/UL&gt;
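&lt;P&gt;To make the synchronization idea concrete, here is a minimal, self-contained sketch. It is not the code from either repository: the event shape, the ToyVectorStore class, and the bag-of-words embed function are all hypothetical stand-ins. A real setup would probably call an embedding model (for example through Ollama) and a vector database such as ChromaDB instead.&lt;/P&gt;
&lt;PRE&gt;
```python
import math
import re
from collections import Counter


def embed(text):
    """Toy embedding (assumption): a bag-of-words term-count vector."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two term-count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


class ToyVectorStore:
    """Hypothetical in-memory stand-in for a vector database."""

    def __init__(self):
        self.rows = {}  # maps node id to (vector, text)

    def upsert(self, node_id, text):
        # Re-embedding on every change keeps the index in step with the repository.
        self.rows[node_id] = (embed(text), text)

    def delete(self, node_id):
        self.rows.pop(node_id, None)

    def search(self, query, k=1):
        q = embed(query)
        ranked = sorted(
            self.rows.items(),
            key=lambda item: cosine(q, item[1][0]),
            reverse=True,
        )
        return [node_id for node_id, _ in ranked[:k]]


def apply_event(store, event):
    """Mirror one repository change event (hypothetical shape) into the store."""
    if event["type"] == "deleted":
        store.delete(event["id"])
    else:  # "created" or "updated"
        store.upsert(event["id"], event["text"])


store = ToyVectorStore()
events = [
    {"type": "created", "id": "node-1", "text": "Invoice for office chairs"},
    {"type": "created", "id": "node-2", "text": "Holiday policy for employees"},
    {"type": "updated", "id": "node-1", "text": "Invoice for standing desks"},
]
for event in events:
    apply_event(store, event)

top = store.search("standing desks invoice")
print(top)
```
&lt;/PRE&gt;
&lt;P&gt;Keying every entry by the repository node id is the design choice that matters here: an update overwrites the old vector instead of adding a duplicate, and a delete removes it, so search results do not point at stale content.&lt;/P&gt;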
&lt;P&gt;Hope this helps.&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Mon, 09 Jun 2025 06:07:39 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-forum/how-to-automatically-get-data-from-alfresco-and-store-in-vector/m-p/490500#M40216</guid>
      <dc:creator>angelborroy</dc:creator>
      <dc:date>2025-06-09T06:07:39Z</dc:date>
    </item>
    <item>
      <title>Re: How to automatically get data from alfresco and store in vector database like Chromadb</title>
      <link>https://connect.hyland.com/t5/alfresco-forum/how-to-automatically-get-data-from-alfresco-and-store-in-vector/m-p/490502#M40217</link>
      <description>&lt;P&gt;Hi, thank you for your quick response. This solves my problem.&lt;/P&gt;&lt;P&gt;I will use the synchronization of Alfresco content with a vector DB from&amp;nbsp;&lt;A href="https://github.com/aborroy/alfresco-ai-framework" target="_blank" rel="nofollow noopener noreferrer"&gt;https://github.com/aborroy/alfresco-ai-framework&lt;/A&gt;.&lt;/P&gt;&lt;P&gt;But can I use &lt;STRONG&gt;Angular for the UI&lt;/STRONG&gt; rather than &lt;STRONG&gt;Alfresco-AI-UI&lt;/STRONG&gt;?&lt;/P&gt;&lt;P&gt;I am thinking of the following architecture:&lt;/P&gt;&lt;P&gt;-&amp;nbsp;&lt;STRONG&gt;Alfresco&lt;/STRONG&gt; acts as the &lt;STRONG&gt;Knowledge Base&lt;/STRONG&gt;, storing documents.&lt;/P&gt;&lt;P&gt;-&amp;nbsp;&lt;STRONG&gt;alfresco-ai-sync&lt;/STRONG&gt; listens for changes in the sync folder of the Alfresco Repository and pushes the updated content to the &lt;STRONG&gt;Vector Database&lt;/STRONG&gt; through the &lt;STRONG&gt;AI RAG Framework&lt;/STRONG&gt; REST API.&lt;/P&gt;&lt;P&gt;-&amp;nbsp;&lt;STRONG&gt;AI-RAG-Framework&lt;/STRONG&gt; generates vector representations of the ingested documents, stores them in &lt;STRONG&gt;Elasticsearch or ChromaDB&lt;/STRONG&gt;, and exposes the REST API. Then, based on a request from the &lt;STRONG&gt;Angular UI&lt;/STRONG&gt;, we search the vector database and send the response back to the &lt;STRONG&gt;Angular UI&lt;/STRONG&gt;. We use Ollama to generate the vector representations. We can also send a file to be stored in &lt;STRONG&gt;Alfresco&lt;/STRONG&gt; through &lt;STRONG&gt;AI-RAG-Framework&lt;/STRONG&gt;.&lt;/P&gt;&lt;P&gt;-&amp;nbsp;&lt;STRONG&gt;Angular&lt;/STRONG&gt; provides the user interface, which lets a user upload a file and store it in &lt;STRONG&gt;Alfresco&lt;/STRONG&gt; by sending it through &lt;STRONG&gt;AI-RAG-Framework&lt;/STRONG&gt;.
Then we can send search requests through&amp;nbsp;&lt;STRONG&gt;AI-RAG-Framework&lt;/STRONG&gt;, using &lt;STRONG&gt;langchain4j&lt;/STRONG&gt;, to search in &lt;STRONG&gt;Elasticsearch&lt;/STRONG&gt; or &lt;STRONG&gt;ChromaDB&lt;/STRONG&gt;.&lt;/P&gt;</description>
      <pubDate>Mon, 09 Jun 2025 16:17:54 GMT</pubDate>
      <guid>https://connect.hyland.com/t5/alfresco-forum/how-to-automatically-get-data-from-alfresco-and-store-in-vector/m-p/490502#M40217</guid>
      <dc:creator>sanogo</dc:creator>
      <dc:date>2025-06-09T16:17:54Z</dc:date>
    </item>
  </channel>
</rss>

