From dea164085d42bcaaf8f23f5b61cd816b2b55dc79 Mon Sep 17 00:00:00 2001
From: Thorsten Sommer
Date: Fri, 8 Nov 2024 22:48:25 +0100
Subject: [PATCH] Updated RAG tasks

---
 README.md | 7 ++++---
 1 file changed, 4 insertions(+), 3 deletions(-)

diff --git a/README.md b/README.md
index 0a4f1d68..5d73921b 100644
--- a/README.md
+++ b/README.md
@@ -6,13 +6,14 @@ Things we are currently working on:
 - Since November 2024: Work on RAG (integration of your data and files) has begun. We will support the integration of local and external data sources. We need to implement the following runtime (Rust) and app (.NET) steps:
   - [x] ~~Runtime: Restructuring the code into meaningful modules (PR [#192](https://github.com/MindWorkAI/AI-Studio/pull/192))~~
+  - [x] ~~Define the [External Data API (EDI)](https://github.com/MindWorkAI/EDI) as a contract for integrating arbitrary external data (PR [#1](https://github.com/MindWorkAI/EDI/pull/1))~~
   - [ ] App: Metadata for providers (which provider offers embeddings?)
   - [ ] App: Management of data sources (local data)
-  - [ ] Runtime: Integration of the vector database [LanceDB](https://github.com/lancedb/lancedb)
   - [ ] Runtime: Extract data from txt / md / pdf / docx / xlsx files
+  - [ ] App: Implement embedding providers
+  - [ ] App: Implement the process to vectorize local data using embeddings
+  - [ ] Runtime: Integration of the vector database [LanceDB](https://github.com/lancedb/lancedb)
   - [ ] App: Define an interface for the integration of RAG processes in chats
-  - [x] ~~Define the [External Data API (EDI)](https://github.com/MindWorkAI/EDI) as a contract for integrating arbitrary external data (PR [#1](https://github.com/MindWorkAI/EDI/pull/1))~~
-  - [ ] App: Implement the process control of vectorizing local data
   - [ ] App: Integrate data sources in chats
   - [ ] App: Management of data sources (external data via [EDI](https://github.com/MindWorkAI/EDI))