From f92fab24b3ccca4c60854fd72ab8505e88b5eed5 Mon Sep 17 00:00:00 2001 From: Thorsten Sommer Date: Fri, 8 Nov 2024 22:49:09 +0100 Subject: [PATCH] Updated RAG tasks (#198) --- README.md | 7 ++++--- 1 file changed, 4 insertions(+), 3 deletions(-) diff --git a/README.md b/README.md index 0a4f1d6..5d73921 100644 --- a/README.md +++ b/README.md @@ -6,13 +6,14 @@ Things we are currently working on: - Since November 2024: Work on RAG (integration of your data and files) has begun. We will support the integration of local and external data sources. We need to implement the following runtime (Rust) and app (.NET) steps: - [x] ~~Runtime: Restructuring the code into meaningful modules (PR [#192](https://github.com/MindWorkAI/AI-Studio/pull/192))~~ + - [x] ~~Define the [External Data API (EDI)](https://github.com/MindWorkAI/EDI) as a contract for integrating arbitrary external data (PR [#1](https://github.com/MindWorkAI/EDI/pull/1))~~ - [ ] App: Metadata for providers (which provider offers embeddings?) - [ ] App: Management of data sources (local data) - - [ ] Runtime: Integration of the vector database [LanceDB](https://github.com/lancedb/lancedb) - [ ] Runtime: Extract data from txt / md / pdf / docx / xlsx files + - [ ] App: Implement embedding providers + - [ ] App: Implement the process to vectorize local data using embeddings + - [ ] Runtime: Integration of the vector database [LanceDB](https://github.com/lancedb/lancedb) - [ ] App: Define an interface for the integration of RAG processes in chats - - [x] ~~Define the [External Data API (EDI)](https://github.com/MindWorkAI/EDI) as a contract for integrating arbitrary external data (PR [#1](https://github.com/MindWorkAI/EDI/pull/1))~~ - - [ ] App: Implement the process control of vectorizing local data - [ ] App: Integrate data sources in chats - [ ] App: Management of data sources (external data via [EDI](https://github.com/MindWorkAI/EDI))