Updated RAG tasks (#220)

This commit is contained in:
Thorsten Sommer 2024-11-23 13:35:42 +01:00 committed by GitHub
parent cf8a9800f3
commit 99fe9f3a4f
No known key found for this signature in database
GPG Key ID: B5690EEEBB952194

View File

@ -12,11 +12,13 @@ Things we are currently working on:
- [ ] App: Configure embedding providers
- [ ] App: Management of data sources (local & external data via [EDI](https://github.com/MindWorkAI/EDI))
- [ ] Runtime: Extract data from txt / md / pdf / docx / xlsx files
- [ ] App: Implement embedding providers
- [ ] Runtime: Implement internal embedding provider through [fastembed-rs](https://github.com/Anush008/fastembed-rs)
- [ ] App: Implement external embedding providers
- [ ] App: Implement the process to vectorize one local file using embeddings
- [ ] Runtime: Integration of the vector database [LanceDB](https://github.com/lancedb/lancedb)
- [ ] App: Implement the continuous process of vectorizing data
- [ ] App: Define an interface for the integration of RAG processes in chats
- [ ] App: Define a common retrieval context interface for the integration of RAG processes in chats
- [ ] App: Define a common augmentation interface for the integration of RAG processes in chats
- [ ] App: Integrate data sources in chats