Add Milvus database compatibility with the RAG recipe #334

Shreyanand · 2024-04-24T21:40:39Z

This PR adds the following changes:

Adds a shell script to deploy a standalone Milvus container on the local system
Seperate the vector db management logic from rag_chat.py file
Add a logic to switch between Milvus or Chromadb depending on what is being used

@MichaelClifford I need to add documentation around this but I have tested it on my system and it works. What should be the next things to check to make sure this is polished?

rhatdan · 2024-04-25T17:47:51Z

@Shreyanand you need to sign your commits.

git commit -a --amend -s
git push --force

rhatdan · 2024-04-25T17:48:05Z

@MichaelClifford PTAL

recipes/natural_language_processing/rag/app/manage_vectordb.py

recipes/natural_language_processing/rag/app/rag_app.py

vector_dbs/milvus/milvus_standalone_embed.sh

MichaelClifford · 2024-04-25T19:08:57Z

@Shreyanand You will also need to update recipes/natural_language_processing/rag/app/Containerfile to include your new manage_vectordb.py file

Gregory-Pereira

Did not have time to actually run the rag and make sure it works, I just focused on the DB parts and setting up the Makefile. I defer to Michael for the actual Rag and DB population bits.

vector_dbs/milvus/Makefile

rhatdan · 2024-05-02T17:06:41Z

Needs a rebase.

rhatdan · 2024-05-03T16:29:16Z

Also needs DCO
git commit -a --amend -s
git push --force

Signed-off-by: Shreyanand <shanand@redhat.com> Co-authored-by: Michael Clifford <mcliffor@redhat.com> Co-authored-by: greg pereira <grpereir@redhat.com>

Gregory-Pereira

/lgtm, but 2 tiny nits

.github/workflows/rag.yaml

recipes/natural_language_processing/rag/app/rag_app.py

Gregory-Pereira

/lgtm

Gregory-Pereira · 2024-05-13T16:38:28Z

Commit suggestions dont sign with DCO, major bummer. You will have to rebase those commit with signing. Personally I use:

which gamens
gamens: aliased to git commit --amend --no-edit -S -s

Alternatively we can just set DCO to pass, but I would prefer stuff stay signed if possible.

vector_dbs/milvus/Makefile

Co-authored-by: Gregory Pereira <grpereir@redhat.com> Update recipes/natural_language_processing/rag/app/rag_app.py Co-authored-by: Gregory Pereira <grpereir@redhat.com> Update Readme and add review comments Signed-off-by: Shreyanand <shanand@redhat.com>

MichaelClifford · 2024-05-16T19:19:23Z

recipes/natural_language_processing/rag/README.md

@@ -4,7 +4,7 @@ This demo provides a simple recipe to help developers start to build out their o

 There are a few options today for local Model Serving, but this recipe will use [`llama-cpp-python`](https://github.com/abetlen/llama-cpp-python) and their OpenAI compatible Model Service. There is a Containerfile provided that can be used to build this Model Service within the repo, [`model_servers/llamacpp_python/base/Containerfile`](/model_servers/llamacpp_python/base/Containerfile).

-In order for the LLM to interact with our documents, we need them stored and available in such a manner that we can retrieve a small subset of them that are relevant to our query. To do this we employ a Vector Database alongside an embedding model. The embedding model converts our documents into numerical representations, vectors, such that similarity searches can be easily performed. The Vector Database stores these vectors for us and makes them available to the LLM. In this recipe we will use [chromaDB](https://docs.trychroma.com/) as our Vector Database.
+In order for the LLM to interact with our documents, we need them stored and available in such a manner that we can retrieve a small subset of them that are relevant to our query. To do this we employ a Vector Database alongside an embedding model. The embedding model converts our documents into numerical representations, vectors, such that similarity searches can be easily performed. The Vector Database stores these vectors for us and makes them available to the LLM. In this recipe we can use [chromaDB](https://docs.trychroma.com/) or [Milvus](https://milvus.io/) as our Vector Database.


@Shreyanand Since the recipe is defined in the ai-lab.yaml file and should only have One vectorDB. I think we should only reference Milvus here and move any mentions of chroma to a chroma readme. Also you should update the ai-lab.yaml file to reference this DB instead.

MichaelClifford · 2024-05-16T19:20:34Z

recipes/natural_language_processing/rag/app/rag_app.py

 import os

 model_service = os.getenv("MODEL_ENDPOINT","http://0.0.0.0:8001")
 model_service = f"{model_service}/v1"
 chunk_size = os.getenv("CHUNK_SIZE", 150)
 embedding_model = os.getenv("EMBEDDING_MODEL","BAAI/bge-base-en-v1.5")
+vdb_vendor = os.getenv("VECTORDB_VENDOR", "chromadb")


lets make milvus the new default.

MichaelClifford · 2024-05-16T19:25:24Z

vector_dbs/README.md

We should follow the same format as the model_servers/ remove this doc, and add a README under each vectorDB directory with all the info to run it.

MichaelClifford · 2024-05-16T19:26:23Z

vector_dbs/milvus/Containerfile

@@ -0,0 +1,2 @@
+FROM docker.io/milvusdb/milvus:master-20240426-bed6363f


Suggested change

FROM docker.io/milvusdb/milvus:master-20240426-bed6363f

FROM docker.io/milvusdb/milvus:master-20240516-5b27a0cd

MichaelClifford · 2024-05-16T19:29:38Z

recipes/natural_language_processing/rag/README.md

You should also add to the readme the step needed to run the application with the new vectorDB. I think the Env Variables are a little different now.

Shreyanand requested review from MichaelClifford, rhatdan, sallyom, lmilbaum, cgwalters and Gregory-Pereira as code owners April 24, 2024 21:40

Gregory-Pereira mentioned this pull request Apr 25, 2024

Add milvus vectorDB option for RAG example #74

Closed

MichaelClifford requested changes Apr 25, 2024

View reviewed changes

Gregory-Pereira force-pushed the milvus branch 3 times, most recently from be21e2d to 595054c Compare April 28, 2024 01:49

Gregory-Pereira reviewed Apr 28, 2024

View reviewed changes

vector_dbs/milvus/Makefile Outdated Show resolved Hide resolved

Gregory-Pereira added the hold Do not merge label Apr 28, 2024

Shreyanand force-pushed the milvus branch from c0306f1 to fe53b0a Compare May 3, 2024 16:25

Add milvus vector database for rag recipe

705b5d1

Signed-off-by: Shreyanand <shanand@redhat.com> Co-authored-by: Michael Clifford <mcliffor@redhat.com> Co-authored-by: greg pereira <grpereir@redhat.com>

Shreyanand force-pushed the milvus branch from fe53b0a to 705b5d1 Compare May 3, 2024 16:32

Shreyanand requested review from MichaelClifford and Gregory-Pereira May 13, 2024 14:56

Gregory-Pereira reviewed May 13, 2024

View reviewed changes

.github/workflows/rag.yaml Outdated Show resolved Hide resolved

recipes/natural_language_processing/rag/app/rag_app.py Outdated Show resolved Hide resolved

Gregory-Pereira removed the hold Do not merge label May 13, 2024

Gregory-Pereira approved these changes May 13, 2024

View reviewed changes

MichaelClifford reviewed May 16, 2024

View reviewed changes

vector_dbs/milvus/Makefile Outdated Show resolved Hide resolved

Shreyanand force-pushed the milvus branch from 794891c to ef4b6f0 Compare May 16, 2024 16:53

rhatdan merged commit ae88cd7 into containers:main May 16, 2024
2 checks passed

MichaelClifford reviewed May 16, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add Milvus database compatibility with the RAG recipe #334

Add Milvus database compatibility with the RAG recipe #334

Shreyanand commented Apr 24, 2024

rhatdan commented Apr 25, 2024

rhatdan commented Apr 25, 2024

MichaelClifford commented Apr 25, 2024

Gregory-Pereira left a comment

rhatdan commented May 2, 2024

rhatdan commented May 3, 2024

Gregory-Pereira left a comment

Gregory-Pereira left a comment

Gregory-Pereira commented May 13, 2024 •

edited

MichaelClifford May 16, 2024

MichaelClifford May 16, 2024

MichaelClifford May 16, 2024

MichaelClifford May 16, 2024

MichaelClifford May 16, 2024

		@@ -0,0 +1,2 @@
		FROM docker.io/milvusdb/milvus:master-20240426-bed6363f

Add Milvus database compatibility with the RAG recipe #334

Add Milvus database compatibility with the RAG recipe #334

Conversation

Shreyanand commented Apr 24, 2024

rhatdan commented Apr 25, 2024

rhatdan commented Apr 25, 2024

MichaelClifford commented Apr 25, 2024

Gregory-Pereira left a comment

Choose a reason for hiding this comment

rhatdan commented May 2, 2024

rhatdan commented May 3, 2024

Gregory-Pereira left a comment

Choose a reason for hiding this comment

Gregory-Pereira left a comment

Choose a reason for hiding this comment

Gregory-Pereira commented May 13, 2024 • edited

MichaelClifford May 16, 2024

Choose a reason for hiding this comment

MichaelClifford May 16, 2024

Choose a reason for hiding this comment

MichaelClifford May 16, 2024

Choose a reason for hiding this comment

MichaelClifford May 16, 2024

Choose a reason for hiding this comment

MichaelClifford May 16, 2024

Choose a reason for hiding this comment

Gregory-Pereira commented May 13, 2024 •

edited