RAG
RAG feature uses the following variables
RAG_EMBEDDING_MODEL
Model used to embed text to vectors are provided by RAG_EMBEDDING_MODEL env variable.
Option 1 (OpenAI):
RAG_EMBEDDING_MODEL=text-embedding-3-small
# or #
RAG_EMBEDDING_MODEL=text-embedding-3-large
# or #
RAG_EMBEDDING_MODEL=text-embedding-ada-002
If you want to use OPEN_API, please ensure you have the OPENAI env variables configured. You can find the details in section : OpenAI Setup
Option 2 (AWS titan):
RAG_EMBEDDING_MODEL=amazon.titan-embed-text-v2:0
If you want to use AWS titan model, please ensure you have the AWS env variables configured. You can find the details in section : AWS Setup
RAG_VECTOR_STORE_TYPE
Database used to store the vectors are provided by RAG_VECTOR_STORE_TYPE env variable.
Option 1 (duckdb):
Uses local file system to store the vectors. No external or docker container dependencies required. Good for quick testing.
RAG_VECTOR_STORE_TYPE=duckdb
Option 2 (pinecone):
Uses Pinecone cloud database to store the vectors.
RAG_VECTOR_STORE_TYPE=pinecone
If you want to use pinecone as the store type, please ensure you have the Pinecone env variables configured. You can find the details in section : Pinecone Setup
Option 3 (mongodb):
Uses Mongodb cloud database to store the vectors.
RAG_VECTOR_STORE_TYPE=mongodb
If you want to use pinecone as the store type, please ensure you have the Mongodb env variables configured. You can find the details in section : Mongodb Setup
Local mongodb does not support atlas search which is required for embeddings based searches for RAG.