RAG Configuration Resource

The RAG Configuration Connection Shared Resource specifies the configuration required to connect with an external AI model service and manage Retrieval-Augmented Generation (RAG) workflows. Through this resource, users can select the language model service (such as, OpenAI or Ollama), choose an embedding model to generate vector representations, and configure the vector database used for storing and retrieving embeddings.

General

The General panel contains the following fields.

Property Module Property? Yes/No Description
Package No The name of the package in which you want to create a shared resource.
Name No The name to be displayed as the label for the shared resource in the process.
Description No Provide a short description for this shared resource.

Configuration

The Configuration panel contains the following fields.

Property Module Property? Yes/No Description
LLM Selection  

The following are the supported LLMs:

  • Ollama

  • OpenAI

Ollama Connection
Ollama Endpoint Yes The field indicates the endpoint URL of the Ollama service used to process model requests.
Model Name Yes The field indicates the name of the Ollama model used to generate responses.
Open AI Connection
OpenAI LLM Model Selection Yes The field is to identify the OpenAI Language Model to generate responses from retrieved data.
OpenAI API Key Yes The field indicates the API key used to authenticate with the OpenAI service.
Temperature Yes The field indicates the level of randomness incorporated when generating responses.
Max Tokens Yes The field indicates the maximum number of tokens permitted in the generated response.
Top P Yes The field indicates the cumulative probability threshold that restricts token selection.
Frequency Penality Yes The field indicates the penalty applied to reduce repetition of words or phrases in generated response.
Presence Penality Yes The field indicates the penalty applied to encourage the introduction of new topics in the generated response.
Test Connection   Click the Test Connection button to verify that the configuration can establish a successful connection to the OpenAI service.
Vector Database Configuration
Embedding Model No

The field indicates the embedding model provider for generating vector representations. Select the OpenAI for OpenAI embedding model, or Ollama for Ollama embedding model.

OpenAI Embedding Endpoint Yes The field indicates the URL for a custom OpenAI embedding endpoint. If OpenAI is chosen as the LLM, it automatically applies the default end point when the field is left blank.
OpenAI Embedding Model Name No

The field indicates the OpenAI embedding model used to generate vector representations.

The following are the supported embedding models:

  • text-embedding-3-large

  • text-embedding-3-small

  • text-embedding-ada-002

Embedding Ollama Endpoint Yes The field indicates the endpoint URL of the Ollama service running locally to generate embedding.
Embedding Ollama Model Name No

The field indicates the Ollama embedding model used to generate vector representations.

The following are the supported embedding models:

  • all-mililm

  • nomic-embed-text

Vector Database Storage No

The field indicates the vector database storage that manages the storage and retrieval of embeddings.

The following are the supported storage options:

  • local

  • Weaviate

local

Configure the following field when you select the local database storage.

Storage File Yes

The field indicates the local file system directory for storing vectors.

Note: The above field is enabled only when the Local is selected as the vector database storage.

Weaviate

Configure the following fields when you select the Weaviate database storage.

API Key Yes

The field indicates the API key required for authentication with the Weaviate database.

Host Yes The field indicates the host name or IP address of the Weaviate server.
Port Yes The field indicates the port used to connect to the Weaviate server.
Scheme No

The dropdown indicates the communication scheme for Weaviate connection.

The following are the supported values:

  • HTTP

  • HTTPS

Allow Duplicates No The checkbox indicates if the Weaviate database permits duplicate vectors.
Objectclass Yes The field indicates the object class used to store vectors in Weaviate.
Consistency No

The dropdown indicates the consistency level configured for Weaviate operations. Following are the supported values:

  • QUORUM

  • ONE

  • ALL

Textfield Name Yes The field indicates the name of the text field to store text content in Weaviate.
Metadata Field Name Yes The field indicates the name of the metadata field in Weaviate.
Metadata Keys Yes The field indicates the keys of metadata fields to be stored or retrieved in Weaviate.