Architecture#

Context Aware RAG allows the user to start both a Data Ingestion Service and a Data Retrieval Service
- The Data Ingestion Service is responsible for ingesting data into the database
  - Database can be plug and play, Milvus, Neo4j, etc.
- The Data Retrieval Service is responsible for retrieving data from the database
  - Utilizes Graph-RAG or Vector-RAG to retrieve relevant documents from the database to answer the user’s query

Components#

        ---
config:
  theme: dark

---
  flowchart TD
      CM[Context Manager]


      %% Tools used by the Context Manager
      subgraph Tools [Tools]
        MT["MilvusDBTool"]
        NT["Neo4jGraphDB"]
        NOTIF["NotificationTool"]
      end

      %% Functions executed by the Context Manager
      subgraph Functions [Functions]
        CF["ChatFunction"]
        GEF["GraphExtractionFunc"]
        GRF["GraphRetrievalFunc"]
        VRF["VectorRetrievalFunc"]
      end

      %% The Context Manager invokes Tools and Functions
      Functions --> |Uses| Tools


      %% Dynamic selection among functions based on the RAG approach
      CM -- "Uses" --> CF
      CF -- "Calls" --> GEF
      CF -- "Calls" --> GRF
      CF -- "Calls" --> VRF

Context Manager#

The Context Manager is the central coordinator of the system. Its responsibilities include:

Asynchronous Processing:
- Manages the flow between data ingestion, function execution, and final document retrieval.
- Calls different functions based on the analysis of user queries and the type of RAG (Graph-RAG vs. Vector-RAG) configured.
Dynamic Function Execution:
- Loads and configures various functions (e.g., ChatFunction, Summarization Functions) by leveraging the modular function and tool interfaces.
- Configurable with different RAG approaches (vector-based, graph-based)

Tools and Functions#

The system is built upon a clear separation between the logic (functions) and external dependencies (tools):

Tools:
- Tools encapsulate the logic for interacting with external services such as databases (Neo4j for graph data, Milvus for vector embeddings).
- Examples include:
  - MilvusDBTool:
    - Interfaces with the Milvus vector database to manage document embeddings.
    - Provides efficient similarity searching by indexing and retrieving document embeddings.
  - Neo4jGraphDB (Neo4jTool):
    - Provides integration with the Neo4j graph database to store and query graph representations of documents.
    - Supports advanced retrieval strategies by converting documents into graph structures and executing cypher queries to retrieve relationships, entities, and structured information from the graph.
  - NotificationTool:
    - Serves as a lightweight interface for dispatching notifications and alerts within the system.
    - Can be extended to integrate with various notification endpoints.
Functions:
- The Function serves as the base interface for implementing processing routines in the Context Manager.
- Examples include:
  - ChatFunction:
    - Handles chat interactions
    - Manages conversation context
    - Integrates with LLM for response generation
  - GraphExtractionFunc:
    - Processes documents into graph representations
    - Handles entity extraction and relationship mapping
    - Manages document chunking and embedding generation
  - GraphRetrievalFunc:
    - Retrieves information using graph-based approach
    - Implements vector similarity search
    - Manages context assembly for LLM
  - VectorRetrievalFunc:
    - Retrieves information using vector-based approach
    - Handles embedding-based similarity search
    - Manages document ranking and selection

Service Architecture#

The Context-Aware RAG system consists of two main services:

Data Ingestion Service (default port: 8001) - Handles document ingestion and processing
Retrieval Service (default port: 8000) - Handles question answering and document retrieval

        ---
config:
  theme: dark

---
graph LR
    A[Client] --> B[Data Ingestion Service]
    A --> C[Retrieval Service]
    B --> D[Vector Store/Graph DB]
    C --> D

Service Communication#

Both services must be initialized with the same UUID to ensure proper communication
Services communicate through a shared database (vector store or graph DB)
Each service has its own dedicated port for client communication
Services can be scaled independently based on workload

Service Responsibilities#

Data Ingestion Service
- Handles document ingestion and processing
- Manages document chunking and embedding generation
- Stores processed documents in the database
- Provides health check endpoints
Retrieval Service
- Handles user queries and question answering
- Retrieves relevant documents from the database
- Generates responses using LLM
- Provides health check endpoints