Overview

ExuluContext represents a semantic search index that powers retrieval-augmented generation (RAG) in Exulu IMP. Each context maintains a collection of items with vector embeddings, enabling agents to query relevant information using natural language, keywords, or hybrid search.

Key features

Vector search

Semantic search with pgvector for similarity-based retrieval

Hybrid search

Combines vector similarity with full-text keyword search

Data sources

Scheduled data ingestion from external systems

Processors

Transform items before storage and embeddings generation

Auto tool generation

Automatically exposed as a tool for agents to use

Chunk expansion

Retrieve surrounding context chunks for better results

What is a context?

A context is a semantic search index that:

Stores structured data with custom fields (text, numbers, files, JSON, etc.)
Generates embeddings using a configured embedder (OpenAI, Anthropic, etc.)
Enables semantic search through vector similarity and full-text search
Automatically becomes a tool that agents can call to retrieve information
Manages data sources that periodically sync external data

Think of a context as a specialized database table optimized for AI retrieval with semantic understanding.

Quick start

import { ExuluContext, ExuluEmbedder } from "@exulu/backend";

// Create an embedder
const embedder = new ExuluEmbedder({
  id: "openai_embedder",
  name: "OpenAI Embeddings",
  provider: "openai",
  model: "text-embedding-3-small",
  vectorDimensions: 1536
});

// Create a context
const docsContext = new ExuluContext({
  id: "documentation",
  name: "Documentation",
  description: "Product documentation and help articles",
  active: true,
  embedder: embedder,
  fields: [
    {
      name: "title",
      type: "text",
      required: true
    },
    {
      name: "content",
      type: "text",
      required: true
    },
    {
      name: "url",
      type: "text"
    }
  ],
  sources: [],
  configuration: {
    calculateVectors: "onInsert",
    enableAsTool: true,
    maxRetrievalResults: 10
  }
});

// Use in ExuluApp
const app = new ExuluApp();
await app.create({
  contexts: {
    documentation: docsContext
  },
  config: { /* ... */ }
});

Architecture

Database structure

Each context creates two PostgreSQL tables:

Items table

Stores the actual data items with your custom fields plus built-in fields:

id - UUID primary key
name - Item name
description - Item description
external_id - Optional external identifier for syncing
tags - Comma-separated tags
created_by - User who created the item
rights_mode - Access control mode (private, public, restricted)
embeddings_updated_at - Last embeddings generation timestamp
chunks_count - Number of embedding chunks
Your custom fields

Chunks table

Stores embedding chunks generated from items:

id - UUID primary key
source - Foreign key to items table
content - Text content of the chunk
chunk_index - Position within the document
metadata - JSON metadata
embedding - Vector embedding (pgvector)
fts - Full-text search index (tsvector)

Search methods

Hybrid search
Semantic search
Keyword search

Combines semantic understanding with keyword matching. Best for most queries.

const results = await context.search({
  query: "How do I configure authentication?",
  keywords: ["auth", "config"],
  method: "hybridSearch",
  limit: 10
});

Uses vector similarity for conceptual matching. Best for questions with synonyms and paraphrasing.

const results = await context.search({
  query: "authentication setup process",
  method: "cosineDistance",
  limit: 10
});

Full-text search for exact term matching. Best for technical names, IDs, or specific phrases.

const results = await context.search({
  keywords: ["API-KEY-123", "production"],
  method: "tsvector",
  limit: 10
});

Core concepts

Fields

Define the structure of your items with typed fields:

fields: [
  { name: "title", type: "text", required: true },
  { name: "category", type: "text", index: true },
  { name: "priority", type: "number", default: 0 },
  { name: "metadata", type: "json" },
  { name: "document", type: "file", allowedFileTypes: ["pdf", "docx"] },
  { name: "status", type: "text", enumValues: ["draft", "published"] }
]

Field types include: text, number, boolean, date, json, file, and more. See the configuration guide for all available types.

Embedder

The embedder generates vector representations of your items:

embedder: new ExuluEmbedder({
  id: "my_embedder",
  name: "OpenAI Embeddings",
  provider: "openai",
  model: "text-embedding-3-small",
  vectorDimensions: 1536,
  template: "Title: {{title}}\n\nContent: {{content}}"
})

The embedder uses a template to format item data before generating embeddings.

Sources

Data sources automatically sync external data into your context:

sources: [
  {
    id: "notion_sync",
    name: "Notion Documentation",
    description: "Syncs docs from Notion",
    config: {
      schedule: "0 * * * *", // Every hour
      queue: myQueue,
      retries: 3
    },
    execute: async ({ exuluConfig }) => {
      // Fetch data from Notion API
      const pages = await fetchNotionPages();
      return pages.map(page => ({
        external_id: page.id,
        name: page.title,
        content: page.content
      }));
    }
  }
]

Processor

Processors transform items before storage or embeddings generation:

processor: {
  name: "Extract text from PDF",
  config: {
    trigger: "onInsert",
    generateEmbeddings: true
  },
  execute: async ({ item, utils }) => {
    // Extract text from PDF file
    const text = await utils.storage.extractText(item.document_s3key);
    return {
      ...item,
      content: text
    };
  }
}

Usage patterns

As an agent tool

When enableAsTool: true, contexts automatically become tools that agents can call:

// Agent receives this tool automatically
{
  name: "documentation_context_search",
  description: "Gets information from the context called: Documentation...",
  inputSchema: {
    query: "The user's question",
    keywords: ["relevant", "terms"],
    method: "hybrid" | "semantic" | "keyword"
  }
}

Agents will intelligently choose when to query your context based on the user’s question.

Direct querying

You can also query contexts directly in your code:

const context = app.context("documentation");
const results = await context.search({
  query: "How do I configure CORS?",
  keywords: ["CORS", "config"],
  method: "hybridSearch",
  itemFilters: [
    { field: "category", operator: "=", value: "configuration" }
  ],
  limit: 5
});

console.log(results.chunks); // Array of matching chunks with metadata

Managing items

Create, update, and delete items programmatically:

// Create an item
const { item, job } = await context.createItem(
  {
    external_id: "doc-123",
    name: "Getting Started",
    content: "Welcome to our platform..."
  },
  config,
  userId
);

// Update an item
await context.updateItem(
  {
    id: item.id,
    content: "Updated content..."
  },
  config,
  userId
);

// Delete an item
await context.deleteItem({ id: item.id });

Search features

Chunk expansion

Retrieve surrounding chunks for more context:

configuration: {
  expand: {
    before: 1, // Include 1 chunk before match
    after: 2   // Include 2 chunks after match
  }
}

Search cutoffs

Filter results by relevance score:

configuration: {
  cutoffs: {
    cosineDistance: 0.7,  // Only results with >0.7 similarity
    tsvector: 0.5,        // Only results with >0.5 keyword score
    hybrid: 0.6           // Only results with >0.6 combined score
  }
}

Query rewriting

Improve search quality with query rewriting:

queryRewriter: async (query: string) => {
  // Call LLM to expand/clarify the query
  const rewritten = await llm.rewrite(query);
  return rewritten;
}

Result reranking

Reorder results based on custom logic:

resultReranker: async (results) => {
  // Use a reranker model or custom logic
  const reranked = await reranker.rerank(results);
  return reranked;
}

Overview

Overview

Key features

Vector search

Hybrid search

Data sources

Processors

Auto tool generation

Chunk expansion

What is a context?

Quick start

Architecture

Database structure

Search methods

Core concepts

Fields

Embedder

Sources

Processor

Usage patterns

As an agent tool

Direct querying

Managing items

Search features

Chunk expansion

Search cutoffs

Query rewriting

Result reranking

Next steps

Configuration

API reference

​Overview

​Key features

Vector search

Hybrid search

Data sources

Processors

Auto tool generation

Chunk expansion

​What is a context?

​Quick start

​Architecture

​Database structure

​Search methods

​Core concepts

​Fields

​Embedder

​Sources

​Processor

​Usage patterns

​As an agent tool

​Direct querying

​Managing items

​Search features

​Chunk expansion

​Search cutoffs

​Query rewriting

​Result reranking

​Next steps

Configuration

API reference

Overview

Key features

What is a context?

Quick start

Architecture

Database structure

Search methods

Core concepts

Fields

Embedder

Sources

Processor

Usage patterns

As an agent tool

Direct querying

Managing items

Search features

Chunk expansion

Search cutoffs

Query rewriting

Result reranking

Next steps