Documentation Knowledge Getting Started

Getting Started with Knowledge

Learn how to create knowledge collections, add documents, and use RAG in your conversational flows.

Overview

Knowledge collections allow you to build a searchable knowledge base from your documents, websites, and other content sources. Your chatbot can then retrieve relevant information to provide accurate, context-aware responses.

What You'll Learn

  • How knowledge collections work
  • The document processing pipeline
  • Adding and managing content sources
  • Using RAG nodes in your flows

Core Concepts

Collections

A collection is a container for related knowledge. You might have separate collections for different topics, departments, or use cases (e.g., "Product Documentation", "HR Policies", "Customer FAQs").

Documents

A document represents a single piece of content from a source—a webpage, uploaded file, or manual entry. Each document is automatically processed and broken down into smaller pieces.

Semantic Search

The system uses semantic search to understand the meaning of questions, not just keywords. This means it can find relevant information even when exact words don't match—for example, finding "refund procedure" when a user asks "how do I get my money back?"

How It Works Together

1
You add content Upload files, connect websites, or paste text into a collection
2
System processes it Documents are chunked, enriched with metadata, and embedded
3
Chatbot searches it RAG nodes find relevant chunks using hybrid semantic + keyword search
4
Response generated AI uses retrieved information to provide accurate, grounded answers

Creating Your First Collection

  1. Navigate to Knowledge
    From your dashboard, go to the Knowledge section in the sidebar.
  2. Create Collection
    Click "New Collection" and give it a descriptive name (e.g., "Product Documentation").
  3. Add a Description (optional)
    Describe what content this collection contains. This helps when you have multiple collections.
  4. Save the Collection
    Your empty collection is now ready to receive content.

Tip: Organize by Topic or Use Case

Create separate collections for different subject areas. This makes it easier to control which knowledge your chatbot searches and improves retrieval accuracy.

Adding Content to Collections

There are four ways to add content to a knowledge collection:

1. Upload Files

Upload documents directly to your collection. Supported formats include:

  • Text files (.txt, .md)
  • HTML files (.html)
  • PDF files (.pdf)
  • Other document formats processed by the system

The system automatically extracts text content and converts it to a searchable format.

2. Scrape Websites

Point to a website and let the system crawl and index its content:

  • Enter a base URL (e.g., https://docs.yoursite.com)
  • Set a maximum page limit to control crawling scope
  • The crawler follows internal links and indexes all discovered pages
  • Content is automatically converted from HTML to markdown

3. Add Specific URLs

Index specific pages without crawling entire sites:

  • Provide a list of URLs to scrape
  • Each URL is fetched and processed independently
  • Perfect for curating content from multiple sources

4. Manual Entry

Type or paste content directly:

  • Create documents manually in the UI
  • Paste text from other sources
  • Ideal for quick additions or proprietary content

Syncing and Updates

Website and URL sources can be configured to sync automatically (daily, weekly, or monthly) to keep your knowledge base up to date with the latest content.

Learn more about each content source type in the Document Sources guide.

How Content is Processed

When you add content to a collection, it's automatically processed to make it searchable and optimized for AI retrieval. Here's what happens:

Extract

Text is extracted from files, websites, or manual entries and cleaned up for processing.

Optimize

Content is split into optimal chunks and enhanced with metadata for better retrieval.

Index

Processed chunks are indexed for both keyword and semantic search.

Processing Happens in the Background

You don't need to wait for processing to complete. All steps run automatically in the background, typically taking a few minutes per document. You'll see status indicators in the UI as documents are processed.

Using Knowledge in Your Flows

Once your collection has processed content, you can use it in conversational flows. We recommend starting with the Knowledge Agent node for the simplest and most powerful experience.

Knowledge Agent (Recommended)

The Knowledge Agent node provides an all-in-one solution for answering questions using your knowledge base:

  • Automatically searches your collections to find relevant information
  • Generates accurate, well-cited answers based on retrieved content
  • Handles follow-up questions with conversation context
  • Cites sources so users can verify information
  • No additional nodes needed—it handles everything internally

Simplest Way to Get Started

The Knowledge Agent is the easiest way to add knowledge-powered Q&A to your chatbot. Just select your collections and it handles search, retrieval, and answer generation automatically.

Advanced: RAG Retriever Nodes

For more control over the retrieval and response generation process, you can use RAG Retriever nodes:

  • RAG Retriever: Basic search and retrieval. Returns chunks that you can use with Content Generator nodes for custom responses.
  • Agentic RAG Retriever: Advanced retrieval with automatic query reformulation, multi-hop reasoning, and result quality assessment.

These nodes give you fine-grained control but require connecting additional nodes (like Content Generator) to create complete responses.