Learn how to create knowledge collections, add documents, and use RAG in your conversational flows.
Knowledge collections allow you to build a searchable knowledge base from your documents, websites, and other content sources. Your chatbot can then retrieve relevant information to provide accurate, context-aware responses.
What You'll Learn
A collection is a container for related knowledge. You might have separate collections for different topics, departments, or use cases (e.g., "Product Documentation", "HR Policies", "Customer FAQs").
A document represents a single piece of content from a source—a webpage, uploaded file, or manual entry. Each document is automatically processed and broken down into smaller pieces.
The system uses semantic search to understand the meaning of questions, not just keywords. This means it can find relevant information even when exact words don't match—for example, finding "refund procedure" when a user asks "how do I get my money back?"
Tip: Organize by Topic or Use Case
Create separate collections for different subject areas. This makes it easier to control which knowledge your chatbot searches and improves retrieval accuracy.
There are four ways to add content to a knowledge collection:
Upload documents directly to your collection. Supported formats include:
.txt, .md).html).pdf)The system automatically extracts text content and converts it to a searchable format.
Point to a website and let the system crawl and index its content:
https://docs.yoursite.com)Index specific pages without crawling entire sites:
Type or paste content directly:
Syncing and Updates
Website and URL sources can be configured to sync automatically (daily, weekly, or monthly) to keep your knowledge base up to date with the latest content.
Learn more about each content source type in the Document Sources guide.
When you add content to a collection, it's automatically processed to make it searchable and optimized for AI retrieval. Here's what happens:
Text is extracted from files, websites, or manual entries and cleaned up for processing.
Content is split into optimal chunks and enhanced with metadata for better retrieval.
Processed chunks are indexed for both keyword and semantic search.
Processing Happens in the Background
You don't need to wait for processing to complete. All steps run automatically in the background, typically taking a few minutes per document. You'll see status indicators in the UI as documents are processed.
Once your collection has processed content, you can use it in conversational flows. We recommend starting with the Knowledge Agent node for the simplest and most powerful experience.
The Knowledge Agent node provides an all-in-one solution for answering questions using your knowledge base:
Simplest Way to Get Started
The Knowledge Agent is the easiest way to add knowledge-powered Q&A to your chatbot. Just select your collections and it handles search, retrieval, and answer generation automatically.
For more control over the retrieval and response generation process, you can use RAG Retriever nodes:
These nodes give you fine-grained control but require connecting additional nodes (like Content Generator) to create complete responses.