Built-in Components

Relevant source files

This page documents the standard components available in RAGFlow's Canvas workflow system. These components serve as building blocks for constructing AI agent workflows, ranging from simple LLM invocations to complex multi-tool agent orchestrations with retrieval and conditional routing.

For information about the component architecture and registration system, see Component System Architecture. For details on workflow execution and how components interact during runtime, see Workflow Execution and Streaming.

Component Overview

RAGFlow provides a library of built-in components that inherit from ComponentBase (agent/component/base.py365-585) and define reusable workflow operations. Each component consists of a parameter class (inheriting from ComponentParamBase) and an implementation class that provides the _invoke method.

Component Categories

Built-in components fall into five functional categories:

Category	Components	Purpose
LLM Interaction	LLM, Agent	Direct LLM calls with prompt templates and tool-augmented ReAct agents
Information Retrieval	Retrieval, Generate	Knowledge base search and context-aware content generation
Control Flow	Categorize, Switch, Iteration, Loop, ExitLoop	Conditional branching and iterative execution
User Interaction	Begin, UserFillUp, Message	Workflow entry points and user input collection
Content Processing	Template	Dynamic prompt construction with variable substitution

LLM Component

The LLM component (agent/component/llm.py82-448) provides direct access to language models with configurable prompts, generation parameters, and optional structured output.

Component Structure

Sources: agent/component/llm.py33-80 agent/component/llm.py82-448

Configuration Parameters

Core Parameters (agent/component/llm.py33-59):

llm_id (required): Model identifier from tenant LLM configuration
sys_prompt: System-level instructions prepended to conversation
prompts: List of message dictionaries with role and content fields
message_history_window_size: Number of previous messages to include (default: 13)

Generation Control (agent/component/llm.py61-79):

temperature: Sampling randomness (0.0-1.0)
max_tokens: Maximum response length
top_p: Nucleus sampling threshold
presence_penalty: Repetition penalty for new topics
frequency_penalty: Repetition penalty for frequent tokens

Advanced Features (agent/component/llm.py48-50):

cite: Enable automatic citation insertion for retrieved content
output_structure: JSON schema for structured output validation
visual_files_var: Variable reference for image inputs

Variable Resolution in Prompts

The LLM component supports variable references in both system prompts and message content using the syntax {component_id@output_var} or {sys.variable}. Variables are resolved during invocation (agent/component/llm.py226-259):

Sources: agent/component/base.py500-511 agent/component/llm.py102-109 agent/canvas.py164-236

Structured Output Mode

When output_structure is configured with a JSON schema, the component enforces format compliance through retry logic (agent/component/llm.py365-448):

System prompt is augmented with schema specification via structured_output_prompt() (rag/prompts/generator.py443-445)
LLM generates response attempting to match schema
Response is parsed with json_repair.loads() for robustness
On validation failure, the component retries up to max_retries times
If all retries fail, sets _ERROR output instead of throwing exception

Image Input Handling

The component automatically detects base64-encoded images in input variables (agent/component/llm.py129-167):

Extracts data:image/ prefixed strings from all input values
Removes image data from text content to avoid token bloat
Switches to LLMType.IMAGE2TEXT model if images are present (agent/component/llm.py248-252)
Passes images to LLMBundle.async_chat() via images parameter

Agent Component

The Agent component (agent/component/agent_with_tools.py83-534) implements a ReAct (Reasoning + Acting) loop that iteratively calls language models to plan actions, invoke tools, and reflect on results until completing the task.

Architecture

Sources: agent/component/agent_with_tools.py38-81 agent/component/agent_with_tools.py83-118

Tool Loading and Registration

The agent loads tool components during initialization (agent/component/agent_with_tools.py88-116):

For MCP (Model Context Protocol) servers, the agent establishes connections and registers remote tools (agent/component/agent_with_tools.py108-114).

ReAct Execution Loop

The agent's core execution logic (agent/component/agent_with_tools.py278-419) follows this pattern:

Sources: agent/component/agent_with_tools.py278-419 rag/prompts/generator.py395-414

Task Analysis and Planning

The agent uses specialized prompts for different phases (rag/prompts/generator.py172-177):

Task Analysis (ANALYZE_TASK_SYSTEM + ANALYZE_TASK_USER): Breaks down the user request and available tools
Next Step Planning (NEXT_STEP): Determines which tool to invoke next based on conversation history
Reflection (REFLECT): Summarizes tool call results and updates understanding

The next_step_async() function (rag/prompts/generator.py395-414) orchestrates tool selection:

Sources: rag/prompts/generator.py395-414 agent/component/agent_with_tools.py376-410

Tool Call Callback and Tracing

Every tool invocation triggers a callback that logs execution details to Redis (agent/canvas.py779-801):

This enables the trace visualization feature accessed via /canvas/trace endpoint (api/apps/canvas_app.py551-563).

Structured Output with Tools

When both tools and structured output are configured, the agent completes tool calls before enforcing schema validation (agent/component/agent_with_tools.py200-254):

Execute ReAct loop until complete_task is called
Collect full response text
Attempt JSON parsing with retry logic
Return structured object via structured output key

Categorize Component

The Categorize component (agent/component/categorize.py97-164) classifies user input into predefined categories and routes execution to different downstream components based on the classification result.

Component Configuration

Sources: agent/component/categorize.py29-95 agent/component/categorize.py97-164

Category Definition Structure

Each category in category_description is a dictionary with three fields (agent/component/categorize.py36-48):

Classification Prompt Generation

The component dynamically builds a classification prompt (agent/component/categorize.py58-94):

Lists all category names
Includes category descriptions
Provides example mappings: USER: "query" → category_name
Instructs LLM to return only the category name

Execution Flow

Sources: agent/component/categorize.py107-156 agent/canvas.py612-613

Fallback Behavior

If the LLM response doesn't clearly match any category, the component defaults to the last defined category (agent/component/categorize.py149-153). This ensures workflow execution always continues.

Retrieval Component

The Retrieval component (referenced in agent/canvas.py52-59) performs hybrid search over knowledge bases, combining vector similarity and keyword matching to retrieve relevant document chunks.

Typical Configuration

Based on the DSL structure in Canvas initialization:

Output Format

The component produces two output variables:

content: Formatted string representation of retrieved chunks (via kb_prompt() from rag/prompts/generator.py99-143)
_references: Structured list of chunk metadata for citation

Retrieval results are automatically added to the canvas reference store (agent/canvas.py803-821):

Generate Component

The Generate component (mentioned in agent/canvas.py60-67) performs context-aware text generation, typically consuming retrieval results and user queries to produce grounded answers.

Standard Workflow Pattern

Sources: agent/canvas.py42-78

Citation Integration

When the Generate component's cite parameter is enabled and retrieval results exist, the component automatically inserts citations using the pattern [ID: N] where N is a hash of the chunk ID (rag/prompts/generator.py159 common/misc_utils.py25).

The citation prompt template (rag/prompts/generator.py159-160) instructs the LLM to:

Use inline citations in format [ID: hash]
Only cite chunks actually referenced
Group citations at end of relevant sentences

Template Component

The Template component (referenced in workflow examples) provides Jinja2-based templating for dynamic prompt construction with complex variable substitution.

Usage Pattern

The template engine supports:

Variable expansion from canvas state
Jinja2 control structures (loops, conditionals)
Filters and functions for text transformation

Message Component

The Message component (agent/canvas.py503-562) handles final output delivery to users, supporting text content, attachments, status codes, and optional text-to-speech conversion.

Output Fields

Sources: agent/canvas.py503-562

Auto-Play TTS Feature

When auto_play parameter is enabled (agent/canvas.py504-505), the component:

Initializes a LLMBundle with LLMType.TTS
Streams content chunks to TTS model via tts() method (agent/canvas.py675-716)
Accumulates audio binary and encodes as hex
Emits audio_binary in message events for client-side playback

Text is cleaned before TTS (agent/canvas.py676-703):

Removes control characters and emojis
Truncates to 500 characters max
Strips markdown formatting

Citation Reference Injection

If the message content contains citation patterns ([ID: N]), the component automatically attaches retrieval references via get_reference() (agent/canvas.py560-561 agent/canvas.py818-821).

Begin Component

The Begin component (agent/canvas.py446-448) serves as the workflow entry point, accepting initial user inputs and system-level configuration.

Operating Modes

Standard Mode: Receives query and files via Canvas API (api/apps/canvas_app.py132-186)

Webhook Mode (agent/canvas.py378-387): Accepts payload via HTTP webhook with custom input mapping:

Prologue Feature

The Begin component's prologue parameter (agent/canvas.py733-734) defines an initial assistant message shown before workflow execution, useful for greetings or instructions.

UserFillUp Component

The UserFillUp component (agent/canvas.py628-642) pauses workflow execution to collect additional user inputs mid-execution.

Execution Behavior

Sources: agent/canvas.py628-642 agent/canvas.py410-412

Input Definition

The component defines required inputs through its parameter class:

When enable_tips is true, the tips content is shown to users in the input prompt (agent/canvas.py638-639).

Control Flow Components

Switch Component

The Switch component (referenced in agent/canvas.py612) provides multi-way branching based on expression evaluation, similar to switch-case statements in programming languages.

Iteration Component

The Iteration component (agent/canvas.py608-611 agent/canvas.py614-615) repeats its child components over a collection:

Sources: agent/canvas.py608-621

Loop Component

The Loop component (agent/canvas.py608-611 agent/canvas.py614-621) provides conditional looping with explicit exit conditions:

Sources: agent/canvas.py608-621

When an ExitLoop component is executed within a Loop's child context, the loop terminates and execution proceeds to the Loop's downstream components (agent/canvas.py616-617).

Component Input/Output Patterns

All components follow consistent I/O patterns defined by the base classes.

Input Element Resolution

Components declare inputs by scanning their parameter values for variable references (agent/component/base.py500-511):

Output Storage

Components write outputs using set_output(key, value) (agent/component/base.py458-461):

Special output keys:

_ERROR: Signals component failure, triggers exception handler (agent/component/base.py463-464)
_next: Controls routing for branching components (agent/canvas.py612-613)
_created_time / _elapsed_time: Performance metrics (agent/component/base.py408 agent/component/base.py418)

Variable Reference Patterns

Pattern	Example	Description
Component output	`{retrieval_0@content}`	References specific component's output variable
System variable	`{sys.query}`	Accesses canvas global state
Environment variable	`{env.api_key}`	User-defined workflow variables
Nested access	`{[email protected]}`	Dot notation for nested object fields

Sources: agent/component/base.py368 agent/canvas.py164-206

Component Debugging

The Canvas system provides a debug endpoint for testing individual components in isolation (api/apps/canvas_app.py332-366).

Debug Request Format

Debug Execution Flow

Sources: api/apps/canvas_app.py332-366

For LLM and Agent components, streaming outputs (stored as partial objects) are fully resolved during debug to return complete text (api/apps/canvas_app.py354-362).

Component Lifecycle

Sources: agent/component/base.py407-447 agent/canvas.py571-581

Exception Handling Configuration

Components support three exception handling strategies (agent/component/base.py567-582):

Goto: Redirect to alternative component path via exception_goto parameter
Default Value: Return fallback value from exception_default_value
Propagate: Let exception halt workflow (default behavior)

Exception configuration is stored in component parameters (agent/component/base.py46-50) and evaluated after component execution (agent/canvas.py571-581).

Sources: agent/component/base.py365-585 agent/component/llm.py33-448 agent/component/agent_with_tools.py38-534 agent/component/categorize.py29-164 agent/canvas.py40-831 api/apps/canvas_app.py132-366 rag/prompts/generator.py159-467 agent/tools/base.py34-216

Built-in Components

Relevant source files

Component Overview

Component Categories

Built-in components fall into five functional categories:

Category	Components	Purpose
LLM Interaction	LLM, Agent	Direct LLM calls with prompt templates and tool-augmented ReAct agents
Information Retrieval	Retrieval, Generate	Knowledge base search and context-aware content generation
Control Flow	Categorize, Switch, Iteration, Loop, ExitLoop	Conditional branching and iterative execution
User Interaction	Begin, UserFillUp, Message	Workflow entry points and user input collection
Content Processing	Template	Dynamic prompt construction with variable substitution

LLM Component

The LLM component (agent/component/llm.py82-448) provides direct access to language models with configurable prompts, generation parameters, and optional structured output.

Component Structure

Sources: agent/component/llm.py33-80 agent/component/llm.py82-448

Configuration Parameters

Core Parameters (agent/component/llm.py33-59):

llm_id (required): Model identifier from tenant LLM configuration
sys_prompt: System-level instructions prepended to conversation
prompts: List of message dictionaries with role and content fields
message_history_window_size: Number of previous messages to include (default: 13)

Generation Control (agent/component/llm.py61-79):

temperature: Sampling randomness (0.0-1.0)
max_tokens: Maximum response length
top_p: Nucleus sampling threshold
presence_penalty: Repetition penalty for new topics
frequency_penalty: Repetition penalty for frequent tokens

Advanced Features (agent/component/llm.py48-50):

cite: Enable automatic citation insertion for retrieved content
output_structure: JSON schema for structured output validation
visual_files_var: Variable reference for image inputs

Variable Resolution in Prompts

Sources: agent/component/base.py500-511 agent/component/llm.py102-109 agent/canvas.py164-236

Structured Output Mode

When output_structure is configured with a JSON schema, the component enforces format compliance through retry logic (agent/component/llm.py365-448):

System prompt is augmented with schema specification via structured_output_prompt() (rag/prompts/generator.py443-445)
LLM generates response attempting to match schema
Response is parsed with json_repair.loads() for robustness
On validation failure, the component retries up to max_retries times
If all retries fail, sets _ERROR output instead of throwing exception

Image Input Handling

The component automatically detects base64-encoded images in input variables (agent/component/llm.py129-167):

Extracts data:image/ prefixed strings from all input values
Removes image data from text content to avoid token bloat
Switches to LLMType.IMAGE2TEXT model if images are present (agent/component/llm.py248-252)
Passes images to LLMBundle.async_chat() via images parameter

Agent Component

Architecture

Sources: agent/component/agent_with_tools.py38-81 agent/component/agent_with_tools.py83-118

Tool Loading and Registration

The agent loads tool components during initialization (agent/component/agent_with_tools.py88-116):

For MCP (Model Context Protocol) servers, the agent establishes connections and registers remote tools (agent/component/agent_with_tools.py108-114).

ReAct Execution Loop

The agent's core execution logic (agent/component/agent_with_tools.py278-419) follows this pattern:

Sources: agent/component/agent_with_tools.py278-419 rag/prompts/generator.py395-414

Task Analysis and Planning

The agent uses specialized prompts for different phases (rag/prompts/generator.py172-177):

Task Analysis (ANALYZE_TASK_SYSTEM + ANALYZE_TASK_USER): Breaks down the user request and available tools
Next Step Planning (NEXT_STEP): Determines which tool to invoke next based on conversation history
Reflection (REFLECT): Summarizes tool call results and updates understanding

The next_step_async() function (rag/prompts/generator.py395-414) orchestrates tool selection:

Sources: rag/prompts/generator.py395-414 agent/component/agent_with_tools.py376-410

Tool Call Callback and Tracing

Every tool invocation triggers a callback that logs execution details to Redis (agent/canvas.py779-801):

This enables the trace visualization feature accessed via /canvas/trace endpoint (api/apps/canvas_app.py551-563).

Structured Output with Tools

When both tools and structured output are configured, the agent completes tool calls before enforcing schema validation (agent/component/agent_with_tools.py200-254):

Execute ReAct loop until complete_task is called
Collect full response text
Attempt JSON parsing with retry logic
Return structured object via structured output key

Categorize Component

Component Configuration

Sources: agent/component/categorize.py29-95 agent/component/categorize.py97-164

Category Definition Structure

Each category in category_description is a dictionary with three fields (agent/component/categorize.py36-48):

Classification Prompt Generation

The component dynamically builds a classification prompt (agent/component/categorize.py58-94):

Lists all category names
Includes category descriptions
Provides example mappings: USER: "query" → category_name
Instructs LLM to return only the category name

Execution Flow

Sources: agent/component/categorize.py107-156 agent/canvas.py612-613

Fallback Behavior

Retrieval Component

The Retrieval component (referenced in agent/canvas.py52-59) performs hybrid search over knowledge bases, combining vector similarity and keyword matching to retrieve relevant document chunks.

Typical Configuration

Based on the DSL structure in Canvas initialization:

Output Format

The component produces two output variables:

content: Formatted string representation of retrieved chunks (via kb_prompt() from rag/prompts/generator.py99-143)
_references: Structured list of chunk metadata for citation

Retrieval results are automatically added to the canvas reference store (agent/canvas.py803-821):

Generate Component

The Generate component (mentioned in agent/canvas.py60-67) performs context-aware text generation, typically consuming retrieval results and user queries to produce grounded answers.

Standard Workflow Pattern

Sources: agent/canvas.py42-78

Citation Integration

The citation prompt template (rag/prompts/generator.py159-160) instructs the LLM to:

Use inline citations in format [ID: hash]
Only cite chunks actually referenced
Group citations at end of relevant sentences

Template Component

The Template component (referenced in workflow examples) provides Jinja2-based templating for dynamic prompt construction with complex variable substitution.

Usage Pattern

The template engine supports:

Variable expansion from canvas state
Jinja2 control structures (loops, conditionals)
Filters and functions for text transformation

Message Component

The Message component (agent/canvas.py503-562) handles final output delivery to users, supporting text content, attachments, status codes, and optional text-to-speech conversion.

Output Fields

Sources: agent/canvas.py503-562

Auto-Play TTS Feature

When auto_play parameter is enabled (agent/canvas.py504-505), the component:

Initializes a LLMBundle with LLMType.TTS
Streams content chunks to TTS model via tts() method (agent/canvas.py675-716)
Accumulates audio binary and encodes as hex
Emits audio_binary in message events for client-side playback

Text is cleaned before TTS (agent/canvas.py676-703):

Removes control characters and emojis
Truncates to 500 characters max
Strips markdown formatting

Citation Reference Injection

If the message content contains citation patterns ([ID: N]), the component automatically attaches retrieval references via get_reference() (agent/canvas.py560-561 agent/canvas.py818-821).

Begin Component

The Begin component (agent/canvas.py446-448) serves as the workflow entry point, accepting initial user inputs and system-level configuration.

Operating Modes

Standard Mode: Receives query and files via Canvas API (api/apps/canvas_app.py132-186)

Webhook Mode (agent/canvas.py378-387): Accepts payload via HTTP webhook with custom input mapping:

Prologue Feature

The Begin component's prologue parameter (agent/canvas.py733-734) defines an initial assistant message shown before workflow execution, useful for greetings or instructions.

UserFillUp Component

The UserFillUp component (agent/canvas.py628-642) pauses workflow execution to collect additional user inputs mid-execution.

Execution Behavior

Sources: agent/canvas.py628-642 agent/canvas.py410-412

Input Definition

The component defines required inputs through its parameter class:

When enable_tips is true, the tips content is shown to users in the input prompt (agent/canvas.py638-639).

Control Flow Components

Switch Component

The Switch component (referenced in agent/canvas.py612) provides multi-way branching based on expression evaluation, similar to switch-case statements in programming languages.

Iteration Component

The Iteration component (agent/canvas.py608-611 agent/canvas.py614-615) repeats its child components over a collection:

Sources: agent/canvas.py608-621

Loop Component

The Loop component (agent/canvas.py608-611 agent/canvas.py614-621) provides conditional looping with explicit exit conditions:

Sources: agent/canvas.py608-621

When an ExitLoop component is executed within a Loop's child context, the loop terminates and execution proceeds to the Loop's downstream components (agent/canvas.py616-617).

Component Input/Output Patterns

All components follow consistent I/O patterns defined by the base classes.

Input Element Resolution

Components declare inputs by scanning their parameter values for variable references (agent/component/base.py500-511):

Output Storage

Components write outputs using set_output(key, value) (agent/component/base.py458-461):

Special output keys:

_ERROR: Signals component failure, triggers exception handler (agent/component/base.py463-464)
_next: Controls routing for branching components (agent/canvas.py612-613)
_created_time / _elapsed_time: Performance metrics (agent/component/base.py408 agent/component/base.py418)

Variable Reference Patterns

Pattern	Example	Description
Component output	`{retrieval_0@content}`	References specific component's output variable
System variable	`{sys.query}`	Accesses canvas global state
Environment variable	`{env.api_key}`	User-defined workflow variables
Nested access	`{[email protected]}`	Dot notation for nested object fields

Sources: agent/component/base.py368 agent/canvas.py164-206

Component Debugging

The Canvas system provides a debug endpoint for testing individual components in isolation (api/apps/canvas_app.py332-366).

Debug Request Format

Debug Execution Flow

Sources: api/apps/canvas_app.py332-366

For LLM and Agent components, streaming outputs (stored as partial objects) are fully resolved during debug to return complete text (api/apps/canvas_app.py354-362).

Component Lifecycle

Sources: agent/component/base.py407-447 agent/canvas.py571-581

Exception Handling Configuration

Components support three exception handling strategies (agent/component/base.py567-582):

Goto: Redirect to alternative component path via exception_goto parameter
Default Value: Return fallback value from exception_default_value
Propagate: Let exception halt workflow (default behavior)

Exception configuration is stored in component parameters (agent/component/base.py46-50) and evaluated after component execution (agent/canvas.py571-581).

Built-in Components

Component Overview

Component Categories

LLM Component

Component Structure

Configuration Parameters

Variable Resolution in Prompts

Structured Output Mode

Image Input Handling

Agent Component

Architecture

Tool Loading and Registration

ReAct Execution Loop

Task Analysis and Planning

Tool Call Callback and Tracing

Structured Output with Tools

Categorize Component

Component Configuration

Category Definition Structure

Classification Prompt Generation

Execution Flow

Fallback Behavior

Retrieval Component

Typical Configuration

Output Format

Generate Component

Standard Workflow Pattern

Citation Integration

Template Component

Usage Pattern

Message Component

Output Fields

Auto-Play TTS Feature

Citation Reference Injection

Begin Component

Operating Modes

Prologue Feature

UserFillUp Component

Execution Behavior

Input Definition

Control Flow Components

Switch Component

Iteration Component

Loop Component

Component Input/Output Patterns

Input Element Resolution

Output Storage

Variable Reference Patterns

Component Debugging

Debug Request Format

Debug Execution Flow

Component Lifecycle

Exception Handling Configuration

On this page

Built-in Components

Component Overview

Component Categories

LLM Component

Component Structure

Configuration Parameters

Variable Resolution in Prompts

Structured Output Mode

Image Input Handling

Agent Component

Architecture

Tool Loading and Registration

ReAct Execution Loop

Task Analysis and Planning

Tool Call Callback and Tracing

Structured Output with Tools

Categorize Component

Component Configuration

Category Definition Structure

Classification Prompt Generation

Execution Flow

Fallback Behavior

Retrieval Component

Typical Configuration

Output Format

Generate Component