This document describes the testing infrastructure, code quality tools, and quality assurance practices within DB-GPT. It covers test organization, the pytest testing framework, code quality tools (mypy for type checking and ruff for linting), test utilities, and guidelines for writing and running tests. The testing framework supports both unit tests (isolated component testing) and integration tests (multi-component interaction testing).
For information about the overall development workflow, see Development Environment Setup. For CI/CD pipelines and automated quality checks, see Contributing and CI/CD.
Sources: tests/unit_tests/test_plugins.py1-130 tests/unit_tests/vector_store/test_pgvector.py1-14 .mypy.ini1-127
DB-GPT organizes tests into two primary categories: unit tests and integration tests. The test suite is located in the tests/ directory at the repository root.
Test Directory Organization
| Directory Path | Test Type | Components Under Test | Key Test Files |
|---|---|---|---|
| tests/unit_tests/ | Unit tests | Individual components in isolation | test_plugins.py |
| tests/intetration_tests/ | Integration tests | Multi-component workflows | Directory structure only |
| tests/unit_tests/embedding_engine/ | Unit tests | dbgpt.EmbeddingEngine, KnowledgeType | url_test.py, document_test.py |
| tests/unit_tests/vector_store/ | Unit tests | dbgpt.storage.vector_store, IndexStoreBase, IndexStoreConfig | test_pgvector.py |
| tests/intetration_tests/vector_store/ | Integration tests | End-to-end vector storage workflows | __init__.py |
| tests/intetration_tests/kbqa/ | Integration tests | Knowledge-based QA pipelines | __init__.py |
Note: The directory name intetration_tests is misspelled in the codebase itself; tests must reference the directory with this spelling.
Sources: tests/unit_tests/embedding_engine/url_test.py1-21 tests/unit_tests/test_plugins.py1-130 tests/unit_tests/vector_store/test_pgvector.py1-14 tests/intetration_tests/vector_store/__init__.py1 tests/intetration_tests/kbqa/__init__.py1
Unit tests focus on testing individual components in isolation. They use mocking and fixtures to eliminate external dependencies.
Example: Vector Store Import Validation
The test_vetorestore_imports function validates that all vector store classes properly implement the required base classes:
tests/unit_tests/vector_store/test_pgvector.py5-13
This test validates the structure of all vector store implementations by:
- Iterating over `dbgpt.storage.vector_store.__all__`
- Using `getattr()` to dynamically retrieve each `(store_cls, config_cls)` tuple
- Checking `issubclass(store_cls, IndexStoreBase)` for the store class
- Checking `issubclass(config_cls, IndexStoreConfig)` for the configuration class

This pattern ensures consistent interface implementation across all vector store backends (Milvus, Chroma, PGVector, Elasticsearch, Weaviate, OceanBase).
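The pattern can be sketched with local stand-in classes (the real `IndexStoreBase` and `IndexStoreConfig` live in DB-GPT; everything below is a self-contained illustration, not the actual test):

```python
import types


class IndexStoreBase:
    """Local stand-in for DB-GPT's IndexStoreBase."""


class IndexStoreConfig:
    """Local stand-in for DB-GPT's IndexStoreConfig."""


class ChromaStore(IndexStoreBase):
    pass


class ChromaVectorConfig(IndexStoreConfig):
    pass


# Simulate the dbgpt.storage.vector_store module: each name in __all__
# resolves to a (store_cls, config_cls) tuple exposed on the module.
vector_store = types.SimpleNamespace(
    __all__=["Chroma"],
    Chroma=(ChromaStore, ChromaVectorConfig),
)


def check_vector_store_imports(module) -> None:
    """Assert every registered store/config pair subclasses the bases."""
    for name in module.__all__:
        store_cls, config_cls = getattr(module, name)
        assert issubclass(store_cls, IndexStoreBase), name
        assert issubclass(config_cls, IndexStoreConfig), name


check_vector_store_imports(vector_store)  # passes: both subclasses match
```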
Example: Plugin Testing
The plugin test suite demonstrates comprehensive unit testing patterns for the dbgpt.plugins module:
tests/unit_tests/test_plugins.py19-25
The test_inspect_zip_for_modules function validates inspect_zip_for_modules() by verifying it correctly identifies Python module files within plugin ZIP archives.
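A minimal sketch of that idea, assuming the function's job is simply to find module files inside an archive (the real signature in DB-GPT's plugin module may differ):

```python
import io
import zipfile


def inspect_zip_for_modules_sketch(zip_bytes: bytes) -> list:
    """Return archive entries that look like Python module files."""
    with zipfile.ZipFile(io.BytesIO(zip_bytes)) as zf:
        return [name for name in zf.namelist() if name.endswith("__init__.py")]


# Build a tiny in-memory plugin archive for demonstration.
buf = io.BytesIO()
with zipfile.ZipFile(buf, "w") as zf:
    zf.writestr("Auto-GPT-Plugin-Test-master/src/auto_gpt_vicuna/__init__.py", "")
    zf.writestr("Auto-GPT-Plugin-Test-master/README.md", "readme")

found = inspect_zip_for_modules_sketch(buf.getvalue())
# found == ["Auto-GPT-Plugin-Test-master/src/auto_gpt_vicuna/__init__.py"]
```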
Key testing patterns demonstrated:
- Using `MockConfig` objects to simulate configuration
- Using `monkeypatch.setattr("builtins.input", lambda _: "y")` to simulate user input
- Verifying `denylist_allowlist_check()` function behavior

Integration tests validate end-to-end workflows involving multiple components. The directory structure separates integration tests by functional area.
Example Test Areas:

- tests/intetration_tests/vector_store/ - End-to-end vector storage workflows
- tests/intetration_tests/kbqa/ - Knowledge-based QA pipelines
Sources: tests/unit_tests/vector_store/test_pgvector.py5-13 tests/unit_tests/test_plugins.py1-130
DB-GPT uses pytest as its primary testing framework. The framework provides automatic test discovery, fixtures for reusable setup, monkeypatching for isolating dependencies, and plugin-based extensions such as coverage reporting.
pytest Framework Execution Flow
Fixture Example
Mock configuration fixture for plugin testing:
tests/unit_tests/test_plugins.py28-38
The mock_config_denylist_allowlist_check fixture:
- Creates a `MockConfig` class instance
- Sets `plugins_denylist = ["BadPlugin"]`
- Sets `plugins_allowlist = ["GoodPlugin"]`
- Sets `authorise_key = "y"` and `exit_key = "n"` for user input simulation

Monkeypatch Example
Simulating user input in plugin authorization tests:
tests/unit_tests/test_plugins.py59-66
The test_denylist_allowlist_check_user_input_yes function:
- Accepts the `monkeypatch` fixture as a parameter
- Uses `monkeypatch.setattr("builtins.input", lambda _: "y")` to replace the `input()` function
- Calls `denylist_allowlist_check("UnknownPlugin", mock_config_denylist_allowlist_check)`
- Asserts the result is `True` when the user approves

Sources: tests/unit_tests/test_plugins.py28-66
Embedding engine tests validate document and URL embedding functionality using the dbgpt.EmbeddingEngine class. These tests are located in tests/unit_tests/embedding_engine/.
Embedding Engine Test Code Structure
URL Embedding Test Structure
tests/unit_tests/embedding_engine/url_test.py1-20
The URL test:
- Sets `knowledge_source = "https://docs.dbgpt.site/docs/overview"`
- Uses `KnowledgeType.URL.value` for web content
- Derives `vector_store_name = url.replace(":", "")` to create a valid store name
- Creates an `EmbeddingEngine` instance with the configuration
- Calls `embedding_engine.knowledge_embedding()` to execute the pipeline

Document Embedding Test Structure
tests/unit_tests/embedding_engine/document_test.py1-21
The document test:
- Sets `knowledge_source = document_path`, pointing to a local file
- Uses `KnowledgeType.DOCUMENT.value` for file content
- Supports `.md`, `.pdf`, `.docx`, `.csv`, and `.html` files
- Uses `vector_store_name = "document_test"` as a fixed identifier

Both tests follow the same pattern:

1. Configure `embedding_model`, `vector_store_type`, and `chroma_persist_path`
2. Build a `vector_store_config` dictionary with the required keys
3. Create an `EmbeddingEngine` with `knowledge_source`, `knowledge_type`, `model_name`, and `vector_store_config`
4. Call the `knowledge_embedding()` method to execute the embedding pipeline

Supported Document Types
| Format | File Extension | Notes |
|---|---|---|
| Markdown | .md | Text-based documentation |
| PDF | .pdf | Portable Document Format |
| Word | .docx | Microsoft Word documents |
| CSV | .csv | Comma-separated values |
| HTML | .html | Web pages |
Sources: tests/unit_tests/embedding_engine/url_test.py1-21 tests/unit_tests/embedding_engine/document_test.py1-21
Plugin testing validates the plugin system's security and functionality, including denylist/allowlist checks and plugin scanning.
Plugin Test Function Mapping to Code Under Test
tests/unit_tests/test_plugins.py12-16
Test resource constants:
| Constant | Value | Purpose |
|---|---|---|
| PLUGINS_TEST_DIR | "tests/unit/data/test_plugins" | Original test plugin directory path |
| PLUGINS_TEST_DIR_TEMP | "data/test_plugins" | Alternative test plugin directory |
| PLUGIN_TEST_ZIP_FILE | "Auto-GPT-Plugin-Test-master.zip" | Sample plugin ZIP for inspect_zip_for_modules() tests |
| PLUGIN_TEST_INIT_PY | "Auto-GPT-Plugin-Test-master/src/auto_gpt_vicuna/__init__.py" | Expected module path within the ZIP |
| PLUGIN_TEST_OPENAI | "https://weathergpt.vercel.app/" | OpenAI plugin URL for scan_plugins() tests |
Denylist/Allowlist Mock Config:
tests/unit_tests/test_plugins.py28-38
This mock provides:
- `plugins_denylist`: List of blocked plugins
- `plugins_allowlist`: List of approved plugins
- `authorise_key` / `exit_key`: User input options

OpenAI Plugin Mock Config:
tests/unit_tests/test_plugins.py89-102
Generic Plugin Mock Config:
tests/unit_tests/test_plugins.py111-123
| Test Function | Purpose | Mock/Fixture Used |
|---|---|---|
| test_inspect_zip_for_modules | Validates ZIP file module inspection | File system access |
| test_denylist_allowlist_check_denylist | Ensures denylisted plugins are blocked | mock_config_denylist_allowlist_check |
| test_denylist_allowlist_check_allowlist | Ensures allowlisted plugins are allowed | mock_config_denylist_allowlist_check |
| test_denylist_allowlist_check_user_input_yes | Tests the user approval flow | mock_config_denylist_allowlist_check + monkeypatch |
| test_denylist_allowlist_check_user_input_no | Tests the user rejection flow | mock_config_denylist_allowlist_check + monkeypatch |
| test_denylist_allowlist_check_user_input_invalid | Tests invalid input handling | mock_config_denylist_allowlist_check + monkeypatch |
| test_scan_plugins_openai | Tests OpenAI plugin scanning | mock_config_openai_plugin |
| test_scan_plugins_generic | Tests generic plugin scanning | mock_config_generic_plugin |
Sources: tests/unit_tests/test_plugins.py1-130
| Convention | Purpose | Discovery | Example Files |
|---|---|---|---|
| test_*.py | pytest test files | Collected and run automatically by pytest | test_plugins.py, test_pgvector.py |
| *_test.py | Example/demo scripts | Run manually as scripts | url_test.py, document_test.py |

Files matching test_*.py define test_* functions that pytest collects and runs automatically. The *_test.py files in this repository are example scripts that demonstrate usage at module level rather than defining test functions; run them directly, for example: `python tests/unit_tests/embedding_engine/url_test.py`.
Basic Test Function:
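The original snippet is not reproduced here; a generic, minimal pytest test function looks like this (all names are illustrative):

```python
def add(a: int, b: int) -> int:
    return a + b


def test_add() -> None:
    # pytest collects any test_* function in a test_*.py file
    assert add(2, 3) == 5
```

Running pytest on the containing file collects and executes `test_add` automatically.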
Test with Fixture:
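A hedged sketch of the fixture pattern: the attribute values mirror the mock config described earlier on this page, and the pytest decorator is shown in a comment so the snippet stays stdlib-runnable.

```python
class MockConfig:
    """Stand-in configuration object, modeled on the plugin-test mock."""

    def __init__(self) -> None:
        self.plugins_denylist = ["BadPlugin"]
        self.plugins_allowlist = ["GoodPlugin"]
        self.authorise_key = "y"
        self.exit_key = "n"


# With pytest, this factory would be decorated as a fixture:
#
#     @pytest.fixture
#     def mock_config_denylist_allowlist_check():
#         return MockConfig()
#
def make_mock_config() -> MockConfig:
    # A fresh, isolated instance per test keeps tests independent.
    return MockConfig()


def test_allowlist_contains_good_plugin() -> None:
    cfg = make_mock_config()
    assert "GoodPlugin" in cfg.plugins_allowlist


test_allowlist_contains_good_plugin()
```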
Test with Monkeypatch:
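A sketch of the monkeypatch idea, built around a hypothetical stand-in for `denylist_allowlist_check()`. Here `input()` is patched and restored by hand so the snippet runs without pytest; the pytest form appears in the comment.

```python
import builtins


def denylist_allowlist_check_sketch(plugin_name: str, config: dict) -> bool:
    """Hypothetical stand-in for DB-GPT's denylist_allowlist_check()."""
    if plugin_name in config["plugins_denylist"]:
        return False
    if plugin_name in config["plugins_allowlist"]:
        return True
    # Unknown plugin: ask the user.
    return input(f"Load plugin {plugin_name}? (y/n) ") == "y"


config = {"plugins_denylist": ["BadPlugin"], "plugins_allowlist": ["GoodPlugin"]}

# pytest equivalent:
#     def test_user_input_yes(monkeypatch):
#         monkeypatch.setattr("builtins.input", lambda _: "y")
#         assert denylist_allowlist_check_sketch("UnknownPlugin", config)
original_input = builtins.input
builtins.input = lambda _: "y"  # simulate the user typing "y"
try:
    approved = denylist_allowlist_check_sketch("UnknownPlugin", config)
finally:
    builtins.input = original_input  # monkeypatch undoes this automatically

assert approved is True
```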
Sources: tests/unit_tests/test_plugins.py19-130
Run all tests: `pytest tests/`

Run unit tests only: `pytest tests/unit_tests/`

Run integration tests only: `pytest tests/intetration_tests/`

Run a specific test file: `pytest tests/unit_tests/test_plugins.py`

Run a specific test function: `pytest tests/unit_tests/test_plugins.py::test_inspect_zip_for_modules`

Run tests matching a pattern: `pytest -k "denylist"`

Run with verbose output: `pytest -v tests/`

Run with a coverage report: `pytest --cov=dbgpt tests/`

Run with JUnit XML output: `pytest --junitxml=pytest_report.xml tests/`
pytest Execution Flow with Code References
Sources: tests/unit_tests/test_plugins.py1-130 tests/unit_tests/vector_store/test_pgvector.py1-14
The test_vetorestore_imports function demonstrates a pattern for validating that all implementations properly inherit from base classes:
tests/unit_tests/vector_store/test_pgvector.py5-13
Implementation details:
This pattern:
- Uses `dbgpt.storage.vector_store.__all__` to enumerate all implementations
- Uses `getattr(vector_store, cls)` to dynamically retrieve `(store_cls, config_cls)` tuples
- Validates `IndexStoreBase` inheritance for store classes
- Validates `IndexStoreConfig` inheritance for configuration classes

Mock configuration objects simulate the `dbgpt._private.config.Config` class for testing:
tests/unit_tests/test_plugins.py30-37
The MockConfig class:
Implementation characteristics:
- Mimics the `dbgpt._private.config.Config` interface
- Is exposed through the `@pytest.fixture` decorator for reusability

Testing interactive features using the monkeypatch fixture:
tests/unit_tests/test_plugins.py59-66
Implementation:
Pattern characteristics:
- Accepts the `monkeypatch` fixture as a function parameter
- Uses `monkeypatch.setattr("builtins.input", lambda _: "y")` to replace the global `input()` function
- Tests `denylist_allowlist_check()` without requiring actual user interaction
- Covers the input variants `"y"` (yes), `"n"` (no), and `"invalid"` (invalid input)

The embedding engine tests use a consistent three-step pattern:
Step 1: Configuration setup tests/unit_tests/embedding_engine/url_test.py3-11
Step 2: Engine initialization tests/unit_tests/embedding_engine/url_test.py12-17
Step 3: Execution tests/unit_tests/embedding_engine/url_test.py20
This pattern applies to testing other KnowledgeType values: DOCUMENT, TEXT, KNOWLEDGE_BASE.
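The three steps can be sketched as follows. Only the configuration step runs stand-alone; the engine calls require DB-GPT installed and are shown commented, and the exact `vector_store_config` keys are an assumption based on the parameters named above.

```python
# Step 1: configuration setup
embedding_model = "your_embedding_model"   # placeholder model identifier
vector_store_type = "Chroma"
chroma_persist_path = "your_persist_path"  # placeholder directory
knowledge_source = "https://docs.dbgpt.site/docs/overview"

vector_store_config = {
    "vector_store_name": knowledge_source.replace(":", ""),  # valid store name
    "vector_store_type": vector_store_type,
    "chroma_persist_path": chroma_persist_path,
}

# Step 2: engine initialization (requires DB-GPT installed)
# from dbgpt import EmbeddingEngine, KnowledgeType
# engine = EmbeddingEngine(
#     knowledge_source=knowledge_source,
#     knowledge_type=KnowledgeType.URL.value,
#     model_name=embedding_model,
#     vector_store_config=vector_store_config,
# )

# Step 3: execution
# engine.knowledge_embedding()
```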
Sources: tests/unit_tests/vector_store/test_pgvector.py5-13 tests/unit_tests/test_plugins.py28-66 tests/unit_tests/embedding_engine/url_test.py1-21
Principle: Each test should be independent and not rely on the execution of other tests.
Implementation:

- Create fresh state in fixtures for every test
- Avoid module-level mutable state shared between tests
- Never depend on test execution order
Example: tests/unit_tests/test_plugins.py28-38 - Each test gets a fresh mock config
Principle: Tests should not depend on external services, file systems, or databases unless testing integration.
Implementation:

- Mock external services and user input
- Use in-memory or temporary resources instead of real databases
- Reserve real-service access for integration tests
Example: tests/unit_tests/test_plugins.py45-48 - Mocking user input
Principle: Test names should clearly communicate what is being tested and the expected behavior.
Pattern: test_<component>_<scenario>_<expected_result>
Examples:
- `test_denylist_allowlist_check_denylist` - Tests denylist checking
- `test_denylist_allowlist_check_user_input_yes` - Tests the user approval flow
- `test_scan_plugins_openai` - Tests OpenAI plugin scanning

Principle: Tests serve as documentation for expected behavior.
Implementation:

- Use descriptive test and fixture names
- Add docstrings that explain the scenario under test
- Keep assertions explicit about the expected behavior
Example: tests/unit_tests/test_plugins.py31-32 - Mock config with descriptive docstring
Principle: Common setup should be extracted into reusable fixtures.
Implementation:
- Fixtures are declared with the `@pytest.fixture` decorator
- Fixtures can be scoped (`function`, `class`, `module`, `session`)

Examples: the `mock_config_denylist_allowlist_check`, `mock_config_openai_plugin`, and `mock_config_generic_plugin` fixtures in `test_plugins.py`.
Principle: Aim for high test coverage, especially for critical paths.
Focus Areas:

- Core interfaces and base classes
- Security-sensitive paths such as plugin allow/deny checks
- Error handling and edge cases
Example: Plugin tests cover multiple scenarios:

- Denylisted plugins are blocked
- Allowlisted plugins are approved
- User input handling (yes, no, and invalid responses)
- Plugin scanning (OpenAI and generic)
Principle: Unit tests should run quickly to enable fast feedback loops.
Implementation:

- Avoid network and disk I/O in unit tests
- Mock slow dependencies
- Keep per-test setup minimal
Sources: tests/unit_tests/test_plugins.py1-130 tests/unit_tests/vector_store/test_pgvector.py5-13
| Resource Type | Location | Purpose | Referenced In |
|---|---|---|---|
| Test plugins | tests/unit/data/test_plugins/ or data/test_plugins/ | Plugin ZIP files and modules for inspect_zip_for_modules() | PLUGINS_TEST_DIR, PLUGINS_TEST_DIR_TEMP constants |
| Test documents | Configured per test via knowledge_source parameter | Sample .md, .pdf, .docx, .csv, .html files | url_test.py, document_test.py |
| Test fixtures | Within test files using @pytest.fixture | Mock configuration objects (MockConfig classes) | mock_config_denylist_allowlist_check, etc. |
Example: tests/unit_tests/test_plugins.py12-13 defines PLUGINS_TEST_DIR and PLUGINS_TEST_DIR_TEMP constants for plugin test data location.
Embedding Engine Tests Configuration
Tests using dbgpt.EmbeddingEngine require:
| Parameter | Type | Example Value | Purpose |
|---|---|---|---|
| embedding_model | str | "your_embedding_model" | Embedding model identifier |
| vector_store_type | str | "Chroma" | Vector store implementation name |
| chroma_persist_path | str | "your_persist_path" | Local directory for Chroma persistence |
| knowledge_source | str | "https://docs.dbgpt.site/docs/overview" or "path/to/file.md" | URL or file path to embed |
| vector_store_name | str | "document_test" | Unique identifier for the vector store instance |
Plugin Tests Configuration
Tests using dbgpt.plugins functions require MockConfig with:
| Attribute | Type | Example Value | Purpose |
|---|---|---|---|
| plugins_dir | str | f"{current_dir}/data/test_plugins/" | Directory containing plugin files |
| plugins_denylist | List[str] | ["BadPlugin"] | List of blocked plugin names |
| plugins_allowlist | List[str] | ["GoodPlugin"] | List of approved plugin names |
| plugins_openai | List[str] | ["https://weathergpt.vercel.app/"] | List of OpenAI plugin URLs |
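Taken together, the attributes above suggest a mock config along these lines (a sketch only; the real fixtures in test_plugins.py define these values per test):

```python
import os
from typing import List


class MockPluginConfig:
    """Illustrative mock carrying the plugin attributes tabulated above."""

    def __init__(self) -> None:
        current_dir = os.getcwd()
        self.plugins_dir: str = f"{current_dir}/data/test_plugins/"
        self.plugins_denylist: List[str] = ["BadPlugin"]
        self.plugins_allowlist: List[str] = ["GoodPlugin"]
        self.plugins_openai: List[str] = ["https://weathergpt.vercel.app/"]


cfg = MockPluginConfig()
# cfg.plugins_dir ends with "data/test_plugins/"
```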
Sources: tests/unit_tests/embedding_engine/url_test.py3-11 tests/unit_tests/test_plugins.py12-16 tests/unit_tests/test_plugins.py93-101
Integration tests validate end-to-end workflows involving multiple components. These tests are located in tests/intetration_tests/ (note the spelling in the codebase).
Differences from Unit Tests:

- Exercise multiple components together rather than in isolation
- May use real services (databases, vector stores) instead of mocks
- Typically slower, and may require external infrastructure

When to Use Integration Tests:

- Validating multi-component workflows such as embedding plus vector storage
- Verifying behavior against real backends that mocks cannot capture
- Testing end-to-end pipelines such as knowledge-based QA
Sources: tests/intetration_tests/vector_store/__init__.py1 tests/intetration_tests/kbqa/__init__.py1
The .gitignore file defines test-related artifacts that are excluded from version control.
Coverage and Test Output Files
| Pattern | Description | Generated By |
|---|---|---|
| htmlcov/ | HTML coverage report directory | pytest --cov-report=html |
| .tox/ | tox virtual environment directory | tox test automation |
| .nox/ | nox virtual environment directory | nox test automation |
| .coverage | Coverage data file | pytest-cov / coverage.py |
| .coverage.* | Coverage data files for parallel runs | coverage combine |
| .cache | pytest cache directory | pytest |
| nosetests.xml | Nose test runner XML output | nosetests (legacy) |
| coverage.xml | XML coverage report | pytest --cov-report=xml |
| *.cover | Coverage report files | coverage report |
| *.py,cover | Coverage annotated Python files | coverage annotate |
| .hypothesis/ | Hypothesis testing database | pytest-hypothesis |
| .pytest_cache/ | pytest cache directory | pytest session cache |
Test Data and Temporary Files
| Pattern | Description |
|---|---|
| .plugin_env | Plugin environment configuration |
| /pilot/meta_data/alembic/versions/* | Database migration versions |
| /pilot/meta_data/*.db | Test metadata databases |
| /pilot/benchmark_meta_data/*.db | Benchmark test databases |
| /pilot/benchmark_meta_data/result/* | Benchmark test results |
These directories and files are generated during test execution and should not be committed to the repository.
Sources: .gitignore50-62 .gitignore152-156
DB-GPT uses GitHub Actions to automate testing and quality checks on every pull request and push to the main branch.
The test workflow runs automatically on:
- Pushes to the `main` branch
- Pull requests targeting the `main` branch
- Changes to `dbgpt/**`, `pilot/meta_data/**`, or the workflow files

CI Execution Matrix
The workflow uses a matrix strategy defined to test across multiple environments:
| Parameter | Values | Purpose |
|---|---|---|
| Operating System | ubuntu-latest, macos-latest | Cross-platform compatibility testing |
| Python Version | 3.10, 3.11 | Multi-version Python support |
| Total Combinations | 4 (2 OS × 2 Python versions) | Comprehensive environment coverage |
This creates four parallel CI jobs: ubuntu-3.10, ubuntu-3.11, macos-3.10, macos-3.11.
CI Workflow Execution Flow
Setup Phase
The CI workflow performs environment setup:
1. Uses `actions/checkout@v4` to clone the repository
2. Uses `actions/setup-python@v4` to install Python (version from the matrix)
3. Runs `python -m pip install --upgrade pip` to update pip
4. Runs `pip install -e ".[openai]"` to install DB-GPT with OpenAI extras
5. Runs `pip install -r requirements/dev-requirements.txt` to install development dependencies

Test Execution Phase
The workflow runs pytest with reporting options:
Command breakdown:
- `pytest dbgpt`: Run tests in the dbgpt/ package directory
- `--cov=dbgpt`: Measure code coverage for the dbgpt package
- `--cov-report=xml`: Generate a machine-readable XML coverage report with a matrix-specific filename
- `--cov-report=html`: Generate a human-readable HTML coverage report with a matrix-specific directory name
- `--junitxml`: Generate JUnit XML test results with a matrix-specific filename

Coverage Report Generation
On Ubuntu runs only, the workflow parses the coverage-${{ matrix.python-version }}-${{ matrix.os }}.xml file to extract package-level line coverage rates:
This extracts top-level package coverage (e.g., dbgpt.rag, dbgpt.model, dbgpt.core) and their line coverage rates.
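The workflow's exact parsing step is not shown here, but reading package-level line rates out of a Cobertura-format coverage XML (the format coverage.py emits) can be sketched like this:

```python
import xml.etree.ElementTree as ET

# Trimmed example of the coverage XML structure produced by pytest-cov.
COVERAGE_XML = """\
<coverage line-rate="0.85">
  <packages>
    <package name="dbgpt.rag" line-rate="0.82" branch-rate="0.75"/>
    <package name="dbgpt.core" line-rate="0.91" branch-rate="0.80"/>
  </packages>
</coverage>
"""

root = ET.fromstring(COVERAGE_XML)
# Collect each package's line coverage rate into a dict.
rates = {pkg.get("name"): float(pkg.get("line-rate")) for pkg in root.iter("package")}
# rates == {"dbgpt.rag": 0.82, "dbgpt.core": 0.91}
```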
Test Report Parsing
The workflow extracts test statistics from the pytest_report-${{ matrix.python-version }}-${{ matrix.os }}.xml file:
Extracted attributes:
- `tests="N"`: Total number of test functions executed
- `failures="N"`: Number of test failures
- `skipped="N"`: Number of skipped tests

Artifact Upload
All test and coverage reports are uploaded using actions/upload-artifact@v3:
- Artifact name: `test-and-coverage-results-${{ matrix.python-version }}-${{ matrix.os }}`
- `coverage-${{ matrix.python-version }}-${{ matrix.os }}.xml`: XML coverage data
- `htmlcov-${{ matrix.python-version }}-${{ matrix.os }}/`: HTML coverage report directory
- `pytest_report-${{ matrix.python-version }}-${{ matrix.os }}.xml`: JUnit test results

Example artifact names:
- `test-and-coverage-results-3.10-ubuntu-latest`
- `test-and-coverage-results-3.11-macos-latest`

In Pull Requests:
Coverage Reports:
Download the artifact and open `htmlcov-{version}-{os}/index.html` in a browser.

Test Reports:
Sources: .github/workflows/test-python.yml90-99
DB-GPT uses pytest-cov (a pytest plugin wrapping coverage.py) to measure code coverage during test execution.
Coverage Types and Report Formats
Coverage Report File Structure
Line Coverage:
Reported as `line-rate="0.85"` (85% line coverage).

Branch Coverage:

Reported as `branch-rate="0.75"` (75% branch coverage).

Sources: .gitignore50-62
Generate an HTML coverage report: `pytest --cov=dbgpt --cov-report=html tests/`
This creates the htmlcov/ directory (excluded from git via .gitignore:51).
View the HTML report by opening `htmlcov/index.html` in a browser.
Generate a terminal report with missing lines: `pytest --cov=dbgpt --cov-report=term-missing tests/`
This displays coverage summary in terminal and lists specific line numbers not covered.
Generate an XML report: `pytest --cov=dbgpt --cov-report=xml tests/`
This creates coverage.xml (excluded from git via .gitignore:58), which CI pipelines parse.
Coverage Configuration Options
| Option | Description | Output File/Directory | Excluded by .gitignore |
|---|---|---|---|
| --cov=<package> | Measure coverage for a package | .coverage data file | Yes (.coverage) |
| --cov=dbgpt | Measure coverage for the dbgpt package | .coverage | Yes |
| --cov-report=html | Generate HTML report | htmlcov/ directory | Yes (htmlcov/) |
| --cov-report=xml | Generate XML report | coverage.xml | Yes (coverage.xml) |
| --cov-report=term | Terminal summary | stdout | N/A |
| --cov-report=term-missing | Show missing lines in terminal | stdout | N/A |
| --cov-fail-under=<min> | Fail if coverage < threshold | Exit code | N/A |
| --cov-branch | Measure branch coverage | Adds branch data to reports | N/A |
Reading Coverage Reports
The HTML coverage report (htmlcov/index.html) displays:
| Report Element | Description | Example |
|---|---|---|
| Overall coverage percentage | Total line coverage across all measured code | 85% |
| Package breakdown | Coverage per package/module | dbgpt.rag: 82%, dbgpt.model: 78% |
| File-level coverage | Individual file coverage with percentages | dbgpt/core/interface/llm.py: 91% |
| Line-by-line view | Color-coded source code display | Green (covered), red (not covered), yellow (partially covered) |
| Missing lines | Specific line numbers not executed | Lines 45-52, 67 not covered |
| Branch coverage | Conditional branch execution status | Branch 3->4 not taken |
The XML coverage report (coverage.xml) contains:
- `<package name="dbgpt.rag" line-rate="0.82" branch-rate="0.75">`: Package-level metrics
- `<class name="EmbeddingEngine" filename="dbgpt/rag/embedding.py" line-rate="0.90">`: Class-level metrics
- `<line number="45" hits="0"/>`: Line-level hit counts

Sources: .gitignore50-62
- Focus coverage on the core packages (e.g., `dbgpt.rag`, `dbgpt.core`, `dbgpt.model`)
- Use `--cov-report=term-missing` to identify specific lines that are not tested
- Use `--cov-branch` to ensure both the true and false paths of conditionals are tested
- Exclude `tests/` from coverage measurement to focus on production code

Sources: .gitignore50-62
DB-GPT uses pre-commit hooks to enforce code quality standards before commits reach the repository. This catches issues early in the development process.
Install the pre-commit hooks with `pre-commit install`: CONTRIBUTING.md83-86
After installation, hooks run automatically on git commit. The hooks validate:
Pre-commit Hook Execution Flow
Sources: CONTRIBUTING.md83-146
DB-GPT provides Makefile targets for common quality checks: CONTRIBUTING.md112-130
Code Formatting (`make fmt`):
Runs ruff format to automatically format code according to Black-compatible style.
Linting Check (`make fmt-check`):
Runs ruff check without auto-fixing to verify code style compliance.
Type Checking (`make mypy`):
Runs mypy to validate type annotations and detect type errors.
Test Execution (`make test`):
Runs the full test suite using pytest.
Development Workflow: CONTRIBUTING.md100-146
1. Run `make fmt` to format code
2. Run `make test` to verify tests pass
3. Run `make mypy` to check types
4. Run `make fmt-check` to verify lint rules

If pre-commit hooks fail, fix the issues and re-run the commit command. The hooks ensure code quality before changes reach the remote repository.
Sources: CONTRIBUTING.md83-146
DB-GPT's testing infrastructure relies on several development dependencies defined in requirements/dev-requirements.txt1-18
Core Testing Framework
| Package | Version | Purpose |
|---|---|---|
| pytest | Latest | Primary testing framework |
| pytest-cov | Latest | Coverage reporting plugin |
| pytest-asyncio | Latest | Async test support |
| pytest-benchmark | Latest | Performance benchmarking |
| pytest-integration | Latest | Integration test markers |
| pytest-mock | Latest | Mocking utilities |
| pytest-recording | Latest | VCR-based HTTP recording |
| asynctest | Latest | Async testing utilities |
| aioresponses | Latest | Async HTTP mocking |
HTTP and Recording:
- `httpx`: Modern HTTP client for testing API calls
- `vcrpy<6.0.0`: Record and replay HTTP interactions for deterministic tests
- `pytesseract==0.3.10`: OCR for document processing tests

Type Checking and Quality:

- `mypy==1.7.0`: Static type checker
- `pre-commit`: Git hook framework for automated checks

Sources: requirements/dev-requirements.txt1-18
Using uv (recommended): CONTRIBUTING.md56-66
Using pip: `pip install -r requirements/dev-requirements.txt`
Sources: CONTRIBUTING.md56-66 requirements/dev-requirements.txt1-18
DB-GPT provides a Dev Container configuration for consistent development environments with pre-installed testing tools.
The Dev Container setup is defined in .devcontainer.json1-80 and uses a custom Docker image built from .devcontainer/Dockerfile.dev1-56
Dev Container Features
VS Code Extensions: .devcontainer.json67-77
The Dev Container automatically installs testing-related extensions:
- `ms-python.python`: Python language support
- `ms-python.vscode-pylance`: Python IntelliSense
- `ms-python.mypy-type-checker`: Mypy integration
- `charliermarsh.ruff`: Ruff linter and formatter
- `ms-python.flake8`: Flake8 linter
- `ms-python.autopep8`: Code formatting

Python Configuration: .devcontainer.json55-66
The Dev Container configures Python tools:
Post-create Setup: .devcontainer/post-create.sh1-70
After container creation, the script:
- Creates the `.env` configuration file (the script performs additional setup steps as well)

Development Workflow in Dev Container: .devcontainer/README.md13-36
Sources: .devcontainer.json1-80 .devcontainer/Dockerfile.dev1-56 .devcontainer/post-create.sh1-70 .devcontainer/README.md1-37
DB-GPT uses static analysis tools to maintain code quality and prevent common errors before runtime. The primary tools are mypy for type checking and ruff for linting and formatting.
mypy is a static type checker for Python that validates type annotations and detects type-related errors before code execution. The .mypy_cache/ directory (excluded from git via .gitignore136) stores mypy's incremental checking cache.
Type Checking Configuration Structure
mypy Configuration Hierarchy
Global Configuration
The global [mypy] section excludes the tests/ directory from type checking:
This exclusion is necessary because:
- Test code intentionally uses patterns (e.g., `MockConfig`, `monkeypatch.setattr`) that violate strict type safety

Core Module Configuration
| Module Pattern | Configuration Options | Rationale |
|---|---|---|
| [mypy-dbgpt.rag.*] | strict_optional=False, ignore_missing_imports=True, follow_imports=skip | RAG components use optional types flexibly; many embedding/vector store libraries lack type stubs |
| [mypy-dbgpt.app.*] | follow_imports=skip | Application layer integrates external services without complete type information |
| [mypy-dbgpt.serve.*] | follow_imports=skip | Service layer uses FastAPI and other frameworks with dynamic typing patterns |
| [mypy-dbgpt.model.*] | follow_imports=skip | Model layer interfaces with ML frameworks (PyTorch, vLLM) that have incomplete type stubs |
| [mypy-dbgpt.util.*] | follow_imports=skip | Utility modules have diverse dependencies with varying type coverage |
Third-party Library Configurations
DB-GPT configures mypy to handle third-party libraries lacking type stubs or having incompatible type definitions. Configuration pattern: [mypy-<library>.*] with ignore_missing_imports=True.
Storage and Vector Store Libraries
| Library | Configuration Section | Purpose |
|---|---|---|
| weaviate | [mypy-weaviate.*] | Weaviate vector database client |
| pymilvus | [mypy-pymilvus.*] | Milvus vector database client |
| elasticsearch | [mypy-elasticsearch.*] | Elasticsearch client |
| msgpack | [mypy-msgpack.*] | Binary serialization library |
| rocksdict | [mypy-rocksdict.*] | RocksDB Python bindings |
| cryptography | [mypy-cryptography.*] | Cryptographic operations |
Data Source Libraries
| Library | Configuration Section | Purpose |
|---|---|---|
| pyspark | [mypy-pyspark.*] | Apache Spark Python API |
| sqlparse | [mypy-sqlparse.*] | SQL parsing library |
| clickhouse_connect | [mypy-clickhouse_connect.*] | ClickHouse database client |
| neo4j | [mypy-neo4j.*] | Neo4j graph database client |
Agent and NLP Libraries
| Library | Configuration Section | Purpose |
|---|---|---|
| unstructured | [mypy-unstructured.*] | Document processing library |
| ollama | [mypy-ollama.*] | Ollama API client |
| pypdf | [mypy-pypdf.*] | PDF processing library |
| networkx | [mypy-networkx.*] | Graph algorithms library |
| seaborn | [mypy-seaborn.*] | Statistical visualization |
| rich | [mypy-rich.*] | Terminal formatting |
| qianfan | [mypy-qianfan.*] | Baidu Qianfan API client |
Pydantic Configuration
[mypy-pydantic.*] uses strict_optional=False and follow_imports=skip to accommodate Pydantic's dynamic model generation and metaclass patterns.
Running mypy

Type-check the package with `mypy dbgpt`; mypy discovers the `.mypy.ini` file at the repository root automatically.
mypy generates a .mypy_cache/ directory (excluded from git via .gitignore136) for incremental type checking performance.
Common mypy Errors and Resolutions
| Error Type | Description | Resolution |
|---|---|---|
| error: Missing type annotation | Variable lacks a type hint | Add a type annotation: var: str = "value" |
| error: Incompatible types | Type mismatch in assignment | Check type compatibility or use cast |
| error: Cannot find implementation or library stub | Missing type stubs for a library | Add a section to .mypy.ini with ignore_missing_imports=True |
| error: "Optional[X]" has no attribute "Y" | Accessing an attribute on an optional type | Check for None first: if obj is not None: obj.method() |
ruff is a fast Python linter and code formatter that combines the functionality of multiple tools (flake8, isort, black) in a single binary.
ruff Features
Running ruff

Lint with `ruff check .`, apply automatic fixes with `ruff check --fix .`, and format code with `ruff format .`.
Common ruff Rules
| Rule Code | Description | Category |
|---|---|---|
| E501 | Line too long (>88 characters) | Style |
| F401 | Module imported but unused | Pyflakes |
| F841 | Local variable assigned but never used | Pyflakes |
| I001 | Import block is un-sorted or un-formatted | isort |
| W291 | Trailing whitespace | Whitespace |
| C901 | Function is too complex | Complexity |
| S101 | Use of assert detected | Security (bandit) |
Integration with Development Workflow
Type Annotation Guidelines
Add type hints to function signatures:
Use Optional for nullable values:
Use generic types for collections:
Use Protocol for structural typing:
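The guidelines above, condensed into one runnable sketch (all names are illustrative, not DB-GPT APIs):

```python
from typing import Dict, List, Optional, Protocol


# Function signatures with explicit parameter and return annotations.
def embed_texts(texts: List[str], batch_size: int = 8) -> Dict[str, int]:
    return {text: len(text) for text in texts}


# Optional for values that may be None; narrow before use.
def first_or_none(items: List[str]) -> Optional[str]:
    return items[0] if items else None


# Protocol for structural typing: any object with a matching method conforms,
# without inheriting from the protocol class.
class SupportsEmbed(Protocol):
    def embed(self, text: str) -> List[float]: ...


class FakeEmbedder:
    def embed(self, text: str) -> List[float]:
        return [float(len(text))]


def run(embedder: SupportsEmbed, text: str) -> List[float]:
    return embedder.embed(text)


vector = run(FakeEmbedder(), "hello")  # [5.0]
```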
Linting Best Practices
- Run `ruff check --fix` before committing

Configuration Management
- Keep tool settings in `pyproject.toml` or dedicated configuration files
- Use `.mypy.ini` sections for module-specific settings

Sources: .mypy.ini1-127 examples/rag/embedding_rag_example.py1-59 examples/rag/rag_embedding_api_example.py1-89
The DB-GPT testing framework provides comprehensive testing infrastructure:
Key Test Categories:

- Unit tests for isolated components: plugins, vector stores, and the embedding engine
- Integration tests for multi-component workflows: vector storage and knowledge-based QA
Running Tests: use `pytest tests/` for the full suite, or target individual directories, files, and functions as described above.
For contributing tests to the codebase, see Contributing Guidelines.
Sources: tests/unit_tests/test_plugins.py1-130 tests/unit_tests/vector_store/test_pgvector.py1-14 tests/unit_tests/embedding_engine/url_test.py1-21 tests/unit_tests/embedding_engine/document_test.py1-21