Version Migration Guide

Relevant source files

Purpose and Scope

This document provides comprehensive guidance for migrating code from PaddleOCR 2.x to PaddleOCR 3.x. It covers the major architectural changes, API modifications, pipeline renames, and dependency updates that affect compatibility. Users currently running PaddleOCR 2.x should consult this guide before upgrading to understand breaking changes and required code modifications.

For information about the new features and capabilities in PaddleOCR 3.x, see System Architecture and Major Pipelines. For installation instructions, see Installation and Environment Setup.

1. Why PaddleOCR 3.x?

PaddleOCR 3.x represents a fundamental architectural redesign driven by four years of rapid feature expansion since the 2.0 release in February 2021. The 2.x series began with a lightweight-centric architecture but accumulated significant technical debt as new capabilities (multilingual recognition, layout analysis, document understanding) were added incrementally.

Key motivations for the 3.x upgrade:

Challenge in 2.x	Solution in 3.x
Code duplication and inconsistent interfaces across modules	Modular, plugin-based architecture with unified interfaces
Legacy dependencies incompatible with PaddlePaddle 3.0+	Full compatibility with PaddlePaddle 3.0's CINN compiler and new features
Limited integration with vision-language models	Native integration with VLMs (PaddleOCR-VL, PP-ChatOCRv4)
Fragmented deployment capabilities	Unified deployment framework through PaddleX integration
High maintenance burden from "bridging" layers	Clean separation between user API and underlying capabilities

Sources: README.md83-84 docs/update/upgrade_notes.md1-14 docs/update/upgrade_notes.en.md1-14

2. Architecture Evolution: 2.x to 3.x

The following diagram illustrates the fundamental architectural shift from PaddleOCR 2.x to 3.x:

PaddleOCR 2.x vs 3.x Architecture Comparison

Key architectural changes:

Unified Base Class: All pipelines in 3.x inherit from PaddleXPipelineWrapper, providing consistent behavior across different tasks
PaddleX Integration: Inference deployment now built on PaddleX's pipeline infrastructure, eliminating code duplication
Plugin System: High-performance inference (TensorRT, ONNX), service deployment, and hardware acceleration implemented as plugins
Clean Separation: User-facing API remains simple while complex multi-model orchestration handled by PaddleX layer

Sources: README.md77-81 docs/version3.x/paddleocr_and_paddlex.md13-22 docs/update/upgrade_notes.md15-22

3. Version Compatibility Matrix

PaddleOCR 3.x introduces strict version dependencies between three components: PaddleOCR, PaddleX, and PaddlePaddle framework. The following table shows the compatibility requirements:

PaddleOCR Version	PaddleX Version	PaddlePaddle Version	Notes
`3.0.0`	`3.0.0`	`>= 3.0.0`	Initial 3.x release
`3.0.1`	`3.0.1`	`>= 3.0.0`	Bug fixes
`3.0.2`	`3.0.2`	`>= 3.0.0`	Performance improvements
`3.0.3`	`>= 3.0.3`	`>= 3.0.0`	Relaxed PaddleX constraint
`3.1.x`	`>= 3.1.0, < 3.2.0`	`>= 3.0.0`	MCP server, multilingual
`3.2.x`	`>= 3.2.0, < 3.3.0`	`>= 3.0.0`	C++ deployment, benchmarks
`3.3.x`	`>= 3.3.0, < 3.4.0`	`>= 3.0.0`	PaddleOCR-VL
`3.4.x`	`>= 3.4.0, < 3.5.0`	`>= 3.0.0`	PaddleOCR-VL-1.5

Dependency Installation:

The PaddleX dependency is automatically installed when installing PaddleOCR. Note that only OCR-related dependencies are installed by default (~738 MB total), not all PaddleX capabilities.

Sources: docs/version3.x/paddleocr_and_paddlex.md25-36 docs/quick_start.en.md9-26 pyproject.toml41-46

4. API Changes and Migration

4.1 PaddleOCR Class Changes

PaddleOCR 2.x to 3.x API Comparison

Key changes in the PaddleOCR class:

Initialization parameters preserved: lang, use_angle_cls, use_gpu still work as expected
New initialization parameters:
- use_doc_orientation_classify: Document orientation classification (replaces use_angle_cls for document-level rotation)
- use_doc_unwarping: Text image unwarping preprocessing
- use_textline_orientation: Text line orientation classification
- paddlex_config: Path to PaddleX configuration file or config dictionary
Removed ocr() method parameters: det, rec, cls - use separate classes instead
New unified predict() method: Returns result objects instead of nested lists

Migration example:

Sources: docs/update/upgrade_notes.md26-68 docs/quick_start.en.md63-111 docs/update/upgrade_notes.en.md25-67

4.2 Result Object Changes

PaddleOCR 3.x Result Structure

3.x result object structure:

Sources: docs/quick_start.en.md89-111 docs/quick_start.md87-109

4.3 Removed and Deprecated Features

Features removed in 3.x:

2.x Feature	3.x Replacement	Migration Path
`ocr.ocr(det=False)`	`TextDetection` class	Use `from paddleocr import TextDetection`
`ocr.ocr(rec=False)`	`TextRecognition` class	Use `from paddleocr import TextRecognition`
`PPStructure` class	`PPStructureV3` class	Update class name and consult new API docs
`use_onnx` parameter	High-performance inference	Use `enable_hpi=True` or configure via PaddleX config
`show_log` parameter	New logging system	Import and configure `paddleocr._utils.logging.logger`
`draw_ocr()` function	`Result.save_to_img()`	Call method on result object

Example migrations:

Sources: docs/update/upgrade_notes.md69-73 docs/update/upgrade_notes.en.md68-72

4.4 Logging System Changes

The show_log parameter in 2.x had a design flaw: all PaddleOCR instances shared a single global logger, causing cross-instance interference. PaddleOCR 3.x introduces a new logging system:

Sources: docs/update/upgrade_notes.md69

5. Pipeline Name Mapping

PaddleOCR 3.x introduces new pipeline names aligned with PaddleX conventions. The following diagram shows the evolution:

Pipeline Evolution from 2.x to 3.x

Detailed mapping table:

2.x Pipeline	3.x Pipeline	PaddleX Registration Name	Primary Use Case
`PaddleOCR`	`PaddleOCR` (PP-OCRv5)	`OCR`	General text detection + recognition
N/A	`TextDetection`	Part of `OCR`	Text detection only
N/A	`TextRecognition`	Part of `OCR`	Text recognition only
`PPStructure`	`PPStructureV3`	`PP-StructureV3`	Document layout + multi-element parsing
N/A	`PPChatOCRv4Doc`	`PP-ChatOCRv4-doc`	Intelligent info extraction with LLM
N/A	`PaddleOCRVL`	`PaddleOCR-VL`, `PaddleOCR-VL-1.5`	VLM-based document parsing
N/A	`TableRecognitionPipelineV2`	`table_recognition_v2`	Table structure recognition
N/A	`SealRecognition`	`seal_recognition`	Seal/stamp text recognition
N/A	`FormulaRecognitionPipeline`	`formula_recognition`	Mathematical formula recognition
N/A	`PPDocTranslation`	`PP-DocTranslation`	Document translation

Model version selection:

The PaddleOCR class in 3.x defaults to PP-OCRv5 models but supports backward compatibility:

Sources: docs/version3.x/paddleocr_and_paddlex.md38-51 paddleocr/_pipelines/pp_structurev3.py25

6. Configuration System Changes

PaddleOCR 3.x introduces a hierarchical configuration system through PaddleX integration:

Configuration Flow Diagram

6.1 Exporting Configuration Files

Or via CLI:

6.2 Using Configuration Files

Python API:

CLI:

6.3 Configuration Override System

PaddleOCR parameters are mapped to PaddleX's nested structure via the STRUCTURE dictionary pattern found in each pipeline implementation:

This allows PaddleOCR's simplified parameter names to map to PaddleX's deeply nested configuration structure.

Sources: docs/version3.x/paddleocr_and_paddlex.md53-97 paddleocr/_pipelines/pp_structurev3.py304-337

7. Dependency Changes

7.1 Core Dependencies

pyproject.toml dependency specification:

7.2 Installation Options

The modular dependency system ensures that installing PaddleOCR doesn't pull in all of PaddleX's dependencies—only those needed for the selected features.

Sources: pyproject.toml41-61 docs/version3.x/paddleocr_and_paddlex.md23

8. Step-by-Step Migration Guide

8.1 Assessment Phase

Checklist for migration planning:

□ Identify all PaddleOCR 2.x usage locations in codebase
□ Determine which pipelines are used (OCR, PPStructure, etc.)
□ Check if custom models are used (model paths, custom configs)
□ Review deployment method (local inference, service, mobile)
□ Verify PaddlePaddle version compatibility (must be 3.0+)

8.2 Basic OCR Migration

Step 1: Update installation

Step 2: Update imports and initialization

Step 3: Update inference calls

Step 4: Update visualization

8.3 PPStructure Migration

8.4 Module-Specific Inference

8.5 High-Performance Inference

Sources: docs/update/upgrade_notes.md26-73 docs/quick_start.en.md63-197

9. Known Issues and Limitations

PaddleOCR 3.x continues to evolve. The following limitations are known as of version 3.4.x:

9.1 Deployment Limitations

Feature	Status	Workaround
C++ local deployment	Partial support (Linux/Windows only for select models)	Use Python deployment or wait for updates
High-throughput service deployment	Basic support available, not performance-parity with PaddleServing	Use Docker-based deployment, monitor for updates
Mobile/edge deployment	Limited to key models (PP-OCRv5, select modules)	Check model compatibility before deployment

9.2 Backward Compatibility Notes

2.x model files: Most 2.x trained models require re-export for 3.x compatibility
Custom training configs: YAML format changes require updating training configuration files
Legacy features: Some 2.x features moved to legacy support, see Historical Features

9.3 Migration Support

For issues during migration:

Documentation: Consult the complete documentation at PaddleOCR Docs
GitHub Issues: Report migration problems at PaddleOCR Issues
Community: Join WeChat groups or forums for community support

Sources: docs/update/upgrade_notes.md75-83 docs/update/upgrade_notes.en.md74-82 README.md148-222

10. Version-Specific Migration Notes

10.1 Migrating to 3.0.x (Initial Release)

Major changes:

First release with PaddleX integration
PP-OCRv5, PP-StructureV3, PP-ChatOCRv4 introduced
Required PaddlePaddle 3.0.0+

Breaking changes:

All API changes described in Section 4
PPStructure completely replaced by PPStructureV3

10.2 Migrating to 3.1.x

New features:

MCP server for agent applications
PP-DocTranslation pipeline
Multilingual PP-OCRv5 models (37+ languages)

Additional changes:

Relaxed dependency versions (numpy, pandas)
Python 3.12 support restored

10.3 Migrating to 3.2.x

New features:

Enhanced C++ deployment (Linux + Windows)
CUDA 12 support
Benchmark system for per-layer timing

Configuration changes:

enable_mkldnn parameter behavior clarified
New MKL-DNN cache capacity defaults

10.4 Migrating to 3.3.x / 3.4.x

New features:

PaddleOCR-VL (0.9B VLM) - 3.3.x
PaddleOCR-VL-1.5 - 3.4.x
111 language support in VLM
Cross-page table merging in PP-StructureV3

Model changes:

Default OCR models remain PP-OCRv5
VLM available via PaddleOCRVL class

Sources: README.md85-237

Summary: PaddleOCR 3.x represents a major architectural evolution requiring careful migration planning. The core workflow remains conceptually similar, but implementation details have changed significantly to support modular design, VLM integration, and PaddlePaddle 3.0 features. Most migration effort involves updating API calls, result handling, and logging configuration. The new architecture provides cleaner abstractions and better extensibility for future enhancements.

Version Migration Guide

Relevant source files

Purpose and Scope

For information about the new features and capabilities in PaddleOCR 3.x, see System Architecture and Major Pipelines. For installation instructions, see Installation and Environment Setup.

1. Why PaddleOCR 3.x?

Key motivations for the 3.x upgrade:

Challenge in 2.x	Solution in 3.x
Code duplication and inconsistent interfaces across modules	Modular, plugin-based architecture with unified interfaces
Legacy dependencies incompatible with PaddlePaddle 3.0+	Full compatibility with PaddlePaddle 3.0's CINN compiler and new features
Limited integration with vision-language models	Native integration with VLMs (PaddleOCR-VL, PP-ChatOCRv4)
Fragmented deployment capabilities	Unified deployment framework through PaddleX integration
High maintenance burden from "bridging" layers	Clean separation between user API and underlying capabilities

Sources: README.md83-84 docs/update/upgrade_notes.md1-14 docs/update/upgrade_notes.en.md1-14

2. Architecture Evolution: 2.x to 3.x

The following diagram illustrates the fundamental architectural shift from PaddleOCR 2.x to 3.x:

PaddleOCR 2.x vs 3.x Architecture Comparison

Key architectural changes:

Unified Base Class: All pipelines in 3.x inherit from PaddleXPipelineWrapper, providing consistent behavior across different tasks
PaddleX Integration: Inference deployment now built on PaddleX's pipeline infrastructure, eliminating code duplication
Plugin System: High-performance inference (TensorRT, ONNX), service deployment, and hardware acceleration implemented as plugins
Clean Separation: User-facing API remains simple while complex multi-model orchestration handled by PaddleX layer

Sources: README.md77-81 docs/version3.x/paddleocr_and_paddlex.md13-22 docs/update/upgrade_notes.md15-22

3. Version Compatibility Matrix

PaddleOCR 3.x introduces strict version dependencies between three components: PaddleOCR, PaddleX, and PaddlePaddle framework. The following table shows the compatibility requirements:

PaddleOCR Version	PaddleX Version	PaddlePaddle Version	Notes
`3.0.0`	`3.0.0`	`>= 3.0.0`	Initial 3.x release
`3.0.1`	`3.0.1`	`>= 3.0.0`	Bug fixes
`3.0.2`	`3.0.2`	`>= 3.0.0`	Performance improvements
`3.0.3`	`>= 3.0.3`	`>= 3.0.0`	Relaxed PaddleX constraint
`3.1.x`	`>= 3.1.0, < 3.2.0`	`>= 3.0.0`	MCP server, multilingual
`3.2.x`	`>= 3.2.0, < 3.3.0`	`>= 3.0.0`	C++ deployment, benchmarks
`3.3.x`	`>= 3.3.0, < 3.4.0`	`>= 3.0.0`	PaddleOCR-VL
`3.4.x`	`>= 3.4.0, < 3.5.0`	`>= 3.0.0`	PaddleOCR-VL-1.5

Dependency Installation:

The PaddleX dependency is automatically installed when installing PaddleOCR. Note that only OCR-related dependencies are installed by default (~738 MB total), not all PaddleX capabilities.

Sources: docs/version3.x/paddleocr_and_paddlex.md25-36 docs/quick_start.en.md9-26 pyproject.toml41-46

4. API Changes and Migration

4.1 PaddleOCR Class Changes

PaddleOCR 2.x to 3.x API Comparison

Key changes in the PaddleOCR class:

Initialization parameters preserved: lang, use_angle_cls, use_gpu still work as expected
New initialization parameters:
- use_doc_orientation_classify: Document orientation classification (replaces use_angle_cls for document-level rotation)
- use_doc_unwarping: Text image unwarping preprocessing
- use_textline_orientation: Text line orientation classification
- paddlex_config: Path to PaddleX configuration file or config dictionary
Removed ocr() method parameters: det, rec, cls - use separate classes instead
New unified predict() method: Returns result objects instead of nested lists

Migration example:

Sources: docs/update/upgrade_notes.md26-68 docs/quick_start.en.md63-111 docs/update/upgrade_notes.en.md25-67

4.2 Result Object Changes

PaddleOCR 3.x Result Structure

3.x result object structure:

Sources: docs/quick_start.en.md89-111 docs/quick_start.md87-109

4.3 Removed and Deprecated Features

Features removed in 3.x:

2.x Feature	3.x Replacement	Migration Path
`ocr.ocr(det=False)`	`TextDetection` class	Use `from paddleocr import TextDetection`
`ocr.ocr(rec=False)`	`TextRecognition` class	Use `from paddleocr import TextRecognition`
`PPStructure` class	`PPStructureV3` class	Update class name and consult new API docs
`use_onnx` parameter	High-performance inference	Use `enable_hpi=True` or configure via PaddleX config
`show_log` parameter	New logging system	Import and configure `paddleocr._utils.logging.logger`
`draw_ocr()` function	`Result.save_to_img()`	Call method on result object

Example migrations:

Sources: docs/update/upgrade_notes.md69-73 docs/update/upgrade_notes.en.md68-72

4.4 Logging System Changes

The show_log parameter in 2.x had a design flaw: all PaddleOCR instances shared a single global logger, causing cross-instance interference. PaddleOCR 3.x introduces a new logging system:

Sources: docs/update/upgrade_notes.md69

5. Pipeline Name Mapping

PaddleOCR 3.x introduces new pipeline names aligned with PaddleX conventions. The following diagram shows the evolution:

Pipeline Evolution from 2.x to 3.x

Detailed mapping table:

2.x Pipeline	3.x Pipeline	PaddleX Registration Name	Primary Use Case
`PaddleOCR`	`PaddleOCR` (PP-OCRv5)	`OCR`	General text detection + recognition
N/A	`TextDetection`	Part of `OCR`	Text detection only
N/A	`TextRecognition`	Part of `OCR`	Text recognition only
`PPStructure`	`PPStructureV3`	`PP-StructureV3`	Document layout + multi-element parsing
N/A	`PPChatOCRv4Doc`	`PP-ChatOCRv4-doc`	Intelligent info extraction with LLM
N/A	`PaddleOCRVL`	`PaddleOCR-VL`, `PaddleOCR-VL-1.5`	VLM-based document parsing
N/A	`TableRecognitionPipelineV2`	`table_recognition_v2`	Table structure recognition
N/A	`SealRecognition`	`seal_recognition`	Seal/stamp text recognition
N/A	`FormulaRecognitionPipeline`	`formula_recognition`	Mathematical formula recognition
N/A	`PPDocTranslation`	`PP-DocTranslation`	Document translation

Model version selection:

The PaddleOCR class in 3.x defaults to PP-OCRv5 models but supports backward compatibility:

Sources: docs/version3.x/paddleocr_and_paddlex.md38-51 paddleocr/_pipelines/pp_structurev3.py25

6. Configuration System Changes

PaddleOCR 3.x introduces a hierarchical configuration system through PaddleX integration:

Configuration Flow Diagram

6.1 Exporting Configuration Files

Or via CLI:

6.2 Using Configuration Files

Python API:

CLI:

6.3 Configuration Override System

PaddleOCR parameters are mapped to PaddleX's nested structure via the STRUCTURE dictionary pattern found in each pipeline implementation:

This allows PaddleOCR's simplified parameter names to map to PaddleX's deeply nested configuration structure.

Sources: docs/version3.x/paddleocr_and_paddlex.md53-97 paddleocr/_pipelines/pp_structurev3.py304-337

7. Dependency Changes

7.1 Core Dependencies

pyproject.toml dependency specification:

7.2 Installation Options

The modular dependency system ensures that installing PaddleOCR doesn't pull in all of PaddleX's dependencies—only those needed for the selected features.

Sources: pyproject.toml41-61 docs/version3.x/paddleocr_and_paddlex.md23

8. Step-by-Step Migration Guide

8.1 Assessment Phase

Checklist for migration planning:

□ Identify all PaddleOCR 2.x usage locations in codebase
□ Determine which pipelines are used (OCR, PPStructure, etc.)
□ Check if custom models are used (model paths, custom configs)
□ Review deployment method (local inference, service, mobile)
□ Verify PaddlePaddle version compatibility (must be 3.0+)

8.2 Basic OCR Migration

Step 1: Update installation

Step 2: Update imports and initialization

Step 3: Update inference calls

Step 4: Update visualization

8.3 PPStructure Migration

8.4 Module-Specific Inference

8.5 High-Performance Inference

Sources: docs/update/upgrade_notes.md26-73 docs/quick_start.en.md63-197

9. Known Issues and Limitations

PaddleOCR 3.x continues to evolve. The following limitations are known as of version 3.4.x:

9.1 Deployment Limitations

Feature	Status	Workaround
C++ local deployment	Partial support (Linux/Windows only for select models)	Use Python deployment or wait for updates
High-throughput service deployment	Basic support available, not performance-parity with PaddleServing	Use Docker-based deployment, monitor for updates
Mobile/edge deployment	Limited to key models (PP-OCRv5, select modules)	Check model compatibility before deployment

9.2 Backward Compatibility Notes

2.x model files: Most 2.x trained models require re-export for 3.x compatibility
Custom training configs: YAML format changes require updating training configuration files
Legacy features: Some 2.x features moved to legacy support, see Historical Features

9.3 Migration Support

For issues during migration:

Documentation: Consult the complete documentation at PaddleOCR Docs
GitHub Issues: Report migration problems at PaddleOCR Issues
Community: Join WeChat groups or forums for community support

Sources: docs/update/upgrade_notes.md75-83 docs/update/upgrade_notes.en.md74-82 README.md148-222

10. Version-Specific Migration Notes

10.1 Migrating to 3.0.x (Initial Release)

Major changes:

First release with PaddleX integration
PP-OCRv5, PP-StructureV3, PP-ChatOCRv4 introduced
Required PaddlePaddle 3.0.0+

Breaking changes:

All API changes described in Section 4
PPStructure completely replaced by PPStructureV3

10.2 Migrating to 3.1.x

New features:

MCP server for agent applications
PP-DocTranslation pipeline
Multilingual PP-OCRv5 models (37+ languages)

Additional changes:

Relaxed dependency versions (numpy, pandas)
Python 3.12 support restored

10.3 Migrating to 3.2.x

New features:

Enhanced C++ deployment (Linux + Windows)
CUDA 12 support
Benchmark system for per-layer timing

Configuration changes:

enable_mkldnn parameter behavior clarified
New MKL-DNN cache capacity defaults

10.4 Migrating to 3.3.x / 3.4.x

New features:

PaddleOCR-VL (0.9B VLM) - 3.3.x
PaddleOCR-VL-1.5 - 3.4.x
111 language support in VLM
Cross-page table merging in PP-StructureV3

Model changes:

Default OCR models remain PP-OCRv5
VLM available via PaddleOCRVL class

Sources: README.md85-237

Version Migration Guide

Purpose and Scope

1. Why PaddleOCR 3.x?

2. Architecture Evolution: 2.x to 3.x

3. Version Compatibility Matrix

4. API Changes and Migration

4.1 PaddleOCR Class Changes

4.2 Result Object Changes

4.3 Removed and Deprecated Features

4.4 Logging System Changes

5. Pipeline Name Mapping

6. Configuration System Changes

6.1 Exporting Configuration Files

6.2 Using Configuration Files

6.3 Configuration Override System

7. Dependency Changes

7.1 Core Dependencies

7.2 Installation Options

8. Step-by-Step Migration Guide

8.1 Assessment Phase

8.2 Basic OCR Migration

8.3 PPStructure Migration

8.4 Module-Specific Inference

8.5 High-Performance Inference

9. Known Issues and Limitations

9.1 Deployment Limitations

9.2 Backward Compatibility Notes

9.3 Migration Support

10. Version-Specific Migration Notes

10.1 Migrating to 3.0.x (Initial Release)

10.2 Migrating to 3.1.x

10.3 Migrating to 3.2.x

10.4 Migrating to 3.3.x / 3.4.x

On this page

Version Migration Guide

Purpose and Scope

1. Why PaddleOCR 3.x?

2. Architecture Evolution: 2.x to 3.x

3. Version Compatibility Matrix

4. API Changes and Migration

4.1 PaddleOCR Class Changes

4.2 Result Object Changes

4.3 Removed and Deprecated Features

4.4 Logging System Changes

5. Pipeline Name Mapping

6. Configuration System Changes

6.1 Exporting Configuration Files

6.2 Using Configuration Files

6.3 Configuration Override System

7. Dependency Changes

7.1 Core Dependencies

7.2 Installation Options

8. Step-by-Step Migration Guide

8.1 Assessment Phase

8.2 Basic OCR Migration

8.3 PPStructure Migration

8.4 Module-Specific Inference

8.5 High-Performance Inference

9. Known Issues and Limitations

9.1 Deployment Limitations

9.2 Backward Compatibility Notes

9.3 Migration Support

10. Version-Specific Migration Notes

10.1 Migrating to 3.0.x (Initial Release)

10.2 Migrating to 3.1.x

10.3 Migrating to 3.2.x

10.4 Migrating to 3.3.x / 3.4.x

On this page