Overview

1. Introduction to All-in-One Development¶

The All-in-One development tool PaddleX, based on the advanced technology of PaddleOCR, supports low-code full-process development capabilities in the OCR field. Through low-code development, simple and efficient model use, combination, and customization can be achieved. This will significantly reduce the time consumption of model development, lower its development difficulty, and greatly accelerate the application and promotion speed of models in the industry. Features include:

🎨 Rich Model One-Click Call: Integrates 48 models related to text image intelligent analysis, general OCR, general layout parsing, table recognition, formula recognition, and seal recognition into 10 pipelines, which can be quickly experienced through a simple Python API one-click call. In addition, the same set of APIs also supports a total of 200+ models in image classification, object detection, image segmentation, and time series forecasting, forming 30+ single-function modules, making it convenient for developers to use model combinations.
🚀 High Efficiency and Low barrier of entry: Provides two methods based on unified commands and GUI to achieve simple and efficient use, combination, and customization of models. Supports multiple deployment methods such as high-performance inference, service-oriented deployment, and edge deployment. Additionally, for various mainstream hardware such as NVIDIA GPU, Kunlunxin XPU, Ascend NPU, Cambricon MLU, and Haiguang DCU, models can be developed with seamless switching.

Note: PaddleX is committed to achieving pipeline-level model training, inference, and deployment. A model pipeline refers to a series of predefined development processes for specific AI tasks, including combinations of single models (single-function modules) that can independently complete a type of task.

In PaddleX, all 6 OCR-related pipelines support local inference, and some pipelines support online experience. You can quickly experience the pre-trained model effects of each pipeline. If you are satisfied with the pre-trained model effects of a pipeline, you can directly proceed with high-performance inference/service-oriented deployment/edge deployment. If not satisfied, you can also use the custom development capabilities of the pipeline to improve the effects. For the complete pipeline development process, please refer to PaddleX Pipeline Usage Overview or the tutorials for each pipeline.

In addition, PaddleX provides developers with a full-process efficient model training and deployment tool based on a cloud-based GUI. Developers do not need code development, just need to prepare a dataset that meets the pipeline requirements to quickly start model training. For details, please refer to the tutorial "Developing Industrial-level AI Models with Zero Barrier".

Pipeline	Online Experience	Local Inference	High-Performance Inference	Service-Oriented Deployment	Edge Deployment	Custom Development	No-Code Development On AI Studio
Document Image Preprocessing	🚧	✅	🚧	✅	🚧	✅	🚧
OCR	Link	✅	✅	✅	✅	✅	✅
Table Recognition	Link	✅	✅	✅	🚧	✅	✅
Table Recognition V2	🚧	✅	🚧	✅	🚧	✅	🚧
Formula Recognition	Link	✅	✅	✅	🚧	✅	✅
Seal Recognition	Link	✅	✅	✅	🚧	✅	✅
Layout Parsing	🚧	✅	✅	✅	🚧	✅	🚧
Layout Parsing v2	🚧	✅	🚧	✅	🚧	🚧	🚧
PP-ChatOCRv3-doc	Link	✅	✅	✅	🚧	✅	✅
PP-ChatOCRv4-doc	🚧	✅	✅	✅	🚧	✅	🚧

❗Note: The above capabilities are implemented based on GPU/CPU. PaddleX can also perform local inference and custom development on mainstream hardware such as Kunlunxin, Ascend, Cambricon, and Haiguang. The table below details the support status of the pipelines. For specific supported model lists, please refer to the Model List (Kunlunxin XPU)/Model List (Ascend NPU)/Model List (Cambricon MLU)/Model List (Haiguang DCU). We are continuously adapting more models and promoting the implementation of high-performance and service-oriented deployment on mainstream hardware. 🚀 Support for Domestic Hardware Capabilities

Pipeline Name	Ascend 910B	Kunlunxin XPU	Cambricon MLU	Haiguang DCU
General OCR	✅	✅	✅	🚧
Table Recognition	✅	🚧	🚧	🚧

OCR Pipeline: Tutorial
Table Recognition Pipeline: Tutorial
PP-ChatOCRv3-doc Pipeline: Tutorial
Layout Parsing Pipeline: Tutorial
Formula Recognition Pipeline: Tutorial
Seal Recognition Pipeline: Tutorial

Text Detection Module: Tutorial
Seal Detection Module: Tutorial
Text Recognition Module: Tutorial
Formula Recognition Module: Tutorial
Table Structure Recognition Module: Tutorial
Text Image Unwarping Module: Tutorial
Layout Detection Module: Tutorial
Document Image Orientation Classification Module: Tutorial

Document Image Preprocessing Pipeline: Tutorial
OCR Pipeline: Tutorial
Table Recognition Pipeline: Tutorial
Table Recognition v2 Pipeline: Tutorial
Layout Parsing Pipeline: Tutorial
Layout Parsing v2 Pipeline: Tutorial
Formula Recognition: Tutorial
Seal Recognition: Tutorial
PP-ChatOCRv3-doc Pipeline: Tutorial
PP-ChatOCRv4-doc Pipeline: Tutorial

Text Detection Module: Tutorial
Seal Detection Module: Tutorial
Textline Orientation Classification Module: Tutorial
Text Recognition Module: Tutorial
Formula Recognition Module: Tutorial
Table Structure Recognition Module: Tutorial
Text Image Unwarping Module: Tutorial
Layout Detection Module: Tutorial
Document Image Orientation Classification Module: Tutorial
Table Cells Detection Module: Tutorial
Table Classification Module: Tutorial

Overview

1. Introduction to All-in-One Development¶

2. OCR-Related Capability Support¶

3. List and Tutorials of OCR-Related Model Pipelines¶

4. List and Tutorials of OCR-Related Modules¶

3. List of OCR-related Pipeline Models and Tutorials¶

4. List of OCR-related Single Function Modules and Tutorials¶