report-detect/python_api/models/PP-DocLayoutV3
黄仁欢 c7aa33c4a0 Use local OCR models and include offline model files 2026-03-16 16:34:15 +08:00
..
.cache/huggingface Use local OCR models and include offline model files 2026-03-16 16:34:15 +08:00
.gitattributes Use local OCR models and include offline model files 2026-03-16 16:34:15 +08:00
README.md Use local OCR models and include offline model files 2026-03-16 16:34:15 +08:00
inference.json Use local OCR models and include offline model files 2026-03-16 16:34:15 +08:00
inference.pdiparams Use local OCR models and include offline model files 2026-03-16 16:34:15 +08:00
inference.yml Use local OCR models and include offline model files 2026-03-16 16:34:15 +08:00

README.md

license pipeline_tag tags language library_name
apache-2.0 image-segmentation
PaddleOCR
PaddlePaddle
image-segmentation
ocr
layout
layout_detection
en
zh
multilingual
PaddleOCR

Layout Analysis Module of PaddleOCR-VL-1.5

repo HuggingFace ModelScope HuggingFace ModelScope Discord X License

🔥 Official Website | 📝 Technical Report

Introduction

This is the PP-Doclayoutv3 model weights for the PaddlePaddle framework. Get safetensors weights at PP-DocLayoutV3_safetensors

PP-DocLayoutV3 is specifically engineered to handle non-planar document images. It can directly predict multi-point bounding boxes for layout elements—as opposed to standard two-point boxes—and determine logical reading orders for skewed and curved surfaces within a single forward pass, significantly reducing cascading errors. This model is an essential component of PaddleOCR-VL-1.5, providing crucial layout analysis for the high-precision parsing of various real-world documents in PaddleOCR-VL.

Model Architecture

Visualization

Light Variation

Skewing

Screen-photo

Curving

Citation

If you find PP-DocLayoutV3 helpful, feel free to give us a star and citation.

@misc{cui2026paddleocrvl15multitask09bvlm,
      title={PaddleOCR-VL-1.5: Towards a Multi-Task 0.9B VLM for Robust In-the-Wild Document Parsing}, 
      author={Cheng Cui and Ting Sun and Suyin Liang and Tingquan Gao and Zelun Zhang and Jiaxuan Liu and Xueqing Wang and Changda Zhou and Hongen Liu and Manhui Lin and Yue Zhang and Yubo Zhang and Yi Liu and Dianhai Yu and Yanjun Ma},
      year={2026},
      eprint={2601.21957},
      archivePrefix={arXiv},
      primaryClass={cs.CV},
      url={https://arxiv.org/abs/2601.21957}, 
}