- ERNIE Team, Baidu. (2025). ERNIE 4.5 Technical Report.
- Cui, C. et al. (2025). PaddleOCR-VL: Boosting Multilingual Document Parsing via a 0.9B Ultra-Compact Vision-Language Model.
- Cui, C. et al. (2026). PaddleOCR-VL-1.5: Towards a Multi-Task 0.9B VLM for Robust In-the-Wild Document Parsing.
- Wang, H. et al. (2026). ERNIE 5.0 Technical Report.