OpenOCR: An Open-Source Toolkit for General-OCR Research and Applications

If you find this project helpful, please give us a Star🌟

简体中文 | English


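OpenOCR is an open-source platform built by the OCR team at the FVL Lab of Fudan University, under the guidance of Prof. Yu-Gang Jiang and Prof. Zhineng Chen. It targets general OCR tasks such as text detection and recognition, formula and table recognition, and document parsing and understanding. The platform integrates a unified training and evaluation benchmark, commercial-grade OCR and document parsing systems, and reproductions of the core code of many academic papers.

OpenOCR aims to build a general-purpose open-source OCR ecosystem that bridges academic research and practical applications, promoting the joint development and broad adoption of OCR technology at the research frontier and in industrial scenarios. Researchers, developers, and enterprises are welcome to use it and share suggestions.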

Core Features

In-House OCR Algorithms

  • UniRec-0.1B (Yongkun Du, Zhineng Chen, Yazhen Xie, Weikang Bai, Hao Feng, Wei Shi, Yuchen Su, Can Huang, Yu-Gang Jiang. UniRec-0.1B: Unified Text and Formula Recognition with 0.1B Parameters, Preprint. Doc, Paper)
  • MDiff4STR (Yongkun Du, Miaomiao Zhao, Songlin Fan, Zhineng Chen*, Caiyan Jia, Yu-Gang Jiang. MDiff4STR: Mask Diffusion Model for Scene Text Recognition, AAAI 2026 Oral. Doc, Paper)
  • CMER (Weikang Bai, Yongkun Du, Yuchen Su, Yazhen Xie, Zhineng Chen*. Complex Mathematical Expression Recognition: Benchmark, Large-Scale Dataset and Strong Baseline, AAAI 2026. Doc, Paper)
  • TextSSR (Xingsong Ye, Yongkun Du, Yunbo Tao, Zhineng Chen*. TextSSR: Diffusion-based Data Synthesis for Scene Text Recognition, ICCV 2025. Paper, Code)
  • SVTRv2 (Yongkun Du, Zhineng Chen*, Hongtao Xie, Caiyan Jia, Yu-Gang Jiang. SVTRv2: CTC Beats Encoder-Decoder Models in Scene Text Recognition, ICCV 2025. Doc, Paper)
  • IGTR (Yongkun Du, Zhineng Chen*, Yuchen Su, Caiyan Jia, Yu-Gang Jiang. Instruction-Guided Scene Text Recognition, TPAMI 2025. Doc, Paper)
  • CPPD (Yongkun Du, Zhineng Chen*, Caiyan Jia, Xiaoting Yin, Chenxia Li, Yuning Du, Yu-Gang Jiang. Context Perception Parallel Decoder for Scene Text Recognition, TPAMI 2025. PaddleOCR Doc, Paper)
  • SMTR&FocalSVTR (Yongkun Du, Zhineng Chen*, Caiyan Jia, Xieping Gao, Yu-Gang Jiang. Out of Length Text Recognition with Sub-String Matching, AAAI 2025. Doc, Paper)
  • DPTR (Shuai Zhao, Yongkun Du, Zhineng Chen*, Yu-Gang Jiang. Decoder Pre-Training with only Text for Scene Text Recognition, ACM MM 2024. Paper)
  • CDistNet (Tianlun Zheng, Zhineng Chen*, Shancheng Fang, Hongtao Xie, Yu-Gang Jiang. CDistNet: Perceiving Multi-Domain Character Distance for Robust Text Recognition, IJCV 2024. Paper)
  • MRN (Tianlun Zheng, Zhineng Chen*, Bingchen Huang, Wei Zhang, Yu-Gang Jiang. MRN: Multiplexed Routing Network for Incremental Multilingual Text Recognition, ICCV 2023. Paper, Code)
  • TPS++ (Tianlun Zheng, Zhineng Chen*, Jinfeng Bai, Hongtao Xie, Yu-Gang Jiang. TPS++: Attention-Enhanced Thin-Plate Spline for Scene Text Recognition, IJCAI 2023. Paper, Code)
  • SVTR (Yongkun Du, Zhineng Chen*, Caiyan Jia, Xiaoting Yin, Tianlun Zheng, Chenxia Li, Yuning Du, Yu-Gang Jiang. SVTR: Scene Text Recognition with a Single Visual Model, IJCAI 2022 (Long). PaddleOCR Doc, Paper)
  • NRTR (Fenfen Sheng, Zhineng Chen, Bo Xu. NRTR: A No-Recurrence Sequence-to-Sequence Model For Scene Text Recognition, ICDAR 2019. Paper)

Recent Updates

Algorithm Reproduction Plan

Scene Text Recognition (STR)

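| Method | Venue | Training Support | Evaluation Support | Contributor |
|---|---|---|---|---|
| CRNN | TPAMI 2016 | | | |
| ASTER | TPAMI 2019 | | | pretto0 |
| NRTR | ICDAR 2019 | | | |
| SAR | AAAI 2019 | | | pretto0 |
| MORAN | PR 2019 | | | |
| DAN | AAAI 2020 | | | |
| RobustScanner | ECCV 2020 | | | pretto0 |
| AutoSTR | ECCV 2020 | | | |
| SRN | CVPR 2020 | | | pretto0 |
| SEED | CVPR 2020 | | | |
| ABINet | CVPR 2021 | | | YesianRohn |
| VisionLAN | ICCV 2021 | | | YesianRohn |
| PIMNet | ACM MM 2021 | TODO | | |
| SVTR | IJCAI 2022 | | | |
| PARSeq | ECCV 2022 | | | |
| MATRN | ECCV 2022 | | | |
| MGP-STR | ECCV 2022 | | | |
| LPV | IJCAI 2023 | | | |
| MAERec(Union14M) | ICCV 2023 | | | |
| LISTER | ICCV 2023 | | | |
| CDistNet | IJCV 2024 | | | YesianRohn |
| BUSNet | AAAI 2024 | | | |
| DCTC | AAAI 2024 | TODO | | |
| CAM | PR 2024 | | | |
| OTE | CVPR 2024 | | | |
| CFF | IJCAI 2024 | TODO | | |
| DPTR | ACM MM 2024 | | | fd-zs |
| VIPTR | ACM CIKM 2024 | TODO | | |
| IGTR | TPAMI 2025 | | | |
| SMTR | AAAI 2025 | | | |
| CPPD | TPAMI 2025 | | | |
| FocalSVTR-CTC | AAAI 2025 | | | |
| SVTRv2 | ICCV 2025 | | | |
| ResNet+Trans-CTC | | | | |
| ViT-CTC | | | | |
| MDiff4STR | AAAI 2026 Oral | | | |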

Scene Text Detection (STD)

In development

End-to-End Text Recognition (Text Spotting)

In development


Citation

If our work has been helpful to your research, please cite:

@inproceedings{Du2025SVTRv2,
  title={SVTRv2: CTC Beats Encoder-Decoder Models in Scene Text Recognition},
  author={Yongkun Du and Zhineng Chen and Hongtao Xie and Caiyan Jia and Yu-Gang Jiang},
  booktitle={ICCV},
  year={2025},
  pages={20147-20156}
}

@article{du2025unirec,
  title={UniRec-0.1B: Unified Text and Formula Recognition with 0.1B Parameters},
  author={Yongkun Du and Zhineng Chen and Yazhen Xie and Weikang Bai and Hao Feng and Wei Shi and Yuchen Su and Can Huang and Yu-Gang Jiang},
  journal={arXiv preprint arXiv:2512.21095},
  year={2025}
}

Acknowledgments

This codebase is built on PaddleOCR, PytorchOCR, and MMOCR. Thanks for their excellent work!