Releases: PaddlePaddle/PaddleOCR
PaddleOCRv2.7.5
PaddleOCRv2.7.4
This release contains the missed commits from v2.7.0 to v2.7.1.
Fixed: #11824
PaddleOCRv2.7.3
What's Changed
fixed #11808
PaddleOCRv2.7.2
What's Changed
- add finnish language files by @savikko in #10850
- fix cls_x and bbox_x is possibly unbound by @SigureMo in #10973
- update ppocrv4_framework by @tink2123 in #11047
- Update ONNX conversion readme_ch.md by @greyovo in #11030
- [TIPC]update tipc scripts and rm fluid api by @USTCKAY in #11098
- fix a bug for rec_postprocess.py by @Ataraxy33 in #11408
- Modify readme 27 by @zhangyubo0722 in #11424
- fix: layout recovery image:xxx.png,err msg: list index out of range by @santlchogva in #11405
- rm QR code in the document by @tink2123 in #11511
- rm QR code by @tink2123 in #11533
- Update custom.md by @jzhang533 in #11636
- fix AttributeError by @GreatV in #11556
- update pre-commit config by @jzhang533 in #11682
- lazy import PyMuPDF by @jzhang533 in #11685
- setup a workflow for publishing package to pypi, and bump version to … by @jzhang533 in #11800
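The "lazy import PyMuPDF" entry above refers to deferring an optional dependency until it is actually needed. A minimal sketch of that general pattern follows; it is not the code from #11685, and the helper names `lazy_import` and `open_pdf` are hypothetical, introduced only for illustration.

```python
import importlib
from types import ModuleType
from typing import Dict

# Cache of modules that have already been imported on demand.
_cache: Dict[str, ModuleType] = {}


def lazy_import(name: str) -> ModuleType:
    """Import `name` on first use and reuse the module afterwards."""
    if name not in _cache:
        try:
            _cache[name] = importlib.import_module(name)
        except ImportError as exc:
            # Fail only when the feature is used, with a clearer message.
            raise ImportError(
                f"Optional dependency '{name}' is required for this feature."
            ) from exc
    return _cache[name]


def open_pdf(path: str):
    # Hypothetical helper: PyMuPDF (module name `fitz`) is imported only
    # when a PDF is actually opened, not when this file is imported.
    fitz = lazy_import("fitz")
    return fitz.open(path)
```

With this shape, importing the package does not require PyMuPDF to be installed; only a call to `open_pdf` does.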
New Contributors
- @savikko made their first contribution in #10850
- @greyovo made their first contribution in #11030
- @santlchogva made their first contribution in #11405
- @jzhang533 made their first contribution in #11636
Full Changelog: v2.7.0...v2.7.2
PaddleOCRv2.7.1
New Projects
- Add Parseq recognition model.(#10836)
- Add text recognition function to return single character coordinates.(#10515)
New Features
- Add savefile option to save OCR output results.(#10628)
- Add more data preprocessing options to ppocr.py.(#10217)
- A single damaged image does not affect data set inference. (#10216)
- Compatible with fitz version. (#10181)
- Compatible with Pillow10.0 upgrade. (#10405)
- Add Finnish dictionary file. (#10850)
- Onnxruntime supports GPU. (#10668)
- TIPC supports XPU and NPU. (#10658, #10460)
- Add inference on mlu devices. (#10249)
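The entry "A single damaged image does not affect data set inference" describes skipping unreadable inputs instead of aborting the whole run. A rough sketch of that idea, assuming a caller-supplied loader, is shown below; the function name `load_all_skipping_failures` and its signature are hypothetical and do not come from PaddleOCR.

```python
from typing import Callable, Iterable, List, Tuple, TypeVar

T = TypeVar("T")


def load_all_skipping_failures(
    paths: Iterable[str],
    loader: Callable[[str], T],
) -> Tuple[List[Tuple[str, T]], List[Tuple[str, str]]]:
    """Load every path with `loader`, recording failures instead of raising.

    Returns (loaded, skipped): `loaded` holds (path, item) pairs and
    `skipped` holds (path, error message) pairs for inputs that failed.
    """
    loaded: List[Tuple[str, T]] = []
    skipped: List[Tuple[str, str]] = []
    for path in paths:
        try:
            loaded.append((path, loader(path)))
        except Exception as exc:  # one bad input must not stop the run
            skipped.append((path, str(exc)))
    return loaded, skipped
```

Returning the skipped paths alongside the results keeps the failures visible, so damaged files can be reported after inference finishes.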
BugFix
- Fixed the library-not-found error when packaging into an exe on Windows. (#10502)
- Fixed a bug where, when recognizing multiple PDF files, page recognition was affected by the maximum page count of the first file. (#10290)
- Fixed the problem of PPOCRLabel startup failure caused by mismatch in channel number. (#10847)
- Fix memory leak problem of cpp inference. (#10441)
- Fix DSR error caused by data augmentation. (#10662)
- Fix training seed problem. (#10614)
- Fix table_master tipc error. (#10514)
- Fixed the problem of error reporting when ppocr.py uses wandb. (#10251)
- Fix memory leak in predict_rec.py. (#10688)
- Fixed the issue where dis and iou cannot be calculated correctly due to the index error of structure_boxes in the PaddleStructure::rebuild_table function. (#10810)
- Compatible with the retirement of the fluid API in paddle 2.5. (#10391)
- Fix the performance problem of Tensor.numpy under stride. (#10773)
- Adapt the size of ABINet during export to the size of ABINetRecResizeImg. (#10892)
- Fix ABINet training error. (#10744)
- Fix KeyError in ch_PP-OCRv4_rec_distill.yml. (#10761)
Documentations Fix
- Fix algorithm_kie_vi_layoutxlm_en.md, kie_datasets_en.md, README.md, algorithm_overview.md, how_to_do_kie_en.md document issues. (#10717)
- Update documentation issues with setup.py. (#10749)
- Add pyyaml library in requirements.txt. (#10653)
New Contributors
- @RuohengMa made their first contribution in #10614
- @WilliamQf-AI made their first contribution in #10628
- @xlg-go made their first contribution in #10744
- @Gmgge made their first contribution in #10777
- @victor30608 made their first contribution in #11025
Full Changelog: v2.7.0...v2.7.1
PaddleOCRv2.7.0
Release Note
- Release PP-OCRv4, support mobile version and server version
- PP-OCRv4-mobile: At comparable speed, the effect in the Chinese scene is improved by 4.5% compared with PP-OCRv3, the English scene is improved by 10%, and the average recognition accuracy of the 80-language multilingual model is increased by more than 8%.
- PP-OCRv4-server: Release the OCR model with the highest accuracy at present; the detection model accuracy increased by 4.9% in the Chinese and English scenes, and the recognition model accuracy increased by 2%.
Refer to the quickstart to use it with a one-line command. The whole process of model training, inference, and high-performance deployment can also be completed with little code in the General OCR Industry Solution in PaddleX.
- Release PP-ChatOCR, a new scheme for extracting key information from general scenes using the PP-OCR model and ERNIE LLM.
PaddleOCRv2.6.0
Release Note
- Release PP-Structurev2, with functions and performance fully upgraded, adapted to Chinese scenes, and new support for Layout Recovery and a one-line command to convert PDF to Word;
- Layout Analysis optimization: model storage reduced by 95%, while speed increased by 11 times, and the average CPU time-cost is only 41ms;
- Table Recognition optimization: 3 optimization strategies are designed, and the model accuracy is improved by 6% under comparable time consumption;
- Key Information Extraction optimization: a visual-independent model structure is designed, the accuracy of semantic entity recognition is increased by 2.8%, and the accuracy of relation extraction is increased by 9.1%.
PaddleOCRv2.5.0
Release Note
- Release PP-OCRv3: With comparable speed, the effect of Chinese scene is further improved by 5% compared with PP-OCRv2, the effect of English scene is improved by 11%, and the average recognition accuracy of 80 language multilingual models is improved by more than 5%.
- Release PPOCRLabelv2: Add the annotation function for table recognition task, key information extraction task and irregular text image.
- Release the interactive e-book "Dive into OCR", covering the cutting-edge theory and code practice of OCR full-stack technology.
PaddleOCRv2.1.1
PaddleOCRv2.1.0
Release Note
- Newly released end-to-end text recognition algorithm PGNet, published at AAAI 2021. Find tutorial here.
- Newly released multi-language recognition model, supporting recognition of more than 80 languages. Find tutorial here.
- Optimize the performance of English recognition model.