Releases · PaddlePaddle/PaddleOCR

add finnish language files by @savikko in #10850
fix cls_x and bbox_x is possibly unbound by @SigureMo in #10973
update ppocrv4_framework by @tink2123 in #11047
Update ONNX conversion readme_ch.md by @greyovo in #11030
[TIPC]update tipc scripts and rm fluid api by @USTCKAY in #11098
fix a bug for rec_postprocess.py by @Ataraxy33 in #11408
Modify readme 27 by @zhangyubo0722 in #11424
fix: layout recovery image:xxx.png,err msg: list index out of range by @santlchogva in #11405
rm QR code in the document by @tink2123 in #11511
rm QR code by @tink2123 in #11533
Update custom.md by @jzhang533 in #11636
fix AttributeError by @GreatV in #11556
update pre-commit config by @jzhang533 in #11682
lazy import PyMuPDF by @jzhang533 in #11685
setup a workflow for publishing package to pypi, and bump version to … by @jzhang533 in #11800

New Contributors

@savikko made their first contribution in #10850
@greyovo made their first contribution in #11030
@santlchogva made their first contribution in #11405
@jzhang533 made their first contribution in #11636

Full Changelog: v2.7.0...v2.7.2

Contributors

jzhang533, savikko, and 8 other contributors

Assets 2

18 Oct 12:32

shiyutang

v2.7.1

8b60a9c

PaddleOCRv2.7.1

New Projects

Add Parseq recognition model.(#10836)
Add text recognition function to return single character coordinates.(#10515)

New Features

Add savefile option to save OCR output results.(#10628)
Add more data preprocessing options to ppocr.py.(#10217)
A single damaged image does not affect data set inference. (#10216)
Compatible with fitz version. (#10181)
Compatible with Pillow10.0 upgrade. (#10405)
Add Finnish dictionary file. (#10850)
Onnxruntime supports GPU. (#10668)
TIPC supports XPU and NPU. (#10658, #10460)
Add inference on mlu devices. (#10249)

BugFix

Fixed cannot find the library error when packaged into exe on windows. (#10502)
Fixed the bug of recognize page is affected by the maximum number of PDF files when recognize multiple PDF files. (#10290)
Fixed the problem of PPOCRLabel startup failure caused by mismatch in channel number. (#10847)
Fix memory leak problem of cpp inference. (#10441)
Modify DSR error caused by data enhancement. (#10662)
Fix training seed problem. (#10614)
Fix table_master tipc error. (#10514)
Fixed the problem of error reporting when ppocr.py uses wandb. (#10251)
Fix memory leak in predict_rec.py. (#10688)
Fixed the issue where dis and iou cannot be calculated correctly due to the index error of structure_boxes in the PaddleStructure::rebuild_table function. (#10810)
Compatible with paddle 2.5 fluid exit. (#10391)
Fix the performance problem of Tensor.numpy under stride. (#10773)
Adapt the size of ABINet during export to the size of ABINetRecResizeImg. (#10892)
ABINet training error. (#10744)
Fix KeyError in ch_PP-OCRv4_rec_distill.yml. (#10761)

Documentations Fix

Fix algorithm_kie_vi_layoutxlm_en.md, kie_datasets_en.md, README.md, algorithm_overview.md, how_to_do_kie_en.md document issues. (#10717)
Update documentation issues with setup.py. (#10749)
Add pyyaml library in requirements.txt. (#10653)

New Projects

增加Parseq 识别模型。（#10836）
增加文字识别返回单字识别坐标功能。（#10515）

New Features

增加savefile选项，保存ocr输出结果。（#10628）
增加ppocr.py 更多数据预处理选项。（#10217）
单张破损图片不影响整体数据集推理。（#10216）
兼容fitz版本。（#10181）
兼容Pillow10.0升级。（#10405）
增加芬兰语字典文件。（#10850）
Onnxruntime 支持GPU。（#10668）
TIPC支持XPU、NPU。（#10658，#10460）
增加在mlu设备上的推理。（#10249）

BugFix

修复windows打包成exe找不到库的问题。（#10502）
修复多次识别pdf，受第一次页面最大数量影响的bug。（#10290）
修复通道数不匹配造成的PPOCRLabel启动失败问题。（#10847）
修复cpp推理的内存泄漏问题。（#10441）
修改数据增强导致的DSR报错。(#10662）
修复训练seed传递问题。（ #10614）
修复 table_master tipc 报错。（#10514）
修复ppocr.py使用wandb报错问题。（#10251）
修复predict_rec.py中的内存泄漏问题。（#10688）
修复PaddleStructure::rebuild_table函数中structure_boxes的索引错误导致dis和iou无法正确计算的问题。（#10810）
兼容paddle 2.5 fluid退场。（#10391）
修复stride下Tensor.numpy的性能问题。（#10773）
ABINet导出时尺寸适应 ABINetRecResizeImg 的尺寸。（#10892）
ABINet训练报错问题。（#10744）
解决ch_PP-OCRv4_rec_distill.yml中的keyError报错问题。（#10761）

Documentations Fix

修复 algorithm_kie_vi_layoutxlm_en.md、kie_datasets_en.md 、README.md、algorithm_overview.md、how_to_do_kie_en.md文档问题（#10717）
更新setup.py的文档问题（#10749）
修复requirements.txt中没有pyyaml库的问题（#10653）

New Contributors

@RuohengMa made their first contribution in #10614
@WilliamQf-AI made their first contribution in #10628
@xlg-go made their first contribution in #10744
@Gmgge made their first contribution in #10777
@victor30608 made their first contribution in #11025

Full Changelog: v2.7.0...v2.7.1

Contributors

xlg-go, WilliamQf-AI, and 3 other contributors

Assets 2

22 Sep 07:27

tink2123

v2.7.0

19ad3d9

PaddleOCRv2.7.0

Release Note

Release PP-OCRv4, support mobile version and server version
- PP-OCRv4-mobile：When the speed is comparable, the effect of the Chinese scene is improved by 4.5% compared with PP-OCRv3, the English scene is improved by 10%, and the average recognition accuracy of the 80-language multilingual model is increased by more than 8%.
- PP-OCRv4-server：Release the OCR model with the highest accuracy at present, the detection model accuracy increased by 4.9% in the Chinese and English scenes, and the recognition model accuracy increased by 2%
  refer quickstart quick use by one line command, At the same time, the whole process of model training, reasoning, and high-performance deployment can also be completed with few code in the General OCR Industry Solution in PaddleX.
ReleasePP-ChatOCR, a new scheme for extracting key information of general scenes using PP-OCR model and ERNIE LLM.

Assets 2

24 Aug 09:04

MissPenguin

v2.6.0

56aaead

PaddleOCRv2.6.0

Release Note

Release PP-Structurev2，with functions and performance fully upgraded, adapted to Chinese scenes, and new support for Layout Recovery and one line command to convert PDF to Word;
Layout Analysis optimization: model storage reduced by 95%, while speed increased by 11 times, and the average CPU time-cost is only 41ms;
Table Recognition optimization: 3 optimization strategies are designed, and the model accuracy is improved by 6% under comparable time consumption;
Key Information Extraction optimization：a visual-independent model structure is designed, the accuracy of semantic entity recognition is increased by 2.8%, and the accuracy of relation extraction is increased by 9.1%.

Assets 2

09 May 11:48

MissPenguin

v2.5.0

460b1e8

PaddleOCRv2.5.0

Release Note

Release PP-OCRv3: With comparable speed, the effect of Chinese scene is further improved by 5% compared with PP-OCRv2, the effect of English scene is improved by 11%, and the average recognition accuracy of 80 language multilingual models is improved by more than 5%.
Release PPOCRLabelv2: Add the annotation function for table recognition task, key information extraction task and irregular text image.
Release interactive e-book "Dive into OCR", covers the cutting-edge theory and code practice of OCR full stack technology.

Assets 2

26 May 11:43

MissPenguin

v2.1.1

83a5f46

PaddleOCRv2.1.1

Release Note

Newly release model pruning and model quantization tools based on PaddleSlim. Path
Newly release mobile deployment tools based on Paddle-Lite. Path
Newly release Android demo of ppocr system. path
Newly release service deployment based on Paddle Serving. path

Assets 2

19 Apr 06:46

MissPenguin

v2.1.0

ff9f8ec

PaddleOCRv2.1.0

Release Note

Newly release end-to-end text recognition algorithm PGNet which is published in AAAI 2021. Find tutorial here.
Newly release multi language recognition model, support more than 80 languages recognition. Find tutorial here.
Optimize the performance of English recognition model.

Assets 2

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

What's Changed

What's Changed

New Contributors

Contributors

New Projects

New Features

BugFix

Documentations Fix

New Projects

New Features

BugFix

Documentations Fix

New Contributors

Contributors

Releases: PaddlePaddle/PaddleOCR

PaddleOCRv2.7.5

PaddleOCRv2.7.4

PaddleOCRv2.7.3

What's Changed

PaddleOCRv2.7.2

What's Changed

New Contributors

Contributors

PaddleOCRv2.7.1

New Projects

New Features

BugFix

Documentations Fix

New Projects

New Features

BugFix

Documentations Fix

New Contributors

Contributors

PaddleOCRv2.7.0

PaddleOCRv2.6.0

PaddleOCRv2.5.0

PaddleOCRv2.1.1

PaddleOCRv2.1.0