python3.pkgs.paddle{ocr,x}: various improvements#475820

Merged
doronbehar merged 10 commits into NixOS:master from doronbehar:pkg/python-paddleocr on Jan 7, 2026
doronbehar (Contributor) commented Jan 1, 2026

Things done

  • Built on platform:
    • x86_64-linux
    • aarch64-linux
    • x86_64-darwin
    • aarch64-darwin
  • Tested, as applicable:
  • Ran nixpkgs-review on this PR. See nixpkgs-review usage.
  • Tested basic functionality of all binary files, usually in ./result/bin/.
  • Nixpkgs Release Notes
    • Package update: when the change is major or breaking.
  • NixOS Release Notes
    • Module addition: when adding a new NixOS module.
    • Module update: when the change is significant.
  • Fits CONTRIBUTING.md, pkgs/README.md, maintainers/README.md and other READMEs.

Add a 👍 reaction to pull requests you find important.

doronbehar force-pushed the pkg/python-paddleocr branch from fb3b788 to 19eb5a4 on January 1, 2026 08:09

nixpkgs-ci (bot) requested review from kyehn, natsukium and osbm on January 1, 2026 08:14

nixpkgs-ci (bot) added the following labels on Jan 1, 2026:

  • 10.rebuild-linux: 501+ (This PR causes many rebuilds on Linux and should normally target the staging branches.)
  • 10.rebuild-darwin: 101-500 (This PR causes between 101 and 500 packages to rebuild on Darwin.)
  • 10.rebuild-linux: 1001-2500 (This PR causes many rebuilds on Linux and should target the staging branches.)
  • 6.topic: python (Python is a high-level, general-purpose programming language.)

doronbehar force-pushed the pkg/python-paddleocr branch from 19eb5a4 to 1b64e8a on January 1, 2026 08:18
qzylinra (Contributor) commented Jan 1, 2026

Is the large amount of rebuilding due to datasets? Is it necessary?

doronbehar force-pushed the pkg/python-paddleocr branch from 1b64e8a to e641977 on January 1, 2026 15:54

nixpkgs-ci (bot) updated the labels on Jan 1, 2026.

Added:

  • 10.rebuild-linux: 11-100 (This PR causes between 11 and 100 packages to rebuild on Linux.)
  • 10.rebuild-darwin: 11-100 (This PR causes between 11 and 100 packages to rebuild on Darwin.)

Removed:

  • 10.rebuild-linux: 501+ (This PR causes many rebuilds on Linux and should normally target the staging branches.)
  • 10.rebuild-darwin: 101-500 (This PR causes between 101 and 500 packages to rebuild on Darwin.)
  • 10.rebuild-linux: 1001-2500 (This PR causes many rebuilds on Linux and should target the staging branches.)
doronbehar (Contributor, Author) commented

> Is the large amount of rebuilding due to datasets? Is it necessary?

Indeed it seems so, and it is not even necessary for this PR :) (I had a messy rebase after staging-next got merged). Here's the update of it anyway:

Comment thread on pkgs/development/python-modules/paddlex/default.nix

nixpkgs-ci (bot) added the label 12.approvals: 1 (This PR was reviewed and approved by one person.) on Jan 1, 2026

Comment thread on pkgs/development/python-modules/bce-python-sdk/default.nix (outdated)

doronbehar force-pushed the pkg/python-paddleocr branch from e641977 to d32c1a5 on January 4, 2026 14:05

doronbehar requested a review from qzylinra on January 4, 2026 15:02
qzylinra (Contributor) commented Jan 4, 2026

What is the purpose of adding premailer here?

doronbehar (Contributor, Author) commented

> What is the purpose of adding premailer here?

Good question! It is needed in order to use paddleocr. In an imperative setup, if paddlex[ocr] is installed, it installs premailer too:

https://github.com/PaddlePaddle/PaddleX/blob/37f1ffdc1daae40bbc5e17141d064951be9796ed/setup.py#L169-L177

qzylinra (Contributor) commented Jan 4, 2026

In fact, only the document_to_workbook function needs it. https://github.com/PaddlePaddle/PaddleX/blob/37f1ffdc1daae40bbc5e17141d064951be9796ed/paddlex/inference/utils/io/tablepyxl.py#L111

Will it work correctly? #348580

doronbehar (Contributor, Author) commented

Well, surprisingly, you are right: although the premailer dependency appears in that upstream OCR list, it wasn't needed in my development environment, so I removed it. Thanks for the link.
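
One way a dependency can appear in an upstream extras list yet go unused in practice is when only a single function imports it, and only when that function is called. The sketch below illustrates that general pattern; it is not PaddleX's actual code. The helper names `optional_import`, `inline_css` and `recognize_text` are hypothetical, and the only real third-party call assumed is `premailer.transform`.

```python
import importlib


def optional_import(name):
    """Return the named module if it is installed, otherwise None."""
    try:
        return importlib.import_module(name)
    except ImportError:
        # Also covers ModuleNotFoundError, which subclasses ImportError.
        return None


def inline_css(html):
    """Hypothetical helper: the only code path that needs premailer."""
    premailer = optional_import("premailer")
    if premailer is None:
        raise RuntimeError("premailer is required to inline CSS into the HTML")
    # premailer.transform() moves <style> rules into per-element style attributes.
    return premailer.transform(html)


def recognize_text(image_path):
    """Hypothetical stand-in for an OCR code path that never touches premailer."""
    return f"would run OCR on {image_path}"
```

With this layout, calling `recognize_text` works whether or not premailer is installed, and a missing premailer only surfaces as an error when `inline_css` is actually invoked.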

doronbehar force-pushed the pkg/python-paddleocr branch from d32c1a5 to 9a8c01e on January 4, 2026 18:01

Comment thread on pkgs/development/python-modules/bce-python-sdk/default.nix (outdated)

doronbehar force-pushed the pkg/python-paddleocr branch from 9a8c01e to 0d58c5d on January 5, 2026 09:50

doronbehar force-pushed the pkg/python-paddleocr branch from 0d58c5d to 1a67e1d on January 5, 2026 09:51
doronbehar (Contributor, Author) commented

Thank you very much for your reviews and the approval. I'd like to merge this tomorrow if there are no further objections.

doronbehar added this pull request to the merge queue on Jan 7, 2026

Merged via the queue into NixOS:master with commit a8b47f9 on Jan 7, 2026

28 of 30 checks passed

doronbehar deleted the pkg/python-paddleocr branch on January 7, 2026 11:27
Labels

  • 6.topic: python (Python is a high-level, general-purpose programming language.)
  • 10.rebuild-darwin: 11-100 (This PR causes between 11 and 100 packages to rebuild on Darwin.)
  • 10.rebuild-linux: 11-100 (This PR causes between 11 and 100 packages to rebuild on Linux.)
  • 12.approvals: 1 (This PR was reviewed and approved by one person.)

3 participants