-
-
Notifications
You must be signed in to change notification settings - Fork 769
Integrate PaddleOCR as an alternative OCR engine #18663
Copy link
Copy link
Open
Labels
feature/ocrRelated to Optical Character Recognition feature in NVDARelated to Optical Character Recognition feature in NVDAp5https://github.com/nvaccess/nvda/blob/master/projectDocs/issues/triage.md#priorityhttps://github.com/nvaccess/nvda/blob/master/projectDocs/issues/triage.md#prioritytriagedHas been triaged, issue is waiting for implementation.Has been triaged, issue is waiting for implementation.
Metadata
Metadata
Assignees
Labels
feature/ocrRelated to Optical Character Recognition feature in NVDARelated to Optical Character Recognition feature in NVDAp5https://github.com/nvaccess/nvda/blob/master/projectDocs/issues/triage.md#priorityhttps://github.com/nvaccess/nvda/blob/master/projectDocs/issues/triage.md#prioritytriagedHas been triaged, issue is waiting for implementation.Has been triaged, issue is waiting for implementation.
Is your feature request related to a problem? Please describe.
PaddleOCR is probably the best open-source traditional OCR out there right now. It works way better than Windows OCR. By switching between different models, it can recognize over 80 languages. You can even use custom models to make it even better.
Describe the solution you'd like
Use WinML to run the PaddleOCR model converted to ONNX.
Currently, there are no open source projects related to WinML inference deployment, so you may need to write the relevant code yourself.
Related: #18662
Describe alternatives you've considered
In the future, when we introduce ONNXRuntime, we can also consider using ONNXRuntime directly.
Currently, there are some PaddleOCR projects that use ONNXRuntime, which may be directly integrated.
Additional context