Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Recognize multiple rectangles on a page without clearing results inbetween #4366

Open
bleze opened this issue Dec 10, 2024 · 0 comments
Open

Comments

@bleze
Copy link

bleze commented Dec 10, 2024

Your Feature Request

Using SetRectangle and Recognize in a loop will clear what has been found between each run resulting in only the last result being returned. It would be nice if it would nice it was possible to keep all results by telling it not to clear. Alternatively give it a range of rectangles and not just one. The latter is probably faster, since overhead can be reduced.

In turn when inserting result via AddImage in a PDFRenderer, the page is automatically advanced to the next, so only one result can be inserted per page. It would also be nice if at least it was possible to tell it not/or when to advance, or which page you are adding to, thereby build the pages from OCR results.

These features would help me achieve what I estimate is 80% of the time spent, or put it in another way 5x speed. Unsure about memory saved, but might help here also.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants