TPS, result.poller and 429 errors when using python begin_classify_document from Document Intelligence SDK #39643

mennolaan · 2025-02-10T07:34:13Z

Hi,

We have built a solution that uses client_async.begin_classify_document from the Document Intelligence Python SDK.

When we experienced 429 errors, we implemented load-balancing logic using a semaphore and async logic. This way we keep the initial parallel calls at 14 (1 lower than the max TPS) and make sure we can only have 14 simultaneous transactions per second.
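A minimal sketch of a limiter along these lines, for illustration only. `fake_classify` is a hypothetical stand-in for the real SDK call plus waiting on the poller, and the names and numbers are assumptions rather than the actual production code:

```python
import asyncio

MAX_IN_FLIGHT = 14  # one below the assumed limit of 15 TPS


async def fake_classify(doc_id, tracker):
    # Hypothetical stand-in for begin_classify_document + waiting on the poller.
    tracker["current"] += 1
    tracker["peak"] = max(tracker["peak"], tracker["current"])
    await asyncio.sleep(0.01)  # simulated service latency
    tracker["current"] -= 1
    return doc_id


async def classify_all(doc_ids, limit=MAX_IN_FLIGHT):
    semaphore = asyncio.Semaphore(limit)
    tracker = {"current": 0, "peak": 0}

    async def guarded(doc_id):
        # At most `limit` coroutines can be inside this block at once.
        async with semaphore:
            return await fake_classify(doc_id, tracker)

    results = await asyncio.gather(*(guarded(d) for d in doc_ids))
    return results, tracker["peak"]
```

Note that a semaphore bounds how many calls are in flight at once, not how many start per second: if a call finishes in well under a second, its slot is reused immediately and more than 14 requests can be sent within the same second.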

However, we still seem to receive the 429 http errors.

The documentation does not make clear how TPS is defined. Should we assume that the polling done for poller.result also counts toward the TPS limit? There isn't a way for us to control the latency of the poller. In our use case we have PDFs of different sizes, so we stream them to the endpoint. Therefore we do not know how long a classification takes, and we do not know how many times the poller will try to fetch the end result.
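To make the question concrete, here is a rough back-of-the-envelope sketch. It assumes, without confirmation from the documentation, that every HTTP request counts toward the limit, including each status poll, and that the poller polls at a fixed interval. Both function names are hypothetical:

```python
import math


def estimated_requests(duration_s, polling_interval_s):
    # ASSUMPTION: 1 submit request, then one status poll per interval
    # until the operation finishes (at least one poll is always needed).
    polls = max(1, math.ceil(duration_s / polling_interval_s))
    return 1 + polls


def estimated_poll_rate(concurrent_ops, polling_interval_s):
    # ASSUMPTION: in steady state each in-flight operation polls once
    # per interval, so polls per second = operations / interval.
    return concurrent_ops / polling_interval_s
```

Under these assumptions, 14 operations in flight with a 1-second polling interval would already generate about 14 poll requests per second before any new document is submitted, which would explain 429s despite the semaphore. Whether the service actually counts polls this way is exactly what we are asking.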

Obviously, the best practices say to implement retry logic, but that feels like a band-aid solution. We would like to have a better grasp of the expected behavior and prevent 429s as much as possible. This would also benefit the backend, since it would not have to send 429 responses all the time.
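One preventive option, sketched here under the assumption that the limit applies to how many requests start per second, is to pace request starts instead of only capping concurrency. `StartRateLimiter` is a hypothetical helper, not part of the SDK:

```python
import asyncio
import time


class StartRateLimiter:
    """Spaces out request starts so that at most `max_per_second` begin per second."""

    def __init__(self, max_per_second):
        self._interval = 1.0 / max_per_second
        self._next_start = 0.0

    async def wait(self):
        # No await happens between reading and updating _next_start,
        # so this reservation is atomic within a single event loop.
        now = time.monotonic()
        start_at = max(now, self._next_start)
        self._next_start = start_at + self._interval
        await asyncio.sleep(max(0.0, start_at - now))
```

Each caller would `await limiter.wait()` immediately before sending a request. If status polls also count toward the limit, they would need to pass through the same limiter, which the SDK poller does not obviously allow.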

What is the life cycle of a single transaction for TPS purposes, from begin_classify_document to poller.result?

I saw a similar mention here: #35952. Is there any progress on this?

xiangyan99 · 2025-02-10T14:43:59Z

Thanks for reaching out, we’ll investigate asap.

github-actions · 2025-02-10T17:34:20Z

Thanks for the feedback! We are routing this to the appropriate team for follow-up. cc @bojunehsu @vkurpad.

TFR258 · 2025-02-10T17:58:44Z

@mennolaan, could you reach out to [email protected], quoting this ticket and including your Azure resource ID?
We should be able to troubleshoot this issue further.

Thanks

github-actions · 2025-02-12T21:33:01Z

Hi @mennolaan. Thank you for opening this issue and giving us the opportunity to assist. To help our team better understand your issue and the details of your scenario please provide a response to the question asked above or the information requested above. This will help us more accurately address your issue.

xiangyan99 added the Client and Document Intelligence labels and removed the needs-triage label on Feb 10, 2025

github-actions bot added the needs-team-attention label on Feb 10, 2025

kristapratico added the Service Attention label on Feb 10, 2025

kristapratico added the needs-author-feedback label on Feb 12, 2025

github-actions bot removed the needs-team-attention label on Feb 12, 2025
