Fix for Exception raised while parsing Chat Completions streaming response, in some rare cases #39741
Description
I finally got a repro of the GitHub issue while capturing SDK logs and using a DeepSeek model. That confirmed what I suspected. But I could not understand why the unit test I added a couple of days ago, which streams Chinese characters broken across lines, did not exhibit the same issue. That unit test passed.
After investigation, it turns out I had a bug in the unit test: a missing comma after one of the SSE lines in the input array. That miraculously made the unit test pass, when it should have failed exactly as reported in the GitHub issue.
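Assuming the test is written in Python (an assumption; the names below are illustrative, not the actual test data), a missing comma is an easy bug to miss because adjacent string literals are implicitly concatenated, so two intended SSE chunks silently become one element. The merged chunk never splits the multi-byte character across a boundary, which is exactly how the test could pass:

```python
# "你" is b"\xe4\xbd\xa0" in UTF-8; the test intends to split it mid-character.
buggy = [
    b'data: {"content": "\xe4\xbd'  # <- missing comma: Python concatenates
    b'\xa0"}\n',                    #    the two literals into ONE element
]
fixed = [
    b'data: {"content": "\xe4\xbd',  # chunk ends mid-character...
    b'\xa0"}\n',                     # ...and completes in the next chunk
]
print(len(buggy), len(fixed))  # 1 2
```

With `buggy`, the parser receives the whole line in one piece, so a premature UTF-8 decode never encounters a partial character and the bug stays hidden.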
Once I discovered that, the fix was easy to implement. I updated the SSE parsing logic so that UTF-8 decoding happens further down, at the point where we are guaranteed to have a complete line of JSON, just before deserializing it into the output chunk object. Until that point (including when caching the previous incomplete line), the input is handled as a bytes object.
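A minimal sketch of that approach, assuming a Python SDK (this is not the actual SDK code; the function name and simplified SSE handling are mine). Incomplete tails stay as bytes in the buffer, and decoding happens only once a full line is available:

```python
import json

def parse_sse_bytes(chunks):
    """Accumulate raw bytes and UTF-8 decode only complete SSE lines,
    so a multi-byte character split across network chunks still decodes."""
    buffer = b""  # incomplete tail is cached as bytes, never decoded early
    for chunk in chunks:
        buffer += chunk
        while b"\n" in buffer:
            line, _, buffer = buffer.partition(b"\n")
            line = line.strip()
            if not line.startswith(b"data: "):
                continue
            payload = line[len(b"data: "):]
            if payload == b"[DONE]":
                return
            # Decode only now, when the JSON line is guaranteed complete.
            yield json.loads(payload.decode("utf-8"))

# "你" (b"\xe4\xbd\xa0") arrives split across two network chunks:
chunks = [b'data: {"content": "\xe4\xbd', b'\xa0"}\n']
print(list(parse_sse_bytes(chunks)))  # [{'content': '你'}]
```

Decoding each chunk eagerly would raise `UnicodeDecodeError` on the first chunk here, since it ends mid-character; deferring the decode sidesteps that entirely.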
I also updated all streaming samples with extra checks when printing the streaming response, and printed the token usage as well. We had already done that in the GitHub samples.