[REQUEST] Support x-grammar structured output framework integration #723

debasish-mihup · 2025-01-27T07:56:11Z

Problem

I am using exl2 in my environment to generate structured output (JSON). It works well. The issue is inference speed: I am unable to pass a multi-instance filter to the dynamic generator and thus cannot do batch inference.

On profiling, I found that the GIL is wrecking throughput: CPU usage is at 100% and is a huge bottleneck. The system cannot make use of additional CPUs, and the lm-format-enforcer framework has not seen much active development recently.

Solution

I found that there are frameworks for the same purpose with much better multi-instance inference support, as well as a significant speed-up in single-thread performance.

xgrammar framework

Can you look into this, or guide me on how to get started integrating it with your inference framework?

Integration Docs

Alternatives

No response

Explanation

The new framework should:

Support batch inference with structured output
Improve single-thread performance (3.5x speed-up on JSON)
Be less memory intensive
Support generation constrained by context-free grammars

Paper

Blogpost

Examples

No response

Additional context

No response

Acknowledgements

I have looked for similar requests before submitting this one.
I understand that the developers have lives and my issue will be answered when possible.
I understand the developers of this program are human, and I will make my requests politely.

turboderp · 2025-02-02T13:26:02Z

I would see if you can't do what you want to do with Formatron. There's an example here. Formatron isn't hindered by the GIL and so you should have more luck with multiple jobs each running their own filter in a separate thread (as in the example).
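
To make the "one filter per job, each in its own thread" idea concrete, here is a minimal, library-free sketch. It is not the exllamav2 or Formatron API: `ToyFilter` and `run_batch` are hypothetical names made up for illustration, and the "model" is replaced by picking the smallest allowed character. It only shows the shape of the pattern, where every job in a batch owns an independent, stateful filter and the per-step masks are computed in a thread pool.

```python
from concurrent.futures import ThreadPoolExecutor


class ToyFilter:
    """Hypothetical stand-in for a per-job constraint filter.

    It accepts exactly one target string, one character at a time.
    """

    def __init__(self, template):
        self.template = template
        self.pos = 0  # per-job state; never shared between jobs

    def allowed(self):
        # Set of characters permitted next; an empty set means the job is done.
        if self.pos < len(self.template):
            return {self.template[self.pos]}
        return set()

    def feed(self, ch):
        # Advance the filter state after a character has been chosen.
        if ch not in self.allowed():
            raise ValueError(f"character {ch!r} not allowed at position {self.pos}")
        self.pos += 1


def run_batch(templates):
    filters = [ToyFilter(t) for t in templates]  # one filter instance per job
    outputs = ["" for _ in templates]
    with ThreadPoolExecutor(max_workers=max(1, len(filters))) as pool:
        while True:
            # Compute every job's mask concurrently, one task per filter.
            masks = list(pool.map(lambda f: f.allowed(), filters))
            if not any(masks):
                break  # all jobs have finished
            for i, mask in enumerate(masks):
                if mask:
                    ch = min(mask)  # stand-in for sampling from the masked logits
                    filters[i].feed(ch)
                    outputs[i] += ch
    return outputs


if __name__ == "__main__":
    print(run_batch(['{"a":1}', "[]"]))
```

Note that a pure-Python filter like this one would still be serialized by the GIL, so the threads here buy nothing in practice. The pattern only pays off when the mask computation runs in native code that releases the GIL, which is what the comment above says about Formatron.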
