[REQUEST] Multi gpu conversion #715

IMbackK · 2025-01-14T23:58:48Z

Problem

The measurement stage of conversion takes pretty long.

Solution

Use multiprocessing to spawn n processes to do the measurement on all available gpus.

Alternatives

No response

Explanation

it would help if multiple gpus could be involved in conversion by measuring n tensors at a time across how ever many gpus are available.

Examples

No response

Additional context

No response

Acknowledgements

I have looked for similar requests before submitting this one.
I understand that the developers have lives and my issue will be answered when possible.
I understand the developers of this program are human, and I will make my requests politely.

IMbackK · 2025-01-15T00:00:36Z

Another easy win would be to allow the user to exclude some configurations from the measurement. When quantizeoing at 8bpw its unliekly 2.12 bpw will get any wins so the user could exclude it from the measurement.

Lissanro · 2025-02-05T03:40:33Z

I recently started creating my own EXL2 quants, and I encountered this issue as well - only one GPU is used (out four GPUs I have) during the measurement.

I also noticed that it generated a quant layer by layer, so I think potentially not only measurements, but also conversion itself could be performed using all GPUs available.

On my configuration, it would be 4x improvement in speed, which would be a huge performance improvement and would make creating EXL2 quants much easier.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[REQUEST] Multi gpu conversion #715

[REQUEST] Multi gpu conversion #715

IMbackK commented Jan 14, 2025

IMbackK commented Jan 15, 2025

Lissanro commented Feb 5, 2025

[REQUEST] Multi gpu conversion #715

[REQUEST] Multi gpu conversion #715

Comments

IMbackK commented Jan 14, 2025

Problem

Solution

Alternatives

Explanation

Examples

Additional context

Acknowledgements

IMbackK commented Jan 15, 2025

Lissanro commented Feb 5, 2025