Skip to content

Enabled configurable auto Tensor Parallelism (TP) for the inference of diverse models #13283

Enabled configurable auto Tensor Parallelism (TP) for the inference of diverse models

Enabled configurable auto Tensor Parallelism (TP) for the inference of diverse models #13283

unit-tests

succeeded Feb 18, 2025 in 1h 25m 11s