Skip to content

Enabled configurable auto Tensor Parallelism (TP) for the inference of diverse models #13283

Enabled configurable auto Tensor Parallelism (TP) for the inference of diverse models

Enabled configurable auto Tensor Parallelism (TP) for the inference of diverse models #13283

Re-run triggered February 18, 2025 09:58
Status Success
Total duration 1h 25m 19s
Artifacts

nv-torch-latest-v100.yml

on: pull_request
Fit to window
Zoom out
Zoom in