Skip to content

Enabled Qwen2-MoE Tensor Parallelism (TP) inference #11526

Enabled Qwen2-MoE Tensor Parallelism (TP) inference

Enabled Qwen2-MoE Tensor Parallelism (TP) inference #11526

Triggered via pull request October 8, 2024 23:06
Status Success
Total duration 27m 17s
Artifacts

nv-accelerate-v100.yml

on: pull_request
Fit to window
Zoom out
Zoom in