fix: Enable quantization and compilation in the same optimization job via ModelBuilder and add validations to prevent compilation for Llama-3.1 on TRTLLM. #459
Triggered via pull request
September 19, 2024 22:00
Status
Success
Total duration
3m 20s
Artifacts
–