How to convert LlavaLlamaForCausalLM based models to GGUF #8532
Replies: 5 comments 1 reply
-
+1 on this, it's really annoying
-
Also looking for this
-
Some people say that either convert.py (deprecated / removed, AFAIK) or the legacy llama converter (under ./examples, if I remember correctly) should do the trick, but neither works for me. I want to test NVILA or some other Llama-backboned VLM in Ollama, and it's really frustrating not to be able to convert these models. Ollama has a Llava model available on its website, but there is no tool or tutorial on how to actually import a VLM. I think .safetensors should work as well, but I haven't had success importing that yet.
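For reference, here is a minimal sketch of the Ollama import side, assuming you already have a converted GGUF. The file name `llava-f16.gguf` and the model name `my-llava` are placeholders, not anything from this thread, and a plain `FROM` only covers the language model; how Ollama picks up a separate vision-projector file is exactly the undocumented part being complained about here, so check Ollama's import docs for current multimodal support:

```sh
# Sketch: import an already-converted GGUF into Ollama.
# "llava-f16.gguf" and "my-llava" are placeholder names (assumptions).
cat > Modelfile <<'EOF'
FROM ./llava-f16.gguf
EOF

ollama create my-llava -f Modelfile
ollama run my-llava
```

This covers the text side only; if the conversion produced a separate vision-projector GGUF, consult Ollama's documentation on whether and how it can be attached.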
-
Just posted the solution here: #2948 (comment)
-
convert_hf_to_gguf.py will work out of the box with LLaVA after #11292
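For anyone landing here later, a minimal sketch of that invocation, assuming a local Hugging Face checkpoint directory; the paths and output name are placeholders, and the flags should be confirmed against `--help` on your checkout:

```sh
# Sketch: convert a HF LLaVA checkpoint to GGUF with llama.cpp's converter.
# "path/to/llava-hf-model" and "llava-f16.gguf" are placeholders.
python convert_hf_to_gguf.py path/to/llava-hf-model \
    --outfile llava-f16.gguf \
    --outtype f16

# Recent checkouts can reportedly also export the vision projector as a
# separate GGUF (an --mmproj option); verify with --help, as this may
# differ by version.
```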
-
Hey there,
my goal is to run LlavaLlamaForCausalLM based models on a Jetson device through Ollama. As far as I can tell, there is no out-of-the-box way to convert the model weights into the .gguf format without losing the vision component. I did some digging and found /app/examples/convert_legacy_llama.py in llama.cpp, but it still doesn't handle the vision tower.
Maybe I'm missing some obvious solution, but so far I haven't been able to work it out. Any ideas on how to convert the model properly?
Any pointers, guides, tutorials, etc. toward this goal would be much appreciated!
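For reference, the llama.cpp examples/llava README describes a surgery-based flow for LLaVA-1.5-style models that does handle the vision tower: split the projector out of the checkpoint, convert the CLIP encoder separately, then convert the remaining language model. A sketch below, using the README's example directory names (llava-v1.5-7b and its CLIP encoder), so adjust them for your model; whether this works for other LlavaLlamaForCausalLM variants is the open question:

```sh
# Sketch of the examples/llava workflow from llama.cpp (LLaVA-1.5 era).
# Directory names follow the upstream README and are assumptions here.

# 1. Split the multimodal projector out of the checkpoint.
python ./examples/llava/llava_surgery.py -m ../llava-v1.5-7b

# 2. Convert the CLIP vision tower plus projector into a standalone GGUF.
python ./examples/llava/convert_image_encoder_to_gguf.py \
    -m ../clip-vit-large-patch14-336 \
    --llava-projector ../llava-v1.5-7b/llava.projector \
    --output-dir ../llava-v1.5-7b

# 3. Convert the remaining LLaMA language model, skipping the tensors
#    that were split out in step 1.
python ./examples/convert_legacy_llama.py ../llava-v1.5-7b --skip-unknown
```

The result is two GGUF files (the language model plus the vision encoder); how Ollama consumes the second one is the part worth double-checking against its import docs.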