Florence-2 QLoRA support #108
Comments
Hi @0xD4rky 👋🏻 Thank you for your interest! I saw that you checked the
Yes @SkalskiP, it would be absolutely great to contribute to maestro. I'll go through the codebase, develop the QLoRA approach for Florence-2, and then raise a PR. Does that work?
@0xD4rky that would be awesome! 🔥 I'm excited. We are building something cool here.
@0xD4rky I'll assign this issue to you! BTW I have a question: I'm thinking about launching a Discord server dedicated to VLM fine-tuning with Maestro, and while talking about current issues I'm trying to understand whether people would like such a server to be created.
@SkalskiP, people would love to hop on a Roboflow Discord server in general. It would be great to discuss fine-tuning approaches using Maestro; people would definitely be interested!
@0xD4rky sounds great; I'll try to set something up this week! By the way, I updated the contributor guidelines today. Take a look, since you want to add Florence-2 QLoRA support.
@SkalskiP sure, I'll go through it before starting to contribute and will keep you updated on all the changes I come up with.
Search before asking
Description
I was going through the maestro repo and found that neither the PaliGemma nor the Florence-2 models support 4-bit quantization (i.e., a QLoRA configuration).
Use case
With QLoRA, we could fine-tune vision-language models even on low-end devices without a large loss in precision. As models grow, implementing QLoRA will eventually be necessary to make fine-tuning fast and feasible under memory constraints.
Additional
I would like to learn your take on implementing quantization.
Are you willing to submit a PR?