Support multiple deployment_ids for azure #300
base: main
Conversation
Thanks for this. Looks like a breaking change for Azure users?
Not really, they can just use the same base URI as always and then omit the deployment_id.
Sorry to ping you, @alexrudall. Not sure what you want to do about the RuboCop warning.
@henrikbjorn - this should now be solved in v5 of ruby-openai, as you can create multiple Client objects, each with different configuration (so you can set a different URI as needed). Let me know if this doesn't solve your problem, and thank you for your contribution!
That doesn't solve the problem. It makes things complex and is unneeded, since I would have to have an Embedding Client, a GPT-4 Client, etc. That does not make sense to me.
Especially since that use case is still supported with this code: just set the base URI to the deployment directly and omit the deployment_id from the API calls.
Hmm, OK, let me take another look.
This is what I'm doing to support multiple models in Azure, which is working fine:

```ruby
module OpenAi
  class Client
    def self.client(deployment_id)
      OpenAI::Client.new(
        uri_base: "https://***.openai.azure.com/openai/deployments/#{deployment_id}"
      )
    end

    def self.default
      client("gpt-35-turbo")
    end

    def self.gpt35_16k
      client("gpt-35-16k")
    end

    def self.gpt4
      client("gpt-4-8k")
    end

    def self.embedding
      client("text-embedding-ada-002")
    end
  end
end

OpenAi::Client.gpt35_16k.chat(parameters: ...)
```
Either works, but I think it is a good idea to follow the same API as the Python SDK, so the SDKs work the same way.
One minor annoyance with the current setup is that, even when creating multiple OpenAI clients, we have to duplicate and construct the URI each time. In that sense, the configuration for Azure isn't actually a "URI base," but a "full URI." Ideally, we'd configure the true "base" of the URI once (e.g. from an env var), and then set the deployment per client initialization. As a compromise, could we instead split out an optional deployment_id constructor arg that gets appended to the uri_base?
Which is what this PR does.
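A minimal sketch of how an optional deployment_id could be appended to a configured base URI. The helper `build_uri_base` is hypothetical and not part of the ruby-openai API; it only illustrates the string construction being discussed:

```ruby
# Hypothetical helper: append an optional Azure deployment_id to a base URI.
# Not part of ruby-openai; purely illustrative of the proposed split.
def build_uri_base(uri_base, deployment_id = nil)
  return uri_base if deployment_id.nil?

  # Azure routes requests to a specific deployment under this path.
  "#{uri_base.chomp('/')}/openai/deployments/#{deployment_id}"
end

# With a deployment_id, the full Azure path is built from the true base:
build_uri_base("https://example.openai.azure.com", "gpt-4-8k")
# Without one, the configured base is used unchanged:
build_uri_base("https://example.openai.azure.com")
```

This way the true base could come from an env var once, and only the deployment would vary per client.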
This enables support for calling multiple Azure deployments when using the library. It follows the same convention as the official Python openai package.
So:

```ruby
client.embeddings(deployment_id: 'my-gpt-deployment', ...rest_of_args)
```
Since an embedding model would be one deployment and the actual chat model another, this is required to use both.
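To show why a per-call deployment_id lets one client serve both models, here is a small stand-in sketch. `AzureStub` and `request_path` are illustrative names, not part of ruby-openai; they only demonstrate how a single configured base can route each request to a different deployment:

```ruby
# Hypothetical stand-in for a client that accepts deployment_id per call,
# mirroring the convention proposed in this PR. Not part of ruby-openai.
class AzureStub
  def initialize(uri_base:)
    @uri_base = uri_base
  end

  # Build the Azure request path for a given endpoint and deployment.
  def request_path(endpoint, deployment_id:)
    "#{@uri_base}/openai/deployments/#{deployment_id}/#{endpoint}"
  end
end

client = AzureStub.new(uri_base: "https://example.openai.azure.com")

# One client, two deployments: embeddings and chat each hit their own model.
client.request_path("embeddings", deployment_id: "text-embedding-ada-002")
client.request_path("chat/completions", deployment_id: "gpt-4-8k")
```

With the full-URI approach, each of these calls would instead need its own client object.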
All Submissions: