Support for More LLMs / configurable ExecutableId #243
Comments
Hi @MichaelSchmid-AGI, I guess the feature request is about supporting more direct LLM consumption. Are you aware of our orchestration package?
@jjtang1985 the idea of using orchestration as a unified entry point is good to have. However, I'm not clear how multimodal capabilities are provided by orchestration. I tried to pass an image into it and it gave a strange error; I think it's just not working. It might also add unwanted latency. Next, I'm not sure how compatible it is with LangChain. So far I need to rely on the Python SDK, as it offers the most models. It would be great to see Anthropic and Gemini supported soon.
Agreed, it’s crucial that the
Hi @skye0402, thank you for reaching out!
Image input has been supported by orchestration since last month.
As far as I know, the team ran some tests and found no major latency issues.
As for LangChain compatibility: so far, not supported.
Hi @Mohan-Sharma ,
Here is #176; please add your detailed requirements there, if possible. Thank you!
Describe the Problem
Hi there,
I am struggling to get LLMs that aren't from OpenAI running with your LangChain module. By default you seem to filter out LLMs whose "executableId" differs from "azure-openai".
This makes using different models seemingly impossible (for now).
Propose a Solution
I would suggest allowing an executableId to be passed when initializing a LangChain chat client.
This should allow other models to be used quite easily without having to rewrite much.
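As a rough illustration of the proposal (the class name, parameter names, and the `aicore-opensource` executable ID are assumptions for the sketch, not the real SDK API), the change could look like this:

```python
# Hypothetical sketch of the proposed API -- not the actual SDK code.
# The point is only that executable_id becomes a constructor argument
# instead of being hard-coded to "azure-openai".
class LangchainChatClient:
    """Minimal stand-in for a LangChain-style chat client."""

    def __init__(self, deployment_id: str, executable_id: str = "azure-openai"):
        self.deployment_id = deployment_id
        # Caller can now select a non-OpenAI executable.
        self.executable_id = executable_id


# Default keeps today's behaviour; the override selects another runtime.
default_client = LangchainChatClient(deployment_id="d-123")
llama_client = LangchainChatClient(
    deployment_id="d-456",
    executable_id="aicore-opensource",  # assumed ID for open-source models
)
```

Keeping `"azure-openai"` as the default would preserve backwards compatibility for existing users.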
The chat-completion API payload (for example when using Llama or Mixtral) seems identical to the payloads that work with OpenAI models.
POST {baseurl}/v2/inference/deployments/{deploymentID}/chat/completions
Body for GPT-4o:
Body for Llama 3.1 Instruct:
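To sketch the point that the payloads line up (the field names below assume the common OpenAI-style chat-completions shape; the exact bodies are illustrative, not copied from a real deployment):

```python
# Illustrative request bodies with assumed fields: both models accept
# the same OpenAI-style chat-completions payload, so one client
# implementation could serve both.
body_gpt4o = {
    "messages": [{"role": "user", "content": "Hello!"}],
    "max_tokens": 100,
    "temperature": 0.7,
}
body_llama = {
    "messages": [{"role": "user", "content": "Hello!"}],
    "max_tokens": 100,
    "temperature": 0.7,
}

# Identical key sets and message structure.
print(body_gpt4o.keys() == body_llama.keys())  # True
```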
Describe Alternatives
No response
Affected Development Phase
Development
Impact
Inconvenience
Timeline
No response
Additional Context
No response