VESSL AI LLMProvider integration #17414
base: main
Conversation
vesslai integration
Check out this pull request on ReviewNB: see visual diffs & provide feedback on Jupyter Notebooks. Powered by ReviewNB.
llama-index-integrations/llms/llama-index-llms-vesslai/poetry.lock
llama-index-integrations/llms/llama-index-llms-vesslai/llama_index/llms/vesslai/BUILD
self.organization_name = organization_name

def serve(
Curious about the decision to do serve and connect outside of the __init__() function? Do your users often switch this after the llm object is created? In most llama-index LLMs, you would just do llm = VesslAILLM(...) and then from there you can directly use it.
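For comparison, a minimal sketch of that usual pattern, assuming configuration lived entirely in the constructor; the keyword arguments below are illustrative, not the actual VesslAILLM signature:

```python
# Hypothetical sketch of the standard llama-index usage pattern, where the
# constructor does all setup and the object is usable immediately.
# The constructor arguments are assumptions for illustration only.
from llama_index.llms.vesslai import VesslAILLM

llm = VesslAILLM(
    organization_name="my-org",   # assumed constructor argument
    model_name="my-hf-model",     # assumed constructor argument
)
response = llm.complete("Hello from llama-index")  # no separate serve()/connect() step
print(response.text)
```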
It's fine either way tbh, was just curious.
Thank you for the feedback. To use VESSL, authentication through configure is required. I wanted to handle this process during initialization and explicitly separate the serving and connection of the llm_provider afterward. Internally at VESSL, we have discussed this flow, and it seems to be fine.
llm = VesslAILLM()

# 1 Serve hf model name
llm.serve(
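For readers skimming the thread, a rough end-to-end sketch of the separated flow described above; the serve() keyword arguments are assumptions for illustration, not the integration's confirmed signature:

```python
# Hedged sketch of the init -> serve -> query flow discussed above.
# The serve() keyword arguments below are assumptions for illustration.
from llama_index.llms.vesslai import VesslAILLM

llm = VesslAILLM()  # authentication (VESSL configure) happens here

# 1. Serve a Hugging Face model by name (assumed keyword arguments)
llm.serve(
    model_name="mistralai/Mistral-7B-Instruct-v0.3",
    service_name="llama-index-vesslai-demo",
)

# 2. Once the gateway is up, use it like any other llama-index LLM
response = llm.complete("San Francisco is a")
print(response.text)
```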
Thoughts on making serve and connect async? Seems like this could possibly be a blocking operation with wait_for_gateway_enabled?
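To make the suggestion concrete, a self-contained sketch of pushing a blocking readiness wait off the event loop with asyncio.to_thread; launch_service and wait_for_gateway_enabled here are placeholder stand-ins, not the integration's actual helpers:

```python
# Hedged sketch: moving a blocking readiness poll off the event loop.
# launch_service and wait_for_gateway_enabled are placeholder stand-ins
# for the real VESSL calls.
import asyncio
import time


def launch_service(model_name: str) -> str:
    """Placeholder for the call that starts the VESSL service."""
    return f"gateway-for-{model_name}"


def wait_for_gateway_enabled(gateway: str) -> None:
    """Placeholder for the blocking poll until the gateway is reachable."""
    time.sleep(1)


async def aserve(model_name: str) -> str:
    gateway = launch_service(model_name)
    # Run the blocking wait in a worker thread so the event loop stays free.
    await asyncio.to_thread(wait_for_gateway_enabled, gateway)
    return gateway


if __name__ == "__main__":
    print(asyncio.run(aserve("my-hf-model")))
```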
llama-index-integrations/llms/llama-index-llms-vesslai/llama_index_vesslai_example.ipynb
llama-index-integrations/llms/llama-index-llms-vesslai/llama_index/llms/vesslai/utils.py
There's quite a lot of code; is any of it testable? (You'd have to mock out API calls, though.)
Unfortunately, mocking the API for our service is quite complicated at the moment. We will consider adding it after the merge if necessary.
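If testing is picked up later, one possible shape for a mocked test using unittest.mock; every dotted path, helper name, and serve() argument below is a placeholder guess, not the module's confirmed API:

```python
# Hedged sketch of a unit test that never touches the VESSL API.
# The patch targets (vessl_configure, wait_for_gateway_enabled) and the
# serve() arguments are placeholder guesses, not confirmed names.
from unittest import mock


def test_serve_with_mocked_vessl_api():
    with mock.patch("llama_index.llms.vesslai.utils.vessl_configure"), mock.patch(
        "llama_index.llms.vesslai.utils.wait_for_gateway_enabled",
        return_value=True,
    ) as mock_wait:
        from llama_index.llms.vesslai import VesslAILLM

        llm = VesslAILLM()
        llm.serve(model_name="some-hf-model", service_name="test-service")
        mock_wait.assert_called_once()
```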
* Apply comments from llama_index
* change into async
* update readme
* update poetry.lock and ipynb example
Description
Fixes # (issue)
New Package?
Did I fill in the tool.llamahub section in the pyproject.toml and provide a detailed README.md for my new integration or package?
Version Bump?
Did I bump the version in the pyproject.toml file of the package I am updating? (Except for the llama-index-core package)
Type of Change
Please delete options that are not relevant.
How Has This Been Tested?
Your pull-request will likely not be merged unless it is covered by some form of impactful unit testing.
Suggested Checklist:
I ran make format; make lint to appease the lint gods