Move all Llama Stack types to llama-stack #279

ashwinb · 2025-02-14T05:13:48Z

llama-models should have extremely minimal cruft. Its sole purpose should be didactic -- show the simplest implementation of the llama models and document the prompt formats, etc.

As such:

- it should not have any "json_schema" registration, etc., since that is only necessary for API generation, which is a Llama Stack concern
- there should be a single top-level datatypes.py file containing a minimal list of datatypes, with nothing left inside llama3/api/
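
To illustrate the intent, here is a minimal sketch of what a registration-free datatypes module could look like. The names `Role`, `Message`, and `Dialog` are hypothetical and are not taken from the actual llama-models code; the point is only that these are plain Python types with no schema-registration decorator attached.

```python
from dataclasses import dataclass, field
from enum import Enum
from typing import List


class Role(Enum):
    # Hypothetical role names, chosen for illustration only.
    SYSTEM = "system"
    USER = "user"
    ASSISTANT = "assistant"


@dataclass
class Message:
    # A plain dataclass: no schema-registration decorator, no API metadata.
    role: Role
    content: str


@dataclass
class Dialog:
    # Ordered list of messages that make up one conversation.
    messages: List[Message] = field(default_factory=list)

    def add(self, role: Role, content: str) -> "Dialog":
        # Append a message and return self so calls can be chained.
        self.messages.append(Message(role=role, content=content))
        return self


dialog = Dialog().add(Role.SYSTEM, "You are a helpful assistant.").add(Role.USER, "Hi")
```

Anything needed for API generation (schema registration, OpenAPI metadata) would then live on the Llama Stack side, wrapping or mirroring types like these.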

Similarly, the registrations of the various Llama SKUs, their parameters, etc. could move to Llama Stack (although that is debatable), since only the Llama Stack CLI is concerned with them.

NOTE: *This PR cannot land until the corresponding llama-stack PR (which removes dependencies on llama-models), meta-llama/llama-stack#1098, lands first.*

Test Plan

(Partial, since most of the tests will be run on the Llama Stack side.)

Run chat completion:

torchrun -m models.scripts.example_chat_completion \
  ~/.llama/checkpoints/Llama3.2-3B-Instruct
torchrun -m models.scripts.multimodal_example_chat_completion \
  ~/.llama/checkpoints/Llama3.2-11B-Vision-Instruct

pytest -s -v models/

github-actions · 2025-02-14T05:18:25Z

…1098)

llama-models should have extremely minimal cruft. Its sole purpose should be didactic -- show the simplest implementation of the llama models and document the prompt formats, etc.

This PR is the complement to meta-llama/llama-models#279

## Test Plan

Ensure all `llama` CLI `model` sub-commands work:

```bash
llama model list
llama model download --model-id ...
llama model prompt-format -m ...
```

Ran tests:

```bash
cd tests/client-sdk
LLAMA_STACK_CONFIG=fireworks pytest -s -v inference/
LLAMA_STACK_CONFIG=fireworks pytest -s -v vector_io/
LLAMA_STACK_CONFIG=fireworks pytest -s -v agents/
```

Create a fresh venv `uv venv && source .venv/bin/activate` and run `llama stack build --template fireworks --image-type venv` followed by `llama stack run together --image-type venv` <-- the server runs

Also checked that the OpenAPI generator can run and there is no change in the generated files as a result.

```bash
cd docs/openapi_generator
sh run_openapi_generator.sh
```

ashwinb requested review from yanxi0830, hardikjshah, dltn, raghotham and ehhuang as code owners February 14, 2025 05:13

facebook-github-bot added the CLA Signed label Feb 14, 2025

Move all Llama Stack types to llama-stack

1127165

ashwinb force-pushed the move_to_stack branch from 43afc48 to 1127165 February 14, 2025 05:19

Add back llama3.api.datatypes for back compat

0288251

ashwinb mentioned this pull request Feb 14, 2025

chore: move all Llama Stack types from llama-models to llama-stack meta-llama/llama-stack#1098

Merged

remove the duplicate file, not needed for back compat

67a38b8

ehhuang approved these changes Feb 14, 2025

ashwinb merged commit c4d8644 into main Feb 14, 2025
1 check passed

ashwinb deleted the move_to_stack branch February 14, 2025 17:07

ashwinb mentioned this pull request Feb 14, 2025

chore(precommit): add uv-sync hook #278

Open
