Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Audio Live Mode with Tool Use #62

Open
dselman opened this issue Feb 3, 2025 · 1 comment
Open

Audio Live Mode with Tool Use #62

dselman opened this issue Feb 3, 2025 · 1 comment
Labels
duplicate This issue or pull request already exists

Comments

@dselman
Copy link

dselman commented Feb 3, 2025

Description of the feature request:

The documentation is quite vague on this issue: https://ai.google.dev/api/multimodal-live#function-calling

Specifically, "Audio inputs and audio outputs negatively impact the model's ability to use function calling." — which is what I think I am seeing. When I use tools with audio input & output the tool responses sent back seem to be interpreted as part of the conversation, or at least the audio part of the conversation gets very confused.

Is this a bug? Or limitation? Should we expect improvements for this use case?

What problem are you trying to solve with this feature?

I would like to conduct an audio live mode interaction (audio In and Out) but with calls to some tools being performed in the background.

Any other information you'd like to share?

No response

@hapticdata hapticdata added the duplicate This issue or pull request already exists label Feb 8, 2025
@hapticdata
Copy link
Collaborator

#47 things are always getting better 😉

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
duplicate This issue or pull request already exists
Projects
None yet
Development

No branches or pull requests

2 participants