When sending text that is longer than the model's context length, vLLM throws an error:
This model's maximum context length is 256 tokens. However, you requested 258 tokens in the input for embedding generation. Please reduce the length of the input.
Is there an option to enable some form of automatic truncation?
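In the meantime, one client-side workaround is to truncate the input with the model's own tokenizer before calling the server. A minimal sketch, assuming vLLM's OpenAI-compatible server is running locally; the model name, port, and 256-token limit are placeholders to adapt to your setup:

```python
from openai import OpenAI
from transformers import AutoTokenizer

# Placeholder model; substitute the embedding model you actually serve.
MODEL = "BAAI/bge-small-en-v1.5"
MAX_TOKENS = 256  # the model's maximum context length

tokenizer = AutoTokenizer.from_pretrained(MODEL)
client = OpenAI(base_url="http://localhost:8000/v1", api_key="EMPTY")

def embed_truncated(text: str) -> list[float]:
    # Tokenize with truncation so the request stays within the context length.
    ids = tokenizer(text, truncation=True, max_length=MAX_TOKENS)["input_ids"]
    # Decode back to text for the API call. Caveat: the server re-tokenizes,
    # and a decode/re-encode round trip is not always exactly length-preserving,
    # so leaving a small margin below MAX_TOKENS is safer.
    truncated = tokenizer.decode(ids, skip_special_tokens=True)
    resp = client.embeddings.create(model=MODEL, input=truncated)
    return resp.data[0].embedding
```

This obviously loses whatever falls past the cutoff, so a built-in server-side option would still be preferable.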