You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I don't want to run inference or training, just access the LLM weights in C++. Is there a way I can do this? I could not find any documentation for this online, almost everything suggests converting to GGUF and running inference without any information about accessing the underlying tensors.
reacted with thumbs up emoji reacted with thumbs down emoji reacted with laugh emoji reacted with hooray emoji reacted with confused emoji reacted with heart emoji reacted with rocket emoji reacted with eyes emoji
-
I don't want to run inference or training, just access the LLM weights in C++. Is there a way I can do this? I could not find any documentation for this online, almost everything suggests converting to GGUF and running inference without any information about accessing the underlying tensors.
Beta Was this translation helpful? Give feedback.
All reactions