You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
It seems that there is a distinct lack of 3B models. There's Llama-3.2-3B-Instruct which is notorious for its refusals, making it unsuitable for the cases that require the model to always return expected generations. Other than that, there's only Qwen2.5-3B-Instruct and Qwen2.5-Coder-3B-Instruct, with the latter being a more niche-focused model.
The proposed model should fill this gap well. NousResearch is a well-known player in the LLM field and the Hermes 3 series of models have been researched and developed with enough care to be considered production-ready.
For a project with a mission to deliver in-browser LLMs, I think it's a must to add more official support for the smaller models.
https://huggingface.co/NousResearch/Hermes-3-Llama-3.2-3B
There's Hermes-3-Llama-3.1 but 3.2 is better
The text was updated successfully, but these errors were encountered: