Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Model request: Hermes-3-Llama-3.2-3B #650

Open
yyjhao opened this issue Jan 18, 2025 · 3 comments
Open

Model request: Hermes-3-Llama-3.2-3B #650

yyjhao opened this issue Jan 18, 2025 · 3 comments

Comments

@yyjhao
Copy link

yyjhao commented Jan 18, 2025

https://huggingface.co/NousResearch/Hermes-3-Llama-3.2-3B

There's Hermes-3-Llama-3.1 but 3.2 is better

@FriskyFennecFox
Copy link

It seems that there is a distinct lack of 3B models. There's Llama-3.2-3B-Instruct which is notorious for its refusals, making it unsuitable for the cases that require the model to always return expected generations. Other than that, there's only Qwen2.5-3B-Instruct and Qwen2.5-Coder-3B-Instruct, with the latter being a more niche-focused model.

The proposed model should fill this gap well. NousResearch is a well-known player in the LLM field and the Hermes 3 series of models have been researched and developed with enough care to be considered production-ready.

For a project with a mission to deliver in-browser LLMs, I think it's a must to add more official support for the smaller models.

@CharlieFRuan
Copy link
Contributor

CharlieFRuan commented Jan 21, 2025

Thanks for the request, added in npm 0.2.78:

@CharlieFRuan
Copy link
Contributor

It is also now live on https://chat.webllm.ai/

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants