Model request: Hermes-3-Llama-3.2-3B #650

yyjhao · 2025-01-18T17:38:45Z

https://huggingface.co/NousResearch/Hermes-3-Llama-3.2-3B

There's Hermes-3-Llama-3.1 but 3.2 is better

FriskyFennecFox · 2025-01-18T18:06:06Z

It seems that there is a distinct lack of 3B models. There's Llama-3.2-3B-Instruct which is notorious for its refusals, making it unsuitable for the cases that require the model to always return expected generations. Other than that, there's only Qwen2.5-3B-Instruct and Qwen2.5-Coder-3B-Instruct, with the latter being a more niche-focused model.

The proposed model should fill this gap well. NousResearch is a well-known player in the LLM field and the Hermes 3 series of models have been researched and developed with enough care to be considered production-ready.

For a project with a mission to deliver in-browser LLMs, I think it's a must to add more official support for the smaller models.

CharlieFRuan · 2025-01-21T08:18:06Z

Thanks for the request, added in npm 0.2.78:

[Version] Bump version to 0.2.78 #653

CharlieFRuan · 2025-01-22T01:41:10Z

It is also now live on https://chat.webllm.ai/

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Model request: Hermes-3-Llama-3.2-3B #650

Model request: Hermes-3-Llama-3.2-3B #650

yyjhao commented Jan 18, 2025

FriskyFennecFox commented Jan 18, 2025

CharlieFRuan commented Jan 21, 2025 •

edited

Loading

CharlieFRuan commented Jan 22, 2025

Model request: Hermes-3-Llama-3.2-3B #650

Model request: Hermes-3-Llama-3.2-3B #650

Comments

yyjhao commented Jan 18, 2025

FriskyFennecFox commented Jan 18, 2025

CharlieFRuan commented Jan 21, 2025 • edited Loading

CharlieFRuan commented Jan 22, 2025

CharlieFRuan commented Jan 21, 2025 •

edited

Loading