v0.4.8

@baberabb released this 05 Mar 07:49
· 105 commits to main since this release
6d2abda

lm-eval v0.4.8 Release Notes

Key Improvements

  • New Backend Support:

  • Breaking Change: Python 3.8 support has been dropped as it reached end of life. Please upgrade to Python 3.9 or newer.

  • Added support for gen_prefix in config, which lets you append text after the <|assistant|> token (or at the end of non-chat prompts). This is particularly effective for evaluating instruct models.
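
To illustrate where a generation prefix ends up in the prompt, here is a minimal sketch. It is not the harness's implementation: build_prompt, the <|user|> marker, and the "Question:" layout are hypothetical names chosen for illustration, and only the <|assistant|> token and the gen_prefix name come from the note above.

```python
from typing import Optional

# Assumed chat markers for this sketch; real chat templates differ per model.
USER_TOKEN = "<|user|>"
ASSISTANT_TOKEN = "<|assistant|>"


def build_prompt(question: str, gen_prefix: Optional[str] = None, chat: bool = True) -> str:
    """Hypothetical helper showing where a generation prefix is appended."""
    if chat:
        # Chat-style prompt: the prefix goes right after the assistant marker.
        prompt = f"{USER_TOKEN}\n{question}\n{ASSISTANT_TOKEN}\n"
    else:
        # Non-chat prompt: the prefix goes at the very end of the prompt.
        prompt = f"Question: {question}\n"
    if gen_prefix:
        prompt += gen_prefix
    return prompt


if __name__ == "__main__":
    print(build_prompt("What is 2 + 2?", gen_prefix="The answer is"))
    print(build_prompt("What is 2 + 2?", gen_prefix="Answer:", chat=False))
```

In both cases the model continues from the prefix rather than from an empty assistant turn, which can steer an instruct model toward the answer format a task expects.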

New Benchmarks & Tasks

Code Evaluation

Multilingual Expansion

Ethics & Reasoning

Others

We extend our thanks to all contributors who made this release possible, and to our users for their continued support and feedback.

Thanks, the LM Eval Harness team (@baberabb and @lintangsutawika)

What's Changed

New Contributors

Full Changelog: v0.4.7...v0.4.8