Skip to content

Actions: EleutherAI/lm-evaluation-harness

All workflows

Actions

Loading...
Loading

Showing runs from all workflows
5,701 workflow runs
5,701 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

humaneval instruct
Tasks Modified #4112: Pull request #2650 opened by baberabb
January 22, 2025 16:49 1m 57s humaneval_instruct
January 22, 2025 16:49 1m 57s
Easily evaluate models steered by SAEs
Tasks Modified #4110: Pull request #2641 synchronize by AMindToThink
January 22, 2025 07:04 Action required AMindToThink:sae_steered
January 22, 2025 07:04 Action required
Easily evaluate models steered by SAEs
Unit Tests #4082: Pull request #2641 synchronize by AMindToThink
January 22, 2025 07:04 Action required AMindToThink:sae_steered
January 22, 2025 07:04 Action required
add llama3 tasks
Unit Tests #4081: Pull request #2556 synchronize by baberabb
January 22, 2025 00:16 6m 52s llama
January 22, 2025 00:16 6m 52s
add llama3 tasks
Tasks Modified #4109: Pull request #2556 synchronize by baberabb
January 22, 2025 00:16 2m 27s llama
January 22, 2025 00:16 2m 27s
add llama3 tasks
Unit Tests #4080: Pull request #2556 synchronize by baberabb
January 21, 2025 23:44 6m 45s llama
January 21, 2025 23:44 6m 45s
add llama3 tasks
Tasks Modified #4108: Pull request #2556 synchronize by baberabb
January 21, 2025 23:44 2m 5s llama
January 21, 2025 23:44 2m 5s
add llama3 tasks
Tasks Modified #4107: Pull request #2556 synchronize by baberabb
January 21, 2025 23:38 3m 1s llama
January 21, 2025 23:38 3m 1s
add llama3 tasks
Unit Tests #4079: Pull request #2556 synchronize by baberabb
January 21, 2025 23:38 6m 59s llama
January 21, 2025 23:38 6m 59s
add llama3 tasks
Unit Tests #4078: Pull request #2556 synchronize by baberabb
January 21, 2025 22:18 6m 57s llama
January 21, 2025 22:18 6m 57s
add llama3 tasks
Tasks Modified #4106: Pull request #2556 synchronize by baberabb
January 21, 2025 22:18 2m 23s llama
January 21, 2025 22:18 2m 23s
add llama3 tasks
Unit Tests #4077: Pull request #2556 synchronize by baberabb
January 21, 2025 22:08 7m 9s llama
January 21, 2025 22:08 7m 9s
add llama3 tasks
Tasks Modified #4105: Pull request #2556 synchronize by baberabb
January 21, 2025 22:08 2m 4s llama
January 21, 2025 22:08 2m 4s
add llama3 tasks
Unit Tests #4076: Pull request #2556 synchronize by baberabb
January 21, 2025 22:06 7m 44s llama
January 21, 2025 22:06 7m 44s
add llama3 tasks
Tasks Modified #4104: Pull request #2556 synchronize by baberabb
January 21, 2025 22:06 1m 49s llama
January 21, 2025 22:06 1m 49s
add llama3 tasks
Tasks Modified #4103: Pull request #2556 synchronize by baberabb
January 21, 2025 22:06 1m 50s llama
January 21, 2025 22:06 1m 50s
add llama3 tasks
Unit Tests #4075: Pull request #2556 synchronize by baberabb
January 21, 2025 22:06 7m 38s llama
January 21, 2025 22:06 7m 38s
add llama3 tasks
Unit Tests #4074: Pull request #2556 synchronize by baberabb
January 21, 2025 22:00 7m 31s llama
January 21, 2025 22:00 7m 31s
add llama3 tasks
Tasks Modified #4102: Pull request #2556 synchronize by baberabb
January 21, 2025 22:00 1m 51s llama
January 21, 2025 22:00 1m 51s
Easily evaluate models steered by SAEs
Tasks Modified #4101: Pull request #2641 synchronize by AMindToThink
January 21, 2025 20:57 Action required AMindToThink:sae_steered
January 21, 2025 20:57 Action required
Easily evaluate models steered by SAEs
Unit Tests #4073: Pull request #2641 synchronize by AMindToThink
January 21, 2025 20:57 Action required AMindToThink:sae_steered
January 21, 2025 20:57 Action required
add llama3 tasks
Unit Tests #4072: Pull request #2556 synchronize by baberabb
January 21, 2025 17:27 7m 18s llama
January 21, 2025 17:27 7m 18s
add llama3 tasks
Tasks Modified #4100: Pull request #2556 synchronize by baberabb
January 21, 2025 17:27 2m 26s llama
January 21, 2025 17:27 2m 26s