Skip to content

Actions: huggingface/lighteval

Tests

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
1,923 workflow runs
1,923 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

Add G-Pass@k Metric
Tests #2253: Pull request #589 synchronize by jnanliu
February 28, 2025 06:51 Action required jnanliu:g-pass-at-k-dev
February 28, 2025 06:51 Action required
Add Doc Strings to Config Files
Tests #2252: Pull request #465 synchronize by ParagEkbote
February 27, 2025 19:06 Action required ParagEkbote:Document-Custom-Model-Files
February 27, 2025 19:06 Action required
Add Doc Strings to Config Files
Tests #2251: Pull request #465 synchronize by ParagEkbote
February 27, 2025 18:56 Action required ParagEkbote:Document-Custom-Model-Files
February 27, 2025 18:56 Action required
Add Doc Strings to Config Files
Tests #2250: Pull request #465 synchronize by ParagEkbote
February 27, 2025 17:19 Action required ParagEkbote:Document-Custom-Model-Files
February 27, 2025 17:19 Action required
Add Doc Strings to Config Files
Tests #2249: Pull request #465 synchronize by ParagEkbote
February 26, 2025 17:59 Action required ParagEkbote:Document-Custom-Model-Files
February 26, 2025 17:59 Action required
Add G-Pass@k Metric
Tests #2248: Pull request #589 opened by jnanliu
February 26, 2025 10:14 Action required jnanliu:g-pass-at-k-dev
February 26, 2025 10:14 Action required
Add subsets for lcb (#587)
Tests #2247: Commit ed08481 pushed by NathanHB
February 26, 2025 09:52 37m 51s main
February 26, 2025 09:52 37m 51s
Fixing some silent bugs in Arabic Custom Tasks
Tests #2246: Pull request #556 synchronize by alielfilali01
February 26, 2025 08:00 Action required alielfilali01:main
February 26, 2025 08:00 Action required
Fixing some silent bugs in Arabic Custom Tasks
Tests #2245: Pull request #556 synchronize by alielfilali01
February 26, 2025 07:43 Action required alielfilali01:main
February 26, 2025 07:43 Action required
Propagate vLLM batch size controls
Tests #2244: Pull request #588 opened by alvin319
February 25, 2025 19:54 Action required alvin319:vllm-batch-size-control
February 25, 2025 19:54 Action required
adds aime24, 25 and math500 (#586)
Tests #2243: Commit 4c9af85 pushed by NathanHB
February 25, 2025 17:06 37m 45s main
February 25, 2025 17:06 37m 45s
Add subsets for lcb
Tests #2242: Pull request #587 synchronize by plaguss
February 25, 2025 15:40 37m 34s plaguss:lcb-v4
February 25, 2025 15:40 37m 34s
Add subsets for lcb
Tests #2241: Pull request #587 synchronize by plaguss
February 25, 2025 15:37 3m 22s plaguss:lcb-v4
February 25, 2025 15:37 3m 22s
Add subsets for lcb
Tests #2240: Pull request #587 synchronize by plaguss
February 25, 2025 15:35 37m 58s plaguss:lcb-v4
February 25, 2025 15:35 37m 58s
Add subsets for lcb
Tests #2239: Pull request #587 opened by plaguss
February 25, 2025 15:21 38m 21s plaguss:lcb-v4
February 25, 2025 15:21 38m 21s
Add draft functionality for a generic sandboxed code running
Tests #2238: Pull request #580 synchronize by plaguss
February 25, 2025 15:01 38m 40s plaguss:code-run
February 25, 2025 15:01 38m 40s
adds aime24, 25 and math500
Tests #2237: Pull request #586 synchronize by NathanHB
February 25, 2025 14:59 38m 27s nathan-add-aime24-25
February 25, 2025 14:59 38m 27s
adds aime24, 25 and math500
Tests #2236: Pull request #586 synchronize by NathanHB
February 25, 2025 14:40 38m 5s nathan-add-aime24-25
February 25, 2025 14:40 38m 5s
adds aime24, 25 and math500
Tests #2235: Pull request #586 synchronize by NathanHB
February 25, 2025 13:15 39m 21s nathan-add-aime24-25
February 25, 2025 13:15 39m 21s
adds aime24, 25 and math500
Tests #2234: Pull request #586 synchronize by NathanHB
February 25, 2025 12:24 44m 32s nathan-add-aime24-25
February 25, 2025 12:24 44m 32s
adds aime24, 25 and math500
Tests #2233: Pull request #586 opened by NathanHB
February 25, 2025 11:09 38m 0s nathan-add-aime24-25
February 25, 2025 11:09 38m 0s
docs: update README to reflect new model evaluation entry points (#581)
Tests #2232: Commit 066f84f pushed by NathanHB
February 25, 2025 09:50 38m 27s main
February 25, 2025 09:50 38m 27s
parse seed for vllm (#585)
Tests #2231: Commit 95068aa pushed by NathanHB
February 25, 2025 09:50 37m 48s main
February 25, 2025 09:50 37m 48s
Push details without converting fields to str (#572)
Tests #2230: Commit 7b42113 pushed by NathanHB
February 25, 2025 09:06 37m 56s main
February 25, 2025 09:06 37m 56s
Added custom model inference.
Tests #2229: Pull request #437 synchronize by NathanHB
February 24, 2025 14:05 Action required JoelNiklaus:add-custom-model
February 24, 2025 14:05 Action required