Skip to content

Actions: huggingface/lighteval

Scan Secret Leaks

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
236 workflow runs
236 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

Merge branch 'main' into add-gpqa-generative
Scan Secret Leaks #161: Commit 69814ba pushed by lewtun
February 5, 2025 12:24 16s add-gpqa-generative
February 5, 2025 12:24 16s
make tokenizer lazy too
Scan Secret Leaks #160: Commit 05795e0 pushed by hynky1999
February 5, 2025 12:09 18s make_bleurt_lazy
February 5, 2025 12:09 18s
make bleur lazy
Scan Secret Leaks #159: Commit 46d1585 pushed by hynky1999
February 5, 2025 12:06 17s make_bleurt_lazy
February 5, 2025 12:06 17s
Sync Math-verify (#535)
Scan Secret Leaks #158: Commit cb35bea pushed by hynky1999
February 5, 2025 11:34 16s main
February 5, 2025 11:34 16s
Tune prompt
Scan Secret Leaks #157: Commit 8b47e01 pushed by lewtun
February 5, 2025 10:14 17s add-gpqa-generative
February 5, 2025 10:14 17s
commit
Scan Secret Leaks #156: Commit b4c2d77 pushed by NathanHB
February 5, 2025 09:15 17s nathan-fix-vllm-from-file
February 5, 2025 09:15 17s
docstring
Scan Secret Leaks #155: Commit 86f4978 pushed by hynky1999
February 5, 2025 00:47 16s sync_math_verify
February 5, 2025 00:47 16s
fmt
Scan Secret Leaks #154: Commit 8b7711f pushed by hynky1999
February 5, 2025 00:45 17s sync_math_verify
February 5, 2025 00:45 17s
rm todo
Scan Secret Leaks #153: Commit a75113d pushed by hynky1999
February 5, 2025 00:25 15s sync_math_verify
February 5, 2025 00:25 15s
revert symbols, improve sets handling
Scan Secret Leaks #152: Commit c536de0 pushed by hynky1999
February 5, 2025 00:22 16s sync_math_verify
February 5, 2025 00:22 16s
update extraction match to reflect newest math-verify
Scan Secret Leaks #151: Commit c2cb488 pushed by hynky1999
February 4, 2025 19:07 17s sync_math_verify
February 4, 2025 19:07 17s
Refactor
Scan Secret Leaks #150: Commit fa00c5f pushed by lewtun
February 4, 2025 16:14 19s add-gpqa-generative
February 4, 2025 16:14 19s
Update src/lighteval/main_vllm.py
Scan Secret Leaks #149: Commit 2802744 pushed by NathanHB
February 4, 2025 15:10 17s nathan-fix-vllm-from-file
February 4, 2025 15:10 17s
Add ref
Scan Secret Leaks #148: Commit 88f939e pushed by lewtun
February 4, 2025 14:57 16s add-gpqa-generative
February 4, 2025 14:57 16s
Add GPQA for instruct models
Scan Secret Leaks #147: Commit 09c6c7b pushed by lewtun
February 4, 2025 14:54 21s add-gpqa-generative
February 4, 2025 14:54 21s
commit
Scan Secret Leaks #146: Commit 8e21cd5 pushed by NathanHB
February 4, 2025 14:07 16s nathan-fix-vllm-from-file
February 4, 2025 14:07 16s
commit
Scan Secret Leaks #145: Commit a19e07c pushed by NathanHB
February 4, 2025 14:04 19s nathan-fix-vllm-from-file
February 4, 2025 14:04 19s
Add custom task (bac-fr) for evaluation of models in french (#518)
Scan Secret Leaks #144: Commit d7a1f11 pushed by clefourrier
February 3, 2025 16:08 16s main
February 3, 2025 16:08 16s
Update french_evals.py
Scan Secret Leaks #143: Commit be7da17 pushed by clefourrier
February 3, 2025 12:13 17s main
February 3, 2025 12:13 17s
adds olympiad bench (#521)
Scan Secret Leaks #142: Commit d332207 pushed by NathanHB
January 31, 2025 14:20 17s main
January 31, 2025 14:20 17s
fix review
Scan Secret Leaks #141: Commit 1dd74f0 pushed by clefourrier
January 30, 2025 18:53 16s clem_last_exam
January 30, 2025 18:53 16s
adding comments
Scan Secret Leaks #140: Commit 7f34927 pushed by NathanHB
January 30, 2025 13:22 16s nathan-adds-olympiad-bench
January 30, 2025 13:22 16s
Improve readability of the quick tour. (#501)
Scan Secret Leaks #139: Commit 515bd01 pushed by clefourrier
January 30, 2025 13:11 16s main
January 30, 2025 13:11 16s
Implemented the possibility to load predictions from details files an…
Scan Secret Leaks #138: Commit 94fc5a2 pushed by NathanHB
January 29, 2025 14:59 18s main
January 29, 2025 14:59 18s
revert harcoding
Scan Secret Leaks #137: Commit c3e02ea pushed by clefourrier
January 29, 2025 14:20 16s clem_last_exam
January 29, 2025 14:20 16s