Skip to content

Commit

Permalink
x
Browse files Browse the repository at this point in the history
Signed-off-by: SumanthRH <[email protected]>
  • Loading branch information
SumanthRH committed Feb 6, 2025
1 parent b9c0260 commit 7afb49f
Show file tree
Hide file tree
Showing 2 changed files with 2 additions and 2 deletions.
2 changes: 1 addition & 1 deletion skythought/skythought_evals/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -20,7 +20,7 @@ We provide a wrapper script `eval.py` to conveniently run reasoning benchmarks.
**NOTE**: For reproducing `Sky-T1-32B-Preview` results on `AIME` and `GPQADiamond` dataset, pass in temperatures as `0.7`.

```shell
python -m skythought_evals.eval --model NovaSky-AI/Sky-T1-32B-Preview --evals=aime,gpqa_diamond --tp=8 --temperatures 0.7 --n 8
python -m skythought_evals.eval --model NovaSky-AI/Sky-T1-32B-Preview --evals=aime,gpqa_diamond --tp=8 --temperatures 0.7 --n 16
```

#### Example Usage
Expand Down
2 changes: 1 addition & 1 deletion skythought/skythought_evals/inference_and_check.py
Original file line number Diff line number Diff line change
Expand Up @@ -666,7 +666,7 @@ def main():
if args.result_dir and not os.path.exists(args.result_dir):
os.makedirs(args.result_dir)
temperature_str = ",".join(map(str, temperatures))
file_suffix = f"{model_config.name}_{args.task}_{args.split}_subset_{args.subset}_filter_{args.filter_difficulty}"
file_suffix = f"{model_config.name}_{args.task}_{args.split}_subset_{args.subset}_filter_{args.filter_difficulty}" + \
f"_s{args.start}_e{args.end}_t{temperature_str}_n{args.n}"
if (
args.math_difficulty_lower_bound is not None
Expand Down

0 comments on commit 7afb49f

Please sign in to comment.