Skip to content

Commit

Permalink
Additional tuning for grouped paged attention kernel
Browse files Browse the repository at this point in the history
Changed:
- num_stages
  • Loading branch information
whchung committed Feb 1, 2025
1 parent 9c5980e commit eb34954
Showing 1 changed file with 1 addition and 1 deletion.
Original file line number Diff line number Diff line change
Expand Up @@ -467,7 +467,7 @@ def _decode_grouped_att_m_fwd(
NUM_KV_SPLITS=NUM_KV_SPLITS,
logit_cap=logit_cap,
num_warps=4,
num_stages=2,
num_stages=1,
Lk=Lk,
Lv=Lv,
**extra_kargs,
Expand Down

0 comments on commit eb34954

Please sign in to comment.