Skip to content

[Bugfix] Remove hardcoded head_size=256 for Deepseek v2 and v3 (#12… #13

[Bugfix] Remove hardcoded head_size=256 for Deepseek v2 and v3 (#12…

[Bugfix] Remove hardcoded head_size=256 for Deepseek v2 and v3 (#12… #13

Annotations

1 warning

codespell (3.12)

succeeded Jan 16, 2025 in 16s