[Bugfix][AMD] Update torch_bindings so that scaled_fp4_quant isn't build on ROCm #13235

SageMoore · 2025-02-13T18:33:41Z

This should hopefully get the AMD CI green

Signed-off-by: Sage Moore <[email protected]>

github-actions · 2025-02-13T18:33:54Z

👋 Hi! Thank you for contributing to the vLLM project.

💬 Join our developer Slack at https://slack.vllm.ai to discuss your PR in #pr-reviews, coordinate on features in #feat- channels, or join special interest groups in #sig- channels.

Just a reminder: PRs would not trigger full CI run by default. Instead, it would only run fastcheck CI which starts running only a small and essential subset of CI tests to quickly catch errors. You can run other CI tests on top of those by going to your fastcheck build on Buildkite UI (linked in the PR checks section) and unblock them. If you do not have permission to unblock, ping simon-mo or khluu to add you in our Buildkite org.

Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging.

To run CI, PR reviewers can either: Add ready label to the PR or enable auto-merge.

🚀

shajrawi

Thanks for the quick fix!

mgoin

LGTM, are there no changes needed in the cmake file?

SageMoore · 2025-02-13T21:28:54Z

LGTM, are there no changes needed in the cmake file?

Doesn't look like it. #12784 seems to have done a good job making sure the kernel only gets built on the correct cuda version.

DarkLight1337 · 2025-02-14T04:18:27Z

Please fix the merge conflicts

…rocm-fp4-fix Signed-off-by: Sage Moore <[email protected]>

…ild on ROCm (vllm-project#13235)

init

a1cac3d

Signed-off-by: Sage Moore <[email protected]>

shajrawi approved these changes Feb 13, 2025

View reviewed changes

mgoin approved these changes Feb 13, 2025

View reviewed changes

mgoin added the ready ONLY add when PR is ready to merge/full CI is needed label Feb 13, 2025

mgoin changed the title ~~Update torch_bindings so that scaled_fp4_quant isn't build on ROCm~~ [Bugfix][AMD] Update torch_bindings so that scaled_fp4_quant isn't build on ROCm Feb 13, 2025

mgoin added the AMD GPU label Feb 13, 2025

mgoin added the force-merge label Feb 13, 2025

Merge branch 'main' of https://github.com/neuralmagic/vllm into sage/…

1378dd2

…rocm-fp4-fix Signed-off-by: Sage Moore <[email protected]>

DarkLight1337 enabled auto-merge (squash) February 14, 2025 15:28

Merge branch 'main' into sage/rocm-fp4-fix

81c14e2

simon-mo merged commit c9f9d5b into vllm-project:main Feb 15, 2025
50 of 58 checks passed

Sakalya pushed a commit to Sakalya/vllm that referenced this pull request Feb 15, 2025

[Bugfix][AMD] Update torch_bindings so that scaled_fp4_quant isn't bu…

58d59e7

…ild on ROCm (vllm-project#13235)

panf2333 pushed a commit to yottalabsai/vllm that referenced this pull request Feb 18, 2025

[Bugfix][AMD] Update torch_bindings so that scaled_fp4_quant isn't bu…

8749fbe

…ild on ROCm (vllm-project#13235)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Bugfix][AMD] Update torch_bindings so that scaled_fp4_quant isn't build on ROCm #13235

[Bugfix][AMD] Update torch_bindings so that scaled_fp4_quant isn't build on ROCm #13235

SageMoore commented Feb 13, 2025 •

edited

Loading

github-actions bot commented Feb 13, 2025

shajrawi left a comment

mgoin left a comment

SageMoore commented Feb 13, 2025

DarkLight1337 commented Feb 14, 2025

[Bugfix][AMD] Update torch_bindings so that scaled_fp4_quant isn't build on ROCm #13235

[Bugfix][AMD] Update torch_bindings so that scaled_fp4_quant isn't build on ROCm #13235

Conversation

SageMoore commented Feb 13, 2025 • edited Loading

github-actions bot commented Feb 13, 2025

shajrawi left a comment

Choose a reason for hiding this comment

mgoin left a comment

Choose a reason for hiding this comment

SageMoore commented Feb 13, 2025

DarkLight1337 commented Feb 14, 2025

SageMoore commented Feb 13, 2025 •

edited

Loading