Hi, this is nice work on GRPO for multimodal reasoning!
Could you provide a way to train Qwen2.5-VL or DeepSeek-VL2 with GRPO?
I think it can be supported naturally by updating transformers.
Qwen2.5-VL is now natively supported by Transformers and vLLM.
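For anyone checking whether their environment is recent enough, here is a minimal sketch. It assumes the model class is exported under the name Qwen2_5_VLForConditionalGeneration, as in recent transformers releases; the helper name has_qwen2_5_vl is made up for illustration. It only checks that the class is present and does not download or load any weights.

```python
import importlib


def has_qwen2_5_vl() -> bool:
    """Return True if the installed transformers exposes a Qwen2.5-VL model class."""
    try:
        transformers = importlib.import_module("transformers")
        # Assumed class name; older transformers releases do not export it.
        getattr(transformers, "Qwen2_5_VLForConditionalGeneration")
    except Exception:
        # transformers is missing, too old, or the import failed for another reason.
        return False
    return True


if __name__ == "__main__":
    if has_qwen2_5_vl():
        print("Qwen2.5-VL model class found in the installed transformers.")
    else:
        print("Qwen2.5-VL not available; try: pip install -U transformers")
```

If the check fails, upgrading transformers as suggested above is the first thing to try. This does not cover DeepSeek-VL2 or the GRPO training setup itself.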