You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
When I try to reproduce the results with gradient checkpointing enabled, I consistently encounter the following error. However, everything runs fine when gradient checkpointing is disabled.
Have you encountered a similar issue before?
Error message:
[rank0]: position_ids, rope_deltas = self.get_rope_index(
[rank0]: ^^^^^^^^^^^^^^^^^^^^
[rank0]: File "/miniconda3/envs/openr1_multimodal/lib/python3.12/site-packages/transformers/models/qwen2_vl/modeling_qwen2_vl.py", line 1456, in get_rope_index
[rank0]: input_ids = input_ids[attention_mask[i] == 1]
[rank0]: ~~~~~~~~~^^^^^^^^^^^^^^^^^^^^^^^^
[rank0]: IndexError: The shape of the mask [588] at index 0 does not match the shape of the indexed tensor [587] at index 0
Hi, thank you for your outstanding work!
When I try to reproduce the results with gradient checkpointing enabled, I consistently encounter the following error. However, everything runs fine when gradient checkpointing is disabled.
Have you encountered a similar issue before?
The text was updated successfully, but these errors were encountered: