-
Notifications
You must be signed in to change notification settings - Fork 27.7k
Issues: huggingface/transformers
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
Qwen2FlashAttention sliding windows are applied to wrong layers
bug
#35896
opened Jan 26, 2025 by
hzhua
safetensors_rust.SafetensorError: Error while serializing: IoError(Os { code: 5, kind: Uncategorized, message: "Input/output error" })
bug
#35895
opened Jan 26, 2025 by
JohnConnor123
4 tasks
Huggingface/transfomers v3.3.2 is throwing import errors when building on Nextjs 14
bug
#35884
opened Jan 25, 2025 by
cookiejul
2 of 4 tasks
Mllama training via FSDP device and dtype misassignment
bug
#35880
opened Jan 24, 2025 by
blbadger
2 of 4 tasks
Support Shared Cache
Cache
Feature request
Request for a new feature
#35876
opened Jan 24, 2025 by
LoserCheems
ZeroShotClassificationArgumentHandler should be explicit it has a somewhat unsafe internal behaviour.
Feature request
Request for a new feature
#35874
opened Jan 24, 2025 by
nicolasdalsass
ERROR: Video features and Video Tokens do not match!!!
bug
Multimodal
VLM
#35869
opened Jan 24, 2025 by
Diezz01
Paliegemma Pad Token not Masked
bug
Multimodal
VLM
#35855
opened Jan 23, 2025 by
TangsengT
2 of 4 tasks
resume_from_checkpoint failed when using PEFT LORA
bug
#35850
opened Jan 23, 2025 by
HenryYueHY
2 of 4 tasks
forward() got an unexpected keyword argument 'num_items_in_batch'
bug
#35838
opened Jan 22, 2025 by
Bachstelze
2 of 4 tasks
tokenizer_class:
LlamaTokenizerFast
becomes LlamaTokenizer
after load + immediate save
bug
#35832
opened Jan 22, 2025 by
Qubitium
2 of 4 tasks
ImportError: cannot import name 'NoneType' from 'types' on main in Python 3.9
bug
#35827
opened Jan 22, 2025 by
harupy
1 of 4 tasks
model.gradient_checkpointing_enable() makes loss.requires_grad be False
bug
#35826
opened Jan 22, 2025 by
ZCWei51
2 of 4 tasks
multi-gpu: test_model_parallel_beam_search tests fail with "IndexError: list index out of range"
#35824
opened Jan 21, 2025 by
dvrogozh
[Feature Request] Support register customize quantization method out-of-tree
Feature request
Request for a new feature
#35814
opened Jan 21, 2025 by
ice-tong
RWKV CUDA error: an illegal memory access was encountered during training from scratch
#35805
opened Jan 21, 2025 by
npkanaka
Previous Next
ProTip!
Follow long discussions with comments:>50.