Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

NeMo is not friendly to HF compatibility. #12166

Open
dyang67 opened this issue Feb 13, 2025 · 1 comment
Open

NeMo is not friendly to HF compatibility. #12166

dyang67 opened this issue Feb 13, 2025 · 1 comment
Assignees

Comments

@dyang67
Copy link

dyang67 commented Feb 13, 2025

Why isn’t NeMo compatible with HF’s checkpoints?

Each conversion causes many compatibility issues, significantly increasing Development costs and time.

Since HF models are updated daily, AI models' architectures undergo frequent changes, ensuring compatibility is quite challenging.

Give an example.

Mistral offers various models, such as NeMo-12B, Mistral-Small 22B, and the latest Mistral 24B. However, NeMo can only perfectly convert Mistral-7B and Mixtral-7*8B. With significant changes in Mistral 24B, converting HF checkpoints requires substantial effort and meticulous validation, creating a considerable challenge for developers.

Has NeMo considered being more compatible with HF? Directly load HF models without checkpoint conversion?

@akoumpa
Copy link
Member

akoumpa commented Feb 18, 2025

Hi @dyang67 thank you for the feedback,

we are working on making NeMo more HF compatible.

This work is under the term "nemo-automodel", you can find peft examples here and here, as well sft.

This is still under-development and we hope to have a first release out soon. Thank you.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants