[Model] Add DeepSeek-R1-Distill and Hermes-3-Llama-3.2 #652

CharlieFRuan · 2025-01-21T07:22:01Z

This PR adds the following models to the prebuilt list:

DeepSeek-R1-Distill-Qwen-7B-q4f16_1-MLC
DeepSeek-R1-Distill-Qwen-7B-q4f32_1-MLC
DeepSeek-R1-Distill-Llama-8B-q4f16_1-MLC
DeepSeek-R1-Distill-Llama-8B-q4f32_1-MLC
Hermes-3-Llama-3.2-3B-q4f16_1-MLC
Hermes-3-Llama-3.2-3B-q4f32_1-MLC

We will add DeepSeek-R1-Distill-Qwen-1.5B afterward, which is currently experiencing correctness issues.

Separately, we fix the handling of role_content_sep and role_empty_sep when it is "", which evaluates to false (currently we make it ": ", which is inconsistent with what the model expects).

### Change - The only change is #652, adding the following models: - `DeepSeek-R1-Distill-Qwen-7B-q4f16_1-MLC` - `DeepSeek-R1-Distill-Qwen-7B-q4f32_1-MLC` - `DeepSeek-R1-Distill-Llama-8B-q4f16_1-MLC` - `DeepSeek-R1-Distill-Llama-8B-q4f32_1-MLC` - `Hermes-3-Llama-3.2-3B-q4f16_1-MLC` - `Hermes-3-Llama-3.2-3B-q4f32_1-MLC` ### TVMjs - No change, version `0.18.0-dev2` just like 0.2.71

[Model] Add DeepSeek-R1-Distill

48f333e

CharlieFRuan merged commit 808685b into mlc-ai:main Jan 21, 2025
1 check passed

CharlieFRuan mentioned this pull request Jan 21, 2025

[Version] Bump version to 0.2.78 #653

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Model] Add DeepSeek-R1-Distill and Hermes-3-Llama-3.2 #652

[Model] Add DeepSeek-R1-Distill and Hermes-3-Llama-3.2 #652

CharlieFRuan commented Jan 21, 2025

[Model] Add DeepSeek-R1-Distill and Hermes-3-Llama-3.2 #652

[Model] Add DeepSeek-R1-Distill and Hermes-3-Llama-3.2 #652

Conversation

CharlieFRuan commented Jan 21, 2025