Mengzi-T5-base-MT模型大小 #48

yuange555 · 2022-09-21T08:55:17Z

为什么Mengzi-T5-base-MT的模型大小只有Mengzi-T5-base的一半，加载模型再保存以后，又恢复和base相同的大小

yuange555 · 2022-09-21T08:57:07Z

huajingyun · 2022-09-26T05:31:05Z

Mengzi-T5-base-MT训练过程使用fp16，保存模型的权重对应也是fp16，不影响直接加载使用。
而Mengzi-T5-base训练过程使用fp32，保存模型的权重对应也是fp32。
可以在config.json中查看参数torch_dtype，可以看到对应是float16或float32。

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Mengzi-T5-base-MT模型大小 #48

Mengzi-T5-base-MT模型大小 #48

yuange555 commented Sep 21, 2022

yuange555 commented Sep 21, 2022

huajingyun commented Sep 26, 2022

Mengzi-T5-base-MT模型大小 #48

Mengzi-T5-base-MT模型大小 #48

Comments

yuange555 commented Sep 21, 2022

yuange555 commented Sep 21, 2022

huajingyun commented Sep 26, 2022