You can use SmoothQuant to implement w4a8 quantization, but this may result in a non-negligible loss of model performance. If you are interested in performing w4a8 quantization for inference, you can refer to our new project QQQ.
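For readers unfamiliar with the notation, "w4a8" means 4-bit weights and 8-bit activations. The sketch below is only an illustration of why dropping weights from int8 to int4 costs precision. It uses simple symmetric per-tensor rounding on made-up values. The helper names `fake_quantize` and `max_abs_error` are hypothetical and are not part of AutoSmoothQuant, SmoothQuant, or QQQ, and real implementations typically use finer-grained scales and calibration.

```python
def fake_quantize(values, bits):
    """Round values to a signed `bits`-bit symmetric grid, then map back to floats."""
    qmax = 2 ** (bits - 1) - 1          # 7 for int4, 127 for int8
    max_abs = max(abs(v) for v in values)
    if max_abs == 0.0:
        return list(values)             # nothing to scale
    scale = max_abs / qmax              # one scale shared by the whole tensor
    quantized = [max(-qmax, min(qmax, round(v / scale))) for v in values]
    return [q * scale for q in quantized]


def max_abs_error(values, bits):
    """Largest absolute difference between the values and their quantized form."""
    dequantized = fake_quantize(values, bits)
    return max(abs(a - b) for a, b in zip(values, dequantized))


# Made-up numbers, chosen only for illustration.
weights = [0.82, -0.41, 0.07, -0.93, 0.15, 0.56]
activations = [3.1, -0.2, 0.9, -4.7, 1.3, 0.05]

w4_error = max_abs_error(weights, bits=4)       # the "w4" half of w4a8
w8_error = max_abs_error(weights, bits=8)       # int8 weights, for comparison
a8_error = max_abs_error(activations, bits=8)   # the "a8" half of w4a8

print(f"int4 weight error: {w4_error:.4f}")
print(f"int8 weight error: {w8_error:.4f}")
print(f"int8 activation error: {a8_error:.4f}")
```

With only 15 usable levels, the int4 grid is much coarser than the int8 grid, which is one intuition for the accuracy loss mentioned above.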
Does autosmoothquant only support int8 quantization? Can it also support int4 quantization?