Is int4 quantization supported? #23

Open
AlexMa0 opened this issue Jun 28, 2024 · 1 comment

Comments

@AlexMa0

AlexMa0 commented Jun 28, 2024

Does autosmoothquant only support int8 quantization? Can it support int4 quantization?

@AniZpZ
Owner

AniZpZ commented Jun 28, 2024

You can use SmoothQuant to implement W4A8 quantization (4-bit weights, 8-bit activations), but this may cause a non-negligible loss of model accuracy. If you are interested in W4A8 quantization for inference, see our new project, QQQ.
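To give a rough feel for why 4-bit weights can cost accuracy, here is a minimal pure-Python sketch of symmetric per-tensor quantization with int4 weights and int8 activations. It is only an illustration: it leaves out SmoothQuant's smoothing step entirely, it is not the AutoSmoothQuant or QQQ API, and the helper names (`quantize_symmetric`, `dequantize`, `max_abs_error`, `w4a8_dot`) and the sample values are made up for this example.

```python
def quantize_symmetric(values, n_bits):
    """Symmetric per-tensor quantization: returns (integer codes, scale)."""
    qmax = 2 ** (n_bits - 1) - 1              # 7 for int4, 127 for int8
    absmax = max(abs(v) for v in values) or 1.0  # avoid a zero scale
    scale = absmax / qmax
    # Round to the nearest integer level and clamp to the signed range.
    codes = [max(-qmax - 1, min(qmax, round(v / scale))) for v in values]
    return codes, scale


def dequantize(codes, scale):
    """Map integer codes back to approximate real values."""
    return [c * scale for c in codes]


def max_abs_error(values, n_bits):
    """Largest round-trip error after quantizing to n_bits."""
    codes, scale = quantize_symmetric(values, n_bits)
    return max(abs(v - r) for v, r in zip(values, dequantize(codes, scale)))


def w4a8_dot(weights, activations):
    """Dot product with int4 weights and int8 activations (hypothetical helper)."""
    w_codes, w_scale = quantize_symmetric(weights, 4)
    a_codes, a_scale = quantize_symmetric(activations, 8)
    acc = sum(w * a for w, a in zip(w_codes, a_codes))  # integer accumulation
    return acc * w_scale * a_scale                      # rescale once at the end


# Made-up sample data, not taken from any real model.
weights = [0.31, -0.12, 0.05, -0.27, 0.18, -0.02, 0.23, -0.35]
activations = [1.2, -0.4, 2.0, 0.7, -1.5, 0.3, -0.9, 1.1]

print("int4 weight error:", max_abs_error(weights, 4))
print("int8 weight error:", max_abs_error(weights, 8))
print("w4a8 dot product :", w4a8_dot(weights, activations))
print("float dot product:", sum(w * a for w, a in zip(weights, activations)))
```

With only 16 levels available, the int4 round-trip error on the weights is much larger than the int8 error, which is one reason W4A8 usually needs extra care to stay accurate.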
