You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
`here is the message, looking for using it with comfyui
Building wheels for collected packages: nunchaku
Building editable for nunchaku (pyproject.toml) ... error
error: subprocess-exited-with-error
× Building editable for nunchaku (pyproject.toml) did not run successfully.
│ exit code: 1
╰─> [325 lines of output]
/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/torch/subclasses/functional_tensor.py:295: UserWarning: Failed to initialize NumPy: No module named 'numpy' (Triggered internally at ../torch/csrc/utils/tensor_numpy.cpp:84.)
cpu = conversion_method_template(device=torch.device("cpu"))
/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/setuptools/dist.py:330: InformationOnly: Normalizing '0.0.2beta0' to '0.0.2b0'
self.metadata.version = self.normalize_version(self.metadata.version)
running editable_wheel
creating /tmp/pip-wheel-hu865_m8/.tmp-b16zse7t/nunchaku.egg-info
writing /tmp/pip-wheel-hu865_m8/.tmp-b16zse7t/nunchaku.egg-info/PKG-INFO
writing dependency_links to /tmp/pip-wheel-hu865_m8/.tmp-b16zse7t/nunchaku.egg-info/dependency_links.txt
writing requirements to /tmp/pip-wheel-hu865_m8/.tmp-b16zse7t/nunchaku.egg-info/requires.txt
writing top-level names to /tmp/pip-wheel-hu865_m8/.tmp-b16zse7t/nunchaku.egg-info/top_level.txt
writing manifest file '/tmp/pip-wheel-hu865_m8/.tmp-b16zse7t/nunchaku.egg-info/SOURCES.txt'
reading manifest file '/tmp/pip-wheel-hu865_m8/.tmp-b16zse7t/nunchaku.egg-info/SOURCES.txt'
reading manifest template 'MANIFEST.in'
warning: no files found matching '.hpp' under directory 'src'
warning: no files found matching '.ipp' under directory 'src'
warning: no files found matching '.hpp' under directory 'nunchaku/csrc'
warning: no files found matching '.ipp' under directory 'nunchaku/csrc'
warning: no files found matching '.cu' under directory 'nunchaku/csrc'
warning: no files found matching '.cuh' under directory 'nunchaku/csrc'
warning: no files found matching '.hpp' under directory 'third_party/Block-Sparse-Attention/csrc/block_sparse_attn'
warning: no files found matching '.ipp' under directory 'third_party/Block-Sparse-Attention/csrc/block_sparse_attn'
warning: no files found matching '.cpp' under directory 'third_party/cutlass/include'
warning: no files found matching '.ipp' under directory 'third_party/cutlass/include'
warning: no files found matching '.cu' under directory 'third_party/cutlass/include'
warning: no files found matching '.cuh' under directory 'third_party/cutlass/include'
warning: no files found matching '.cpp' under directory 'third_party/json/include'
warning: no files found matching '.h' under directory 'third_party/json/include'
warning: no files found matching '.ipp' under directory 'third_party/json/include'
warning: no files found matching '.cpp' under directory 'third_party/mio/include'
warning: no files found matching '.h' under directory 'third_party/mio/include'
warning: no files found matching '.cpp' under directory 'third_party/spdlog/include'
warning: no files found matching '.hpp' under directory 'third_party/spdlog/include'
warning: no files found matching '.ipp' under directory 'third_party/spdlog/include'
adding license file 'LICENCE.txt'
writing manifest file '/tmp/pip-wheel-hu865_m8/.tmp-b16zse7t/nunchaku.egg-info/SOURCES.txt'
creating '/tmp/pip-wheel-hu865_m8/.tmp-b16zse7t/nunchaku-0.0.2b0.dist-info'
creating /tmp/pip-wheel-hu865_m8/.tmp-b16zse7t/nunchaku-0.0.2b0.dist-info/WHEEL
running build_py
running build_ext
/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/torch/utils/cpp_extension.py:416: UserWarning: The detected CUDA version (12.0) has a minor version mismatch with the version that was used to compile PyTorch (12.4). Most likely this shouldn't be a problem.
warnings.warn(CUDA_MISMATCH_WARN.format(cuda_str_version, torch.version.cuda))
/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/torch/utils/cpp_extension.py:426: UserWarning: There are no g++ version bounds defined for CUDA version 12.0
warnings.warn(f'There are no {compiler_name} version bounds defined for CUDA version {cuda_str_version}')
building 'nunchaku.C' extension
creating /tmp/tmprtqst5pb.build-temp/nunchaku/csrc
creating /tmp/tmprtqst5pb.build-temp/src
creating /tmp/tmprtqst5pb.build-temp/src/interop
creating /tmp/tmprtqst5pb.build-temp/src/kernels
creating /tmp/tmprtqst5pb.build-temp/src/kernels/awq
creating /tmp/tmprtqst5pb.build-temp/third_party/Block-Sparse-Attention/csrc/block_sparse_attn
creating /tmp/tmprtqst5pb.build-temp/third_party/Block-Sparse-Attention/csrc/block_sparse_attn/src
Emitting ninja build file /tmp/tmprtqst5pb.build-temp/build.ninja...
Compiling objects...
Allowing ninja to set a default number of workers... (overridable by setting the environment variable MAX_JOBS=N)
[1/24] /usr/bin/nvcc --generate-dependencies-with-compile --dependency-output /tmp/tmprtqst5pb.build-temp/src/kernels/activation_kernels.o.d -I/home/theduc/nunchaku/src -I/home/theduc/nunchaku/third_party/cutlass/include -I/home/theduc/nunchaku/third_party/json/include -I/home/theduc/nunchaku/third_party/mio/include -I/home/theduc/nunchaku/third_party/spdlog/include -I/home/theduc/nunchaku/third_party/Block-Sparse-Attention/csrc/block_sparse_attn -I/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/torch/include -I/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/torch/include/TH -I/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/torch/include/THC -I/home/theduc/.pyenv/versions/3.12.8/include/python3.12 -c -c /home/theduc/nunchaku/src/kernels/activation_kernels.cu -o /tmp/tmprtqst5pb.build-temp/src/kernels/activation_kernels.o -D__CUDA_NO_HALF_OPERATORS -D__CUDA_NO_HALF_CONVERSIONS -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ --expt-relaxed-constexpr --compiler-options ''"'"'-fPIC'"'"'' -DENABLE_BF16=1 -DBUILD_NUNCHAKU=1 -gencode arch=compute_86,code=sm_86 -gencode arch=compute_89,code=sm_89 -g -std=c++20 -UNDEBUG -Xcudafe --diag_suppress=20208 -U__CUDA_NO_HALF_OPERATORS__ -U__CUDA_NO_HALF_CONVERSIONS__ -U__CUDA_NO_HALF2_OPERATORS__ -U__CUDA_NO_HALF2_CONVERSIONS__ -U__CUDA_NO_BFLOAT16_OPERATORS__ -U__CUDA_NO_BFLOAT16_CONVERSIONS__ -U__CUDA_NO_BFLOAT162_OPERATORS__ -U__CUDA_NO_BFLOAT162_CONVERSIONS__ --threads=2 --expt-relaxed-constexpr --expt-extended-lambda --generate-line-info --ptxas-options=--allow-expensive-optimizations=true -DTORCH_API_INCLUDE_EXTENSION_H '-DPYBIND11_COMPILER_TYPE="gcc"' '-DPYBIND11_STDLIB="libstdcpp"' '-DPYBIND11_BUILD_ABI="cxxabi1011"' -DTORCH_EXTENSION_NAME=C -D_GLIBCXX_USE_CXX11_ABI=0
FAILED: /tmp/tmprtqst5pb.build-temp/src/kernels/activation_kernels.o
/usr/bin/nvcc --generate-dependencies-with-compile --dependency-output /tmp/tmprtqst5pb.build-temp/src/kernels/activation_kernels.o.d -I/home/theduc/nunchaku/src -I/home/theduc/nunchaku/third_party/cutlass/include -I/home/theduc/nunchaku/third_party/json/include -I/home/theduc/nunchaku/third_party/mio/include -I/home/theduc/nunchaku/third_party/spdlog/include -I/home/theduc/nunchaku/third_party/Block-Sparse-Attention/csrc/block_sparse_attn -I/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/torch/include -I/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/torch/include/TH -I/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/torch/include/THC -I/home/theduc/.pyenv/versions/3.12.8/include/python3.12 -c -c /home/theduc/nunchaku/src/kernels/activation_kernels.cu -o /tmp/tmprtqst5pb.build-temp/src/kernels/activation_kernels.o -D__CUDA_NO_HALF_OPERATORS -D__CUDA_NO_HALF_CONVERSIONS -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ --expt-relaxed-constexpr --compiler-options ''"'"'-fPIC'"'"'' -DENABLE_BF16=1 -DBUILD_NUNCHAKU=1 -gencode arch=compute_86,code=sm_86 -gencode arch=compute_89,code=sm_89 -g -std=c++20 -UNDEBUG -Xcudafe --diag_suppress=20208 -U__CUDA_NO_HALF_OPERATORS__ -U__CUDA_NO_HALF_CONVERSIONS__ -U__CUDA_NO_HALF2_OPERATORS__ -U__CUDA_NO_HALF2_CONVERSIONS__ -U__CUDA_NO_BFLOAT16_OPERATORS__ -U__CUDA_NO_BFLOAT16_CONVERSIONS__ -U__CUDA_NO_BFLOAT162_OPERATORS__ -U__CUDA_NO_BFLOAT162_CONVERSIONS__ --threads=2 --expt-relaxed-constexpr --expt-extended-lambda --generate-line-info --ptxas-options=--allow-expensive-optimizations=true -DTORCH_API_INCLUDE_EXTENSION_H '-DPYBIND11_COMPILER_TYPE="_gcc"' '-DPYBIND11_STDLIB="_libstdcpp"' '-DPYBIND11_BUILD_ABI="_cxxabi1011"' -DTORCH_EXTENSION_NAME=_C -D_GLIBCXX_USE_CXX11_ABI=0
/home/theduc/nunchaku/third_party/spdlog/include/spdlog/fmt/bundled/format.h(4449): error: a literal operator template must have a template parameter list equivalent to "<char ...>"
/home/theduc/nunchaku/src/common.h(41): error: call to consteval function "std::source_location::current" did not produce a valid constant expression
/usr/include/c++/12/source_location(59): note #2703-D: cannot call non-constexpr function "__builtin_source_location" (declared implicitly)
/home/theduc/nunchaku/src/common.h(48): error: call to consteval function "std::source_location::current" did not produce a valid constant expression
/usr/include/c++/12/source_location(59): note #2703-D: cannot call non-constexpr function "__builtin_source_location" (declared implicitly)
3 errors detected in the compilation of "/home/theduc/nunchaku/src/kernels/activation_kernels.cu".
[2/24] /usr/bin/nvcc --generate-dependencies-with-compile --dependency-output /tmp/tmprtqst5pb.build-temp/src/kernels/misc_kernels.o.d -I/home/theduc/nunchaku/src -I/home/theduc/nunchaku/third_party/cutlass/include -I/home/theduc/nunchaku/third_party/json/include -I/home/theduc/nunchaku/third_party/mio/include -I/home/theduc/nunchaku/third_party/spdlog/include -I/home/theduc/nunchaku/third_party/Block-Sparse-Attention/csrc/block_sparse_attn -I/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/torch/include -I/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/torch/include/TH -I/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/torch/include/THC -I/home/theduc/.pyenv/versions/3.12.8/include/python3.12 -c -c /home/theduc/nunchaku/src/kernels/misc_kernels.cu -o /tmp/tmprtqst5pb.build-temp/src/kernels/misc_kernels.o -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_HALF_CONVERSIONS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ --expt-relaxed-constexpr --compiler-options ''"'"'-fPIC'"'"'' -DENABLE_BF16=1 -DBUILD_NUNCHAKU=1 -gencode arch=compute_86,code=sm_86 -gencode arch=compute_89,code=sm_89 -g -std=c++20 -UNDEBUG -Xcudafe --diag_suppress=20208 -U__CUDA_NO_HALF_OPERATORS__ -U__CUDA_NO_HALF_CONVERSIONS__ -U__CUDA_NO_HALF2_OPERATORS__ -U__CUDA_NO_HALF2_CONVERSIONS__ -U__CUDA_NO_BFLOAT16_OPERATORS__ -U__CUDA_NO_BFLOAT16_CONVERSIONS__ -U__CUDA_NO_BFLOAT162_OPERATORS__ -U__CUDA_NO_BFLOAT162_CONVERSIONS__ --threads=2 --expt-relaxed-constexpr --expt-extended-lambda --generate-line-info --ptxas-options=--allow-expensive-optimizations=true -DTORCH_API_INCLUDE_EXTENSION_H '-DPYBIND11_COMPILER_TYPE="_gcc"' '-DPYBIND11_STDLIB="_libstdcpp"' '-DPYBIND11_BUILD_ABI="_cxxabi1011"' -DTORCH_EXTENSION_NAME=_C -D_GLIBCXX_USE_CXX11_ABI=0
FAILED: /tmp/tmprtqst5pb.build-temp/src/kernels/misc_kernels.o
/usr/bin/nvcc --generate-dependencies-with-compile --dependency-output /tmp/tmprtqst5pb.build-temp/src/kernels/misc_kernels.o.d -I/home/theduc/nunchaku/src -I/home/theduc/nunchaku/third_party/cutlass/include -I/home/theduc/nunchaku/third_party/json/include -I/home/theduc/nunchaku/third_party/mio/include -I/home/theduc/nunchaku/third_party/spdlog/include -I/home/theduc/nunchaku/third_party/Block-Sparse-Attention/csrc/block_sparse_attn -I/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/torch/include -I/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/torch/include/TH -I/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/torch/include/THC -I/home/theduc/.pyenv/versions/3.12.8/include/python3.12 -c -c /home/theduc/nunchaku/src/kernels/misc_kernels.cu -o /tmp/tmprtqst5pb.build-temp/src/kernels/misc_kernels.o -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_HALF_CONVERSIONS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ --expt-relaxed-constexpr --compiler-options ''"'"'-fPIC'"'"'' -DENABLE_BF16=1 -DBUILD_NUNCHAKU=1 -gencode arch=compute_86,code=sm_86 -gencode arch=compute_89,code=sm_89 -g -std=c++20 -UNDEBUG -Xcudafe --diag_suppress=20208 -U__CUDA_NO_HALF_OPERATORS__ -U__CUDA_NO_HALF_CONVERSIONS__ -U__CUDA_NO_HALF2_OPERATORS__ -U__CUDA_NO_HALF2_CONVERSIONS__ -U__CUDA_NO_BFLOAT16_OPERATORS__ -U__CUDA_NO_BFLOAT16_CONVERSIONS__ -U__CUDA_NO_BFLOAT162_OPERATORS__ -U__CUDA_NO_BFLOAT162_CONVERSIONS__ --threads=2 --expt-relaxed-constexpr --expt-extended-lambda --generate-line-info --ptxas-options=--allow-expensive-optimizations=true -DTORCH_API_INCLUDE_EXTENSION_H '-DPYBIND11_COMPILER_TYPE="_gcc"' '-DPYBIND11_STDLIB="_libstdcpp"' '-DPYBIND11_BUILD_ABI="_cxxabi1011"' -DTORCH_EXTENSION_NAME=_C -D_GLIBCXX_USE_CXX11_ABI=0
/home/theduc/nunchaku/third_party/spdlog/include/spdlog/fmt/bundled/format.h(4449): error: a literal operator template must have a template parameter list equivalent to "<char ...>"
/home/theduc/nunchaku/src/common.h(41): error: call to consteval function "std::source_location::current" did not produce a valid constant expression
/usr/include/c++/12/source_location(59): note #2703-D: cannot call non-constexpr function "__builtin_source_location" (declared implicitly)
/home/theduc/nunchaku/src/common.h(48): error: call to consteval function "std::source_location::current" did not produce a valid constant expression
/usr/include/c++/12/source_location(59): note #2703-D: cannot call non-constexpr function "__builtin_source_location" (declared implicitly)
3 errors detected in the compilation of "/home/theduc/nunchaku/src/kernels/misc_kernels.cu".
[3/24] /usr/bin/nvcc --generate-dependencies-with-compile --dependency-output /tmp/tmprtqst5pb.build-temp/src/kernels/layernorm_kernels.o.d -I/home/theduc/nunchaku/src -I/home/theduc/nunchaku/third_party/cutlass/include -I/home/theduc/nunchaku/third_party/json/include -I/home/theduc/nunchaku/third_party/mio/include -I/home/theduc/nunchaku/third_party/spdlog/include -I/home/theduc/nunchaku/third_party/Block-Sparse-Attention/csrc/block_sparse_attn -I/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/torch/include -I/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/torch/include/TH -I/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/torch/include/THC -I/home/theduc/.pyenv/versions/3.12.8/include/python3.12 -c -c /home/theduc/nunchaku/src/kernels/layernorm_kernels.cu -o /tmp/tmprtqst5pb.build-temp/src/kernels/layernorm_kernels.o -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_HALF_CONVERSIONS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ --expt-relaxed-constexpr --compiler-options ''"'"'-fPIC'"'"'' -DENABLE_BF16=1 -DBUILD_NUNCHAKU=1 -gencode arch=compute_86,code=sm_86 -gencode arch=compute_89,code=sm_89 -g -std=c++20 -UNDEBUG -Xcudafe --diag_suppress=20208 -U__CUDA_NO_HALF_OPERATORS__ -U__CUDA_NO_HALF_CONVERSIONS__ -U__CUDA_NO_HALF2_OPERATORS__ -U__CUDA_NO_HALF2_CONVERSIONS__ -U__CUDA_NO_BFLOAT16_OPERATORS__ -U__CUDA_NO_BFLOAT16_CONVERSIONS__ -U__CUDA_NO_BFLOAT162_OPERATORS__ -U__CUDA_NO_BFLOAT162_CONVERSIONS__ --threads=2 --expt-relaxed-constexpr --expt-extended-lambda --generate-line-info --ptxas-options=--allow-expensive-optimizations=true -DTORCH_API_INCLUDE_EXTENSION_H '-DPYBIND11_COMPILER_TYPE="_gcc"' '-DPYBIND11_STDLIB="_libstdcpp"' '-DPYBIND11_BUILD_ABI="_cxxabi1011"' -DTORCH_EXTENSION_NAME=_C -D_GLIBCXX_USE_CXX11_ABI=0
FAILED: /tmp/tmprtqst5pb.build-temp/src/kernels/layernorm_kernels.o
/usr/bin/nvcc --generate-dependencies-with-compile --dependency-output /tmp/tmprtqst5pb.build-temp/src/kernels/layernorm_kernels.o.d -I/home/theduc/nunchaku/src -I/home/theduc/nunchaku/third_party/cutlass/include -I/home/theduc/nunchaku/third_party/json/include -I/home/theduc/nunchaku/third_party/mio/include -I/home/theduc/nunchaku/third_party/spdlog/include -I/home/theduc/nunchaku/third_party/Block-Sparse-Attention/csrc/block_sparse_attn -I/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/torch/include -I/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/torch/include/TH -I/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/torch/include/THC -I/home/theduc/.pyenv/versions/3.12.8/include/python3.12 -c -c /home/theduc/nunchaku/src/kernels/layernorm_kernels.cu -o /tmp/tmprtqst5pb.build-temp/src/kernels/layernorm_kernels.o -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_HALF_CONVERSIONS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ --expt-relaxed-constexpr --compiler-options ''"'"'-fPIC'"'"'' -DENABLE_BF16=1 -DBUILD_NUNCHAKU=1 -gencode arch=compute_86,code=sm_86 -gencode arch=compute_89,code=sm_89 -g -std=c++20 -UNDEBUG -Xcudafe --diag_suppress=20208 -U__CUDA_NO_HALF_OPERATORS__ -U__CUDA_NO_HALF_CONVERSIONS__ -U__CUDA_NO_HALF2_OPERATORS__ -U__CUDA_NO_HALF2_CONVERSIONS__ -U__CUDA_NO_BFLOAT16_OPERATORS__ -U__CUDA_NO_BFLOAT16_CONVERSIONS__ -U__CUDA_NO_BFLOAT162_OPERATORS__ -U__CUDA_NO_BFLOAT162_CONVERSIONS__ --threads=2 --expt-relaxed-constexpr --expt-extended-lambda --generate-line-info --ptxas-options=--allow-expensive-optimizations=true -DTORCH_API_INCLUDE_EXTENSION_H '-DPYBIND11_COMPILER_TYPE="_gcc"' '-DPYBIND11_STDLIB="_libstdcpp"' '-DPYBIND11_BUILD_ABI="_cxxabi1011"' -DTORCH_EXTENSION_NAME=_C -D_GLIBCXX_USE_CXX11_ABI=0
/home/theduc/nunchaku/third_party/spdlog/include/spdlog/fmt/bundled/format.h(4449): error: a literal operator template must have a template parameter list equivalent to "<char ...>"
/home/theduc/nunchaku/src/common.h(41): error: call to consteval function "std::source_location::current" did not produce a valid constant expression
/usr/include/c++/12/source_location(59): note #2703-D: cannot call non-constexpr function "__builtin_source_location" (declared implicitly)
/home/theduc/nunchaku/src/common.h(48): error: call to consteval function "std::source_location::current" did not produce a valid constant expression
/usr/include/c++/12/source_location(59): note #2703-D: cannot call non-constexpr function "__builtin_source_location" (declared implicitly)
3 errors detected in the compilation of "/home/theduc/nunchaku/src/kernels/layernorm_kernels.cu".
[4/24] /usr/bin/nvcc --generate-dependencies-with-compile --dependency-output /tmp/tmprtqst5pb.build-temp/src/kernels/awq/gemv_awq.o.d -I/home/theduc/nunchaku/src -I/home/theduc/nunchaku/third_party/cutlass/include -I/home/theduc/nunchaku/third_party/json/include -I/home/theduc/nunchaku/third_party/mio/include -I/home/theduc/nunchaku/third_party/spdlog/include -I/home/theduc/nunchaku/third_party/Block-Sparse-Attention/csrc/block_sparse_attn -I/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/torch/include -I/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/torch/include/TH -I/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/torch/include/THC -I/home/theduc/.pyenv/versions/3.12.8/include/python3.12 -c -c /home/theduc/nunchaku/src/kernels/awq/gemv_awq.cu -o /tmp/tmprtqst5pb.build-temp/src/kernels/awq/gemv_awq.o -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_HALF_CONVERSIONS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ --expt-relaxed-constexpr --compiler-options ''"'"'-fPIC'"'"'' -DENABLE_BF16=1 -DBUILD_NUNCHAKU=1 -gencode arch=compute_86,code=sm_86 -gencode arch=compute_89,code=sm_89 -g -std=c++20 -UNDEBUG -Xcudafe --diag_suppress=20208 -U__CUDA_NO_HALF_OPERATORS__ -U__CUDA_NO_HALF_CONVERSIONS__ -U__CUDA_NO_HALF2_OPERATORS__ -U__CUDA_NO_HALF2_CONVERSIONS__ -U__CUDA_NO_BFLOAT16_OPERATORS__ -U__CUDA_NO_BFLOAT16_CONVERSIONS__ -U__CUDA_NO_BFLOAT162_OPERATORS__ -U__CUDA_NO_BFLOAT162_CONVERSIONS__ --threads=2 --expt-relaxed-constexpr --expt-extended-lambda --generate-line-info --ptxas-options=--allow-expensive-optimizations=true -DTORCH_API_INCLUDE_EXTENSION_H '-DPYBIND11_COMPILER_TYPE="_gcc"' '-DPYBIND11_STDLIB="_libstdcpp"' '-DPYBIND11_BUILD_ABI="_cxxabi1011"' -DTORCH_EXTENSION_NAME=_C -D_GLIBCXX_USE_CXX11_ABI=0
FAILED: /tmp/tmprtqst5pb.build-temp/src/kernels/awq/gemv_awq.o
/usr/bin/nvcc --generate-dependencies-with-compile --dependency-output /tmp/tmprtqst5pb.build-temp/src/kernels/awq/gemv_awq.o.d -I/home/theduc/nunchaku/src -I/home/theduc/nunchaku/third_party/cutlass/include -I/home/theduc/nunchaku/third_party/json/include -I/home/theduc/nunchaku/third_party/mio/include -I/home/theduc/nunchaku/third_party/spdlog/include -I/home/theduc/nunchaku/third_party/Block-Sparse-Attention/csrc/block_sparse_attn -I/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/torch/include -I/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/torch/include/TH -I/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/torch/include/THC -I/home/theduc/.pyenv/versions/3.12.8/include/python3.12 -c -c /home/theduc/nunchaku/src/kernels/awq/gemv_awq.cu -o /tmp/tmprtqst5pb.build-temp/src/kernels/awq/gemv_awq.o -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_HALF_CONVERSIONS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ --expt-relaxed-constexpr --compiler-options ''"'"'-fPIC'"'"'' -DENABLE_BF16=1 -DBUILD_NUNCHAKU=1 -gencode arch=compute_86,code=sm_86 -gencode arch=compute_89,code=sm_89 -g -std=c++20 -UNDEBUG -Xcudafe --diag_suppress=20208 -U__CUDA_NO_HALF_OPERATORS__ -U__CUDA_NO_HALF_CONVERSIONS__ -U__CUDA_NO_HALF2_OPERATORS__ -U__CUDA_NO_HALF2_CONVERSIONS__ -U__CUDA_NO_BFLOAT16_OPERATORS__ -U__CUDA_NO_BFLOAT16_CONVERSIONS__ -U__CUDA_NO_BFLOAT162_OPERATORS__ -U__CUDA_NO_BFLOAT162_CONVERSIONS__ --threads=2 --expt-relaxed-constexpr --expt-extended-lambda --generate-line-info --ptxas-options=--allow-expensive-optimizations=true -DTORCH_API_INCLUDE_EXTENSION_H '-DPYBIND11_COMPILER_TYPE="_gcc"' '-DPYBIND11_STDLIB="_libstdcpp"' '-DPYBIND11_BUILD_ABI="_cxxabi1011"' -DTORCH_EXTENSION_NAME=_C -D_GLIBCXX_USE_CXX11_ABI=0
/home/theduc/nunchaku/third_party/spdlog/include/spdlog/fmt/bundled/format.h(4449): error: a literal operator template must have a template parameter list equivalent to "<char ...>"
/home/theduc/nunchaku/src/common.h(41): error: call to consteval function "std::source_location::current" did not produce a valid constant expression
/usr/include/c++/12/source_location(59): note #2703-D: cannot call non-constexpr function "__builtin_source_location" (declared implicitly)
/home/theduc/nunchaku/src/common.h(48): error: call to consteval function "std::source_location::current" did not produce a valid constant expression
/usr/include/c++/12/source_location(59): note #2703-D: cannot call non-constexpr function "__builtin_source_location" (declared implicitly)
3 errors detected in the compilation of "/home/theduc/nunchaku/src/kernels/awq/gemv_awq.cu".
[5/24] /usr/bin/nvcc --generate-dependencies-with-compile --dependency-output /tmp/tmprtqst5pb.build-temp/src/kernels/gemm_f16.o.d -I/home/theduc/nunchaku/src -I/home/theduc/nunchaku/third_party/cutlass/include -I/home/theduc/nunchaku/third_party/json/include -I/home/theduc/nunchaku/third_party/mio/include -I/home/theduc/nunchaku/third_party/spdlog/include -I/home/theduc/nunchaku/third_party/Block-Sparse-Attention/csrc/block_sparse_attn -I/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/torch/include -I/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/torch/include/TH -I/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/torch/include/THC -I/home/theduc/.pyenv/versions/3.12.8/include/python3.12 -c -c /home/theduc/nunchaku/src/kernels/gemm_f16.cu -o /tmp/tmprtqst5pb.build-temp/src/kernels/gemm_f16.o -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_HALF_CONVERSIONS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ --expt-relaxed-constexpr --compiler-options ''"'"'-fPIC'"'"'' -DENABLE_BF16=1 -DBUILD_NUNCHAKU=1 -gencode arch=compute_86,code=sm_86 -gencode arch=compute_89,code=sm_89 -g -std=c++20 -UNDEBUG -Xcudafe --diag_suppress=20208 -U__CUDA_NO_HALF_OPERATORS__ -U__CUDA_NO_HALF_CONVERSIONS__ -U__CUDA_NO_HALF2_OPERATORS__ -U__CUDA_NO_HALF2_CONVERSIONS__ -U__CUDA_NO_BFLOAT16_OPERATORS__ -U__CUDA_NO_BFLOAT16_CONVERSIONS__ -U__CUDA_NO_BFLOAT162_OPERATORS__ -U__CUDA_NO_BFLOAT162_CONVERSIONS__ --threads=2 --expt-relaxed-constexpr --expt-extended-lambda --generate-line-info --ptxas-options=--allow-expensive-optimizations=true -DTORCH_API_INCLUDE_EXTENSION_H '-DPYBIND11_COMPILER_TYPE="_gcc"' '-DPYBIND11_STDLIB="_libstdcpp"' '-DPYBIND11_BUILD_ABI="_cxxabi1011"' -DTORCH_EXTENSION_NAME=_C -D_GLIBCXX_USE_CXX11_ABI=0
FAILED: /tmp/tmprtqst5pb.build-temp/src/kernels/gemm_f16.o
/usr/bin/nvcc --generate-dependencies-with-compile --dependency-output /tmp/tmprtqst5pb.build-temp/src/kernels/gemm_f16.o.d -I/home/theduc/nunchaku/src -I/home/theduc/nunchaku/third_party/cutlass/include -I/home/theduc/nunchaku/third_party/json/include -I/home/theduc/nunchaku/third_party/mio/include -I/home/theduc/nunchaku/third_party/spdlog/include -I/home/theduc/nunchaku/third_party/Block-Sparse-Attention/csrc/block_sparse_attn -I/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/torch/include -I/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/torch/include/TH -I/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/torch/include/THC -I/home/theduc/.pyenv/versions/3.12.8/include/python3.12 -c -c /home/theduc/nunchaku/src/kernels/gemm_f16.cu -o /tmp/tmprtqst5pb.build-temp/src/kernels/gemm_f16.o -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_HALF_CONVERSIONS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ --expt-relaxed-constexpr --compiler-options ''"'"'-fPIC'"'"'' -DENABLE_BF16=1 -DBUILD_NUNCHAKU=1 -gencode arch=compute_86,code=sm_86 -gencode arch=compute_89,code=sm_89 -g -std=c++20 -UNDEBUG -Xcudafe --diag_suppress=20208 -U__CUDA_NO_HALF_OPERATORS__ -U__CUDA_NO_HALF_CONVERSIONS__ -U__CUDA_NO_HALF2_OPERATORS__ -U__CUDA_NO_HALF2_CONVERSIONS__ -U__CUDA_NO_BFLOAT16_OPERATORS__ -U__CUDA_NO_BFLOAT16_CONVERSIONS__ -U__CUDA_NO_BFLOAT162_OPERATORS__ -U__CUDA_NO_BFLOAT162_CONVERSIONS__ --threads=2 --expt-relaxed-constexpr --expt-extended-lambda --generate-line-info --ptxas-options=--allow-expensive-optimizations=true -DTORCH_API_INCLUDE_EXTENSION_H '-DPYBIND11_COMPILER_TYPE="_gcc"' '-DPYBIND11_STDLIB="_libstdcpp"' '-DPYBIND11_BUILD_ABI="_cxxabi1011"' -DTORCH_EXTENSION_NAME=_C -D_GLIBCXX_USE_CXX11_ABI=0
/home/theduc/nunchaku/third_party/spdlog/include/spdlog/fmt/bundled/format.h(4449): error: a literal operator template must have a template parameter list equivalent to "<char ...>"
/home/theduc/nunchaku/src/common.h(41): error: call to consteval function "std::source_location::current" did not produce a valid constant expression
/usr/include/c++/12/source_location(59): note #2703-D: cannot call non-constexpr function "__builtin_source_location" (declared implicitly)
/home/theduc/nunchaku/src/common.h(48): error: call to consteval function "std::source_location::current" did not produce a valid constant expression
/usr/include/c++/12/source_location(59): note #2703-D: cannot call non-constexpr function "__builtin_source_location" (declared implicitly)
3 errors detected in the compilation of "/home/theduc/nunchaku/src/kernels/gemm_f16.cu".
[6/24] /usr/bin/nvcc --generate-dependencies-with-compile --dependency-output /tmp/tmprtqst5pb.build-temp/src/kernels/gemm_batched.o.d -I/home/theduc/nunchaku/src -I/home/theduc/nunchaku/third_party/cutlass/include -I/home/theduc/nunchaku/third_party/json/include -I/home/theduc/nunchaku/third_party/mio/include -I/home/theduc/nunchaku/third_party/spdlog/include -I/home/theduc/nunchaku/third_party/Block-Sparse-Attention/csrc/block_sparse_attn -I/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/torch/include -I/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/torch/include/TH -I/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/torch/include/THC -I/home/theduc/.pyenv/versions/3.12.8/include/python3.12 -c -c /home/theduc/nunchaku/src/kernels/gemm_batched.cu -o /tmp/tmprtqst5pb.build-temp/src/kernels/gemm_batched.o -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_HALF_CONVERSIONS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ --expt-relaxed-constexpr --compiler-options ''"'"'-fPIC'"'"'' -DENABLE_BF16=1 -DBUILD_NUNCHAKU=1 -gencode arch=compute_86,code=sm_86 -gencode arch=compute_89,code=sm_89 -g -std=c++20 -UNDEBUG -Xcudafe --diag_suppress=20208 -U__CUDA_NO_HALF_OPERATORS__ -U__CUDA_NO_HALF_CONVERSIONS__ -U__CUDA_NO_HALF2_OPERATORS__ -U__CUDA_NO_HALF2_CONVERSIONS__ -U__CUDA_NO_BFLOAT16_OPERATORS__ -U__CUDA_NO_BFLOAT16_CONVERSIONS__ -U__CUDA_NO_BFLOAT162_OPERATORS__ -U__CUDA_NO_BFLOAT162_CONVERSIONS__ --threads=2 --expt-relaxed-constexpr --expt-extended-lambda --generate-line-info --ptxas-options=--allow-expensive-optimizations=true -DTORCH_API_INCLUDE_EXTENSION_H '-DPYBIND11_COMPILER_TYPE="_gcc"' '-DPYBIND11_STDLIB="_libstdcpp"' '-DPYBIND11_BUILD_ABI="_cxxabi1011"' -DTORCH_EXTENSION_NAME=_C -D_GLIBCXX_USE_CXX11_ABI=0
FAILED: /tmp/tmprtqst5pb.build-temp/src/kernels/gemm_batched.o
/usr/bin/nvcc --generate-dependencies-with-compile --dependency-output /tmp/tmprtqst5pb.build-temp/src/kernels/gemm_batched.o.d -I/home/theduc/nunchaku/src -I/home/theduc/nunchaku/third_party/cutlass/include -I/home/theduc/nunchaku/third_party/json/include -I/home/theduc/nunchaku/third_party/mio/include -I/home/theduc/nunchaku/third_party/spdlog/include -I/home/theduc/nunchaku/third_party/Block-Sparse-Attention/csrc/block_sparse_attn -I/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/torch/include -I/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/torch/include/TH -I/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/torch/include/THC -I/home/theduc/.pyenv/versions/3.12.8/include/python3.12 -c -c /home/theduc/nunchaku/src/kernels/gemm_batched.cu -o /tmp/tmprtqst5pb.build-temp/src/kernels/gemm_batched.o -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_HALF_CONVERSIONS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ --expt-relaxed-constexpr --compiler-options ''"'"'-fPIC'"'"'' -DENABLE_BF16=1 -DBUILD_NUNCHAKU=1 -gencode arch=compute_86,code=sm_86 -gencode arch=compute_89,code=sm_89 -g -std=c++20 -UNDEBUG -Xcudafe --diag_suppress=20208 -U__CUDA_NO_HALF_OPERATORS__ -U__CUDA_NO_HALF_CONVERSIONS__ -U__CUDA_NO_HALF2_OPERATORS__ -U__CUDA_NO_HALF2_CONVERSIONS__ -U__CUDA_NO_BFLOAT16_OPERATORS__ -U__CUDA_NO_BFLOAT16_CONVERSIONS__ -U__CUDA_NO_BFLOAT162_OPERATORS__ -U__CUDA_NO_BFLOAT162_CONVERSIONS__ --threads=2 --expt-relaxed-constexpr --expt-extended-lambda --generate-line-info --ptxas-options=--allow-expensive-optimizations=true -DTORCH_API_INCLUDE_EXTENSION_H '-DPYBIND11_COMPILER_TYPE="_gcc"' '-DPYBIND11_STDLIB="_libstdcpp"' '-DPYBIND11_BUILD_ABI="_cxxabi1011"' -DTORCH_EXTENSION_NAME=_C -D_GLIBCXX_USE_CXX11_ABI=0
/home/theduc/nunchaku/third_party/spdlog/include/spdlog/fmt/bundled/format.h(4449): error: a literal operator template must have a template parameter list equivalent to "<char ...>"
/home/theduc/nunchaku/src/common.h(41): error: call to consteval function "std::source_location::current" did not produce a valid constant expression
/usr/include/c++/12/source_location(59): note #2703-D: cannot call non-constexpr function "__builtin_source_location" (declared implicitly)
/home/theduc/nunchaku/src/common.h(48): error: call to consteval function "std::source_location::current" did not produce a valid constant expression
/usr/include/c++/12/source_location(59): note #2703-D: cannot call non-constexpr function "__builtin_source_location" (declared implicitly)
3 errors detected in the compilation of "/home/theduc/nunchaku/src/kernels/gemm_batched.cu".
[7/24] /usr/bin/nvcc --generate-dependencies-with-compile --dependency-output /tmp/tmprtqst5pb.build-temp/src/kernels/gemm_w4a4.o.d -I/home/theduc/nunchaku/src -I/home/theduc/nunchaku/third_party/cutlass/include -I/home/theduc/nunchaku/third_party/json/include -I/home/theduc/nunchaku/third_party/mio/include -I/home/theduc/nunchaku/third_party/spdlog/include -I/home/theduc/nunchaku/third_party/Block-Sparse-Attention/csrc/block_sparse_attn -I/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/torch/include -I/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/torch/include/TH -I/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/torch/include/THC -I/home/theduc/.pyenv/versions/3.12.8/include/python3.12 -c -c /home/theduc/nunchaku/src/kernels/gemm_w4a4.cu -o /tmp/tmprtqst5pb.build-temp/src/kernels/gemm_w4a4.o -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_HALF_CONVERSIONS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ --expt-relaxed-constexpr --compiler-options ''"'"'-fPIC'"'"'' -DENABLE_BF16=1 -DBUILD_NUNCHAKU=1 -gencode arch=compute_86,code=sm_86 -gencode arch=compute_89,code=sm_89 -g -std=c++20 -UNDEBUG -Xcudafe --diag_suppress=20208 -U__CUDA_NO_HALF_OPERATORS__ -U__CUDA_NO_HALF_CONVERSIONS__ -U__CUDA_NO_HALF2_OPERATORS__ -U__CUDA_NO_HALF2_CONVERSIONS__ -U__CUDA_NO_BFLOAT16_OPERATORS__ -U__CUDA_NO_BFLOAT16_CONVERSIONS__ -U__CUDA_NO_BFLOAT162_OPERATORS__ -U__CUDA_NO_BFLOAT162_CONVERSIONS__ --threads=2 --expt-relaxed-constexpr --expt-extended-lambda --generate-line-info --ptxas-options=--allow-expensive-optimizations=true -DTORCH_API_INCLUDE_EXTENSION_H '-DPYBIND11_COMPILER_TYPE="_gcc"' '-DPYBIND11_STDLIB="_libstdcpp"' '-DPYBIND11_BUILD_ABI="_cxxabi1011"' -DTORCH_EXTENSION_NAME=_C -D_GLIBCXX_USE_CXX11_ABI=0
FAILED: /tmp/tmprtqst5pb.build-temp/src/kernels/gemm_w4a4.o
/usr/bin/nvcc --generate-dependencies-with-compile --dependency-output /tmp/tmprtqst5pb.build-temp/src/kernels/gemm_w4a4.o.d -I/home/theduc/nunchaku/src -I/home/theduc/nunchaku/third_party/cutlass/include -I/home/theduc/nunchaku/third_party/json/include -I/home/theduc/nunchaku/third_party/mio/include -I/home/theduc/nunchaku/third_party/spdlog/include -I/home/theduc/nunchaku/third_party/Block-Sparse-Attention/csrc/block_sparse_attn -I/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/torch/include -I/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/torch/include/TH -I/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/torch/include/THC -I/home/theduc/.pyenv/versions/3.12.8/include/python3.12 -c -c /home/theduc/nunchaku/src/kernels/gemm_w4a4.cu -o /tmp/tmprtqst5pb.build-temp/src/kernels/gemm_w4a4.o -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_HALF_CONVERSIONS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ --expt-relaxed-constexpr --compiler-options ''"'"'-fPIC'"'"'' -DENABLE_BF16=1 -DBUILD_NUNCHAKU=1 -gencode arch=compute_86,code=sm_86 -gencode arch=compute_89,code=sm_89 -g -std=c++20 -UNDEBUG -Xcudafe --diag_suppress=20208 -U__CUDA_NO_HALF_OPERATORS__ -U__CUDA_NO_HALF_CONVERSIONS__ -U__CUDA_NO_HALF2_OPERATORS__ -U__CUDA_NO_HALF2_CONVERSIONS__ -U__CUDA_NO_BFLOAT16_OPERATORS__ -U__CUDA_NO_BFLOAT16_CONVERSIONS__ -U__CUDA_NO_BFLOAT162_OPERATORS__ -U__CUDA_NO_BFLOAT162_CONVERSIONS__ --threads=2 --expt-relaxed-constexpr --expt-extended-lambda --generate-line-info --ptxas-options=--allow-expensive-optimizations=true -DTORCH_API_INCLUDE_EXTENSION_H '-DPYBIND11_COMPILER_TYPE="_gcc"' '-DPYBIND11_STDLIB="_libstdcpp"' '-DPYBIND11_BUILD_ABI="_cxxabi1011"' -DTORCH_EXTENSION_NAME=_C -D_GLIBCXX_USE_CXX11_ABI=0
/home/theduc/nunchaku/third_party/spdlog/include/spdlog/fmt/bundled/format.h(4449): error: a literal operator template must have a template parameter list equivalent to "<char ...>"
/home/theduc/nunchaku/src/common.h(41): error: call to consteval function "std::source_location::current" did not produce a valid constant expression
/usr/include/c++/12/source_location(59): note #2703-D: cannot call non-constexpr function "__builtin_source_location" (declared implicitly)
/home/theduc/nunchaku/src/common.h(48): error: call to consteval function "std::source_location::current" did not produce a valid constant expression
/usr/include/c++/12/source_location(59): note #2703-D: cannot call non-constexpr function "__builtin_source_location" (declared implicitly)
/home/theduc/nunchaku/src/kernels/gemm_w4a4.cu(3063): warning #549-D: variable "epilogueArgs" is used before its value is set
Remark: The warnings can be suppressed with "-diag-suppress <warning-number>"
3 errors detected in the compilation of "/home/theduc/nunchaku/src/kernels/gemm_w4a4.cu".
[8/24] c++ -MMD -MF /tmp/tmprtqst5pb.build-temp/src/layernorm.o.d -fno-strict-overflow -Wsign-compare -DNDEBUG -g -O3 -Wall -fPIC -I/home/theduc/nunchaku/src -I/home/theduc/nunchaku/third_party/cutlass/include -I/home/theduc/nunchaku/third_party/json/include -I/home/theduc/nunchaku/third_party/mio/include -I/home/theduc/nunchaku/third_party/spdlog/include -I/home/theduc/nunchaku/third_party/Block-Sparse-Attention/csrc/block_sparse_attn -I/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/torch/include -I/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/torch/include/TH -I/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/torch/include/THC -I/home/theduc/.pyenv/versions/3.12.8/include/python3.12 -c -c /home/theduc/nunchaku/src/layernorm.cpp -o /tmp/tmprtqst5pb.build-temp/src/layernorm.o -DENABLE_BF16=1 -DBUILD_NUNCHAKU=1 -fvisibility=hidden -g -std=c++20 -UNDEBUG -Og -DTORCH_API_INCLUDE_EXTENSION_H '-DPYBIND11_COMPILER_TYPE="_gcc"' '-DPYBIND11_STDLIB="_libstdcpp"' '-DPYBIND11_BUILD_ABI="_cxxabi1011"' -DTORCH_EXTENSION_NAME=_C -D_GLIBCXX_USE_CXX11_ABI=0
[9/24] c++ -MMD -MF /tmp/tmprtqst5pb.build-temp/src/activation.o.d -fno-strict-overflow -Wsign-compare -DNDEBUG -g -O3 -Wall -fPIC -I/home/theduc/nunchaku/src -I/home/theduc/nunchaku/third_party/cutlass/include -I/home/theduc/nunchaku/third_party/json/include -I/home/theduc/nunchaku/third_party/mio/include -I/home/theduc/nunchaku/third_party/spdlog/include -I/home/theduc/nunchaku/third_party/Block-Sparse-Attention/csrc/block_sparse_attn -I/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/torch/include -I/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/torch/include/TH -I/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/torch/include/THC -I/home/theduc/.pyenv/versions/3.12.8/include/python3.12 -c -c /home/theduc/nunchaku/src/activation.cpp -o /tmp/tmprtqst5pb.build-temp/src/activation.o -DENABLE_BF16=1 -DBUILD_NUNCHAKU=1 -fvisibility=hidden -g -std=c++20 -UNDEBUG -Og -DTORCH_API_INCLUDE_EXTENSION_H '-DPYBIND11_COMPILER_TYPE="_gcc"' '-DPYBIND11_STDLIB="_libstdcpp"' '-DPYBIND11_BUILD_ABI="_cxxabi1011"' -DTORCH_EXTENSION_NAME=_C -D_GLIBCXX_USE_CXX11_ABI=0
[10/24] c++ -MMD -MF /tmp/tmprtqst5pb.build-temp/src/Linear.o.d -fno-strict-overflow -Wsign-compare -DNDEBUG -g -O3 -Wall -fPIC -I/home/theduc/nunchaku/src -I/home/theduc/nunchaku/third_party/cutlass/include -I/home/theduc/nunchaku/third_party/json/include -I/home/theduc/nunchaku/third_party/mio/include -I/home/theduc/nunchaku/third_party/spdlog/include -I/home/theduc/nunchaku/third_party/Block-Sparse-Attention/csrc/block_sparse_attn -I/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/torch/include -I/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/torch/include/TH -I/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/torch/include/THC -I/home/theduc/.pyenv/versions/3.12.8/include/python3.12 -c -c /home/theduc/nunchaku/src/Linear.cpp -o /tmp/tmprtqst5pb.build-temp/src/Linear.o -DENABLE_BF16=1 -DBUILD_NUNCHAKU=1 -fvisibility=hidden -g -std=c++20 -UNDEBUG -Og -DTORCH_API_INCLUDE_EXTENSION_H '-DPYBIND11_COMPILER_TYPE="_gcc"' '-DPYBIND11_STDLIB="_libstdcpp"' '-DPYBIND11_BUILD_ABI="_cxxabi1011"' -DTORCH_EXTENSION_NAME=_C -D_GLIBCXX_USE_CXX11_ABI=0
[11/24] c++ -MMD -MF /tmp/tmprtqst5pb.build-temp/src/FluxModel.o.d -fno-strict-overflow -Wsign-compare -DNDEBUG -g -O3 -Wall -fPIC -I/home/theduc/nunchaku/src -I/home/theduc/nunchaku/third_party/cutlass/include -I/home/theduc/nunchaku/third_party/json/include -I/home/theduc/nunchaku/third_party/mio/include -I/home/theduc/nunchaku/third_party/spdlog/include -I/home/theduc/nunchaku/third_party/Block-Sparse-Attention/csrc/block_sparse_attn -I/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/torch/include -I/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/torch/include/TH -I/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/torch/include/THC -I/home/theduc/.pyenv/versions/3.12.8/include/python3.12 -c -c /home/theduc/nunchaku/src/FluxModel.cpp -o /tmp/tmprtqst5pb.build-temp/src/FluxModel.o -DENABLE_BF16=1 -DBUILD_NUNCHAKU=1 -fvisibility=hidden -g -std=c++20 -UNDEBUG -Og -DTORCH_API_INCLUDE_EXTENSION_H '-DPYBIND11_COMPILER_TYPE="_gcc"' '-DPYBIND11_STDLIB="_libstdcpp"' '-DPYBIND11_BUILD_ABI="_cxxabi1011"' -DTORCH_EXTENSION_NAME=_C -D_GLIBCXX_USE_CXX11_ABI=0
[12/24] c++ -MMD -MF /tmp/tmprtqst5pb.build-temp/src/Serialization.o.d -fno-strict-overflow -Wsign-compare -DNDEBUG -g -O3 -Wall -fPIC -I/home/theduc/nunchaku/src -I/home/theduc/nunchaku/third_party/cutlass/include -I/home/theduc/nunchaku/third_party/json/include -I/home/theduc/nunchaku/third_party/mio/include -I/home/theduc/nunchaku/third_party/spdlog/include -I/home/theduc/nunchaku/third_party/Block-Sparse-Attention/csrc/block_sparse_attn -I/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/torch/include -I/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/torch/include/TH -I/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/torch/include/THC -I/home/theduc/.pyenv/versions/3.12.8/include/python3.12 -c -c /home/theduc/nunchaku/src/Serialization.cpp -o /tmp/tmprtqst5pb.build-temp/src/Serialization.o -DENABLE_BF16=1 -DBUILD_NUNCHAKU=1 -fvisibility=hidden -g -std=c++20 -UNDEBUG -Og -DTORCH_API_INCLUDE_EXTENSION_H '-DPYBIND11_COMPILER_TYPE="_gcc"' '-DPYBIND11_STDLIB="_libstdcpp"' '-DPYBIND11_BUILD_ABI="_cxxabi1011"' -DTORCH_EXTENSION_NAME=_C -D_GLIBCXX_USE_CXX11_ABI=0
[13/24] c++ -MMD -MF /tmp/tmprtqst5pb.build-temp/src/interop/torch.o.d -fno-strict-overflow -Wsign-compare -DNDEBUG -g -O3 -Wall -fPIC -I/home/theduc/nunchaku/src -I/home/theduc/nunchaku/third_party/cutlass/include -I/home/theduc/nunchaku/third_party/json/include -I/home/theduc/nunchaku/third_party/mio/include -I/home/theduc/nunchaku/third_party/spdlog/include -I/home/theduc/nunchaku/third_party/Block-Sparse-Attention/csrc/block_sparse_attn -I/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/torch/include -I/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/torch/include/TH -I/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/torch/include/THC -I/home/theduc/.pyenv/versions/3.12.8/include/python3.12 -c -c /home/theduc/nunchaku/src/interop/torch.cpp -o /tmp/tmprtqst5pb.build-temp/src/interop/torch.o -DENABLE_BF16=1 -DBUILD_NUNCHAKU=1 -fvisibility=hidden -g -std=c++20 -UNDEBUG -Og -DTORCH_API_INCLUDE_EXTENSION_H '-DPYBIND11_COMPILER_TYPE="_gcc"' '-DPYBIND11_STDLIB="_libstdcpp"' '-DPYBIND11_BUILD_ABI="_cxxabi1011"' -DTORCH_EXTENSION_NAME=_C -D_GLIBCXX_USE_CXX11_ABI=0
[14/24] c++ -MMD -MF /tmp/tmprtqst5pb.build-temp/nunchaku/csrc/pybind.o.d -fno-strict-overflow -Wsign-compare -DNDEBUG -g -O3 -Wall -fPIC -I/home/theduc/nunchaku/src -I/home/theduc/nunchaku/third_party/cutlass/include -I/home/theduc/nunchaku/third_party/json/include -I/home/theduc/nunchaku/third_party/mio/include -I/home/theduc/nunchaku/third_party/spdlog/include -I/home/theduc/nunchaku/third_party/Block-Sparse-Attention/csrc/block_sparse_attn -I/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/torch/include -I/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/torch/include/TH -I/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/torch/include/THC -I/home/theduc/.pyenv/versions/3.12.8/include/python3.12 -c -c /home/theduc/nunchaku/nunchaku/csrc/pybind.cpp -o /tmp/tmprtqst5pb.build-temp/nunchaku/csrc/pybind.o -DENABLE_BF16=1 -DBUILD_NUNCHAKU=1 -fvisibility=hidden -g -std=c++20 -UNDEBUG -Og -DTORCH_API_INCLUDE_EXTENSION_H '-DPYBIND11_COMPILER_TYPE="_gcc"' '-DPYBIND11_STDLIB="_libstdcpp"' '-DPYBIND11_BUILD_ABI="_cxxabi1011"' -DTORCH_EXTENSION_NAME=_C -D_GLIBCXX_USE_CXX11_ABI=0
In file included from /usr/include/c++/13/cassert:44,
from /home/theduc/nunchaku/third_party/spdlog/include/spdlog/details/circular_q.h:7,
from /home/theduc/nunchaku/third_party/spdlog/include/spdlog/details/backtracer.h:6,
from /home/theduc/nunchaku/third_party/spdlog/include/spdlog/logger.h:18,
from /home/theduc/nunchaku/third_party/spdlog/include/spdlog/details/registry-inl.h:12,
from /home/theduc/nunchaku/third_party/spdlog/include/spdlog/details/registry.h:128,
from /home/theduc/nunchaku/third_party/spdlog/include/spdlog/spdlog.h:13,
from /home/theduc/nunchaku/src/common.h:23,
from /home/theduc/nunchaku/src/interop/torch.h:5,
from /home/theduc/nunchaku/nunchaku/csrc/gemm.h:3,
from /home/theduc/nunchaku/nunchaku/csrc/pybind.cpp:1:
/home/theduc/nunchaku/nunchaku/csrc/gemm.h: In member function ‘std::string QuantizedGEMM::dumpTensorINT4(Tensor)’:
/home/theduc/nunchaku/nunchaku/csrc/gemm.h:89:43: warning: comparison of integer expressions of different signedness: ‘int’ and ‘size_t’ {aka ‘long unsigned int’} [-Wsign-compare]
89 | assert(offset + i < x.numel() / 4);
| ~~~~~~~~~~~^~~~~~~~~~~~~~~
In file included from /home/theduc/nunchaku/nunchaku/csrc/pybind.cpp:2:
/home/theduc/nunchaku/nunchaku/csrc/flux.h: In lambda function:
/home/theduc/nunchaku/nunchaku/csrc/flux.h:181:48: warning: comparison of integer expressions of different signedness: ‘int’ and ‘std::vector<float, std::allocator<float> >::size_type’ {aka ‘long unsigned int’} [-Wsign-compare]
181 | for (int i = skipRanks / 16; i < m->lora_scales.size(); i++) {
| ~~^~~~~~~~~~~~~~~~~~~~~~~
ninja: build stopped: subcommand failed.
Traceback (most recent call last):
File "/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/torch/utils/cpp_extension.py", line 2104, in _run_ninja_build
subprocess.run(
File "/home/theduc/.pyenv/versions/3.12.8/lib/python3.12/subprocess.py", line 571, in run
raise CalledProcessError(retcode, process.args,
subprocess.CalledProcessError: Command '['ninja', '-v']' returned non-zero exit status 1.
The above exception was the direct cause of the following exception:
Traceback (most recent call last):
File "/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/setuptools/command/editable_wheel.py", line 139, in run
self._create_wheel_file(bdist_wheel)
File "/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/setuptools/command/editable_wheel.py", line 340, in _create_wheel_file
files, mapping = self._run_build_commands(dist_name, unpacked, lib, tmp)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/setuptools/command/editable_wheel.py", line 263, in _run_build_commands
self._run_build_subcommands()
File "/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/setuptools/command/editable_wheel.py", line 290, in _run_build_subcommands
self.run_command(name)
File "/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/setuptools/_distutils/cmd.py", line 316, in run_command
self.distribution.run_command(command)
File "/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/setuptools/dist.py", line 995, in run_command
super().run_command(command)
File "/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/setuptools/_distutils/dist.py", line 973, in run_command
cmd_obj.run()
File "/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/setuptools/command/build_ext.py", line 99, in run
_build_ext.run(self)
File "/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/setuptools/_distutils/command/build_ext.py", line 359, in run
self.build_extensions()
File "<string>", line 18, in build_extensions
File "/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/torch/utils/cpp_extension.py", line 868, in build_extensions
build_ext.build_extensions(self)
File "/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/setuptools/_distutils/command/build_ext.py", line 476, in build_extensions
self._build_extensions_serial()
File "/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/setuptools/_distutils/command/build_ext.py", line 502, in _build_extensions_serial
self.build_extension(ext)
File "/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/setuptools/command/build_ext.py", line 264, in build_extension
_build_ext.build_extension(self, ext)
File "/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/setuptools/_distutils/command/build_ext.py", line 557, in build_extension
objects = self.compiler.compile(
^^^^^^^^^^^^^^^^^^^^^^
File "/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/torch/utils/cpp_extension.py", line 681, in unix_wrap_ninja_compile
_write_ninja_file_and_compile_objects(
File "/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/torch/utils/cpp_extension.py", line 1784, in _write_ninja_file_and_compile_objects
_run_ninja_build(
File "/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/torch/utils/cpp_extension.py", line 2120, in _run_ninja_build
raise RuntimeError(message) from e
RuntimeError: Error compiling objects for extension
/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/setuptools/_distutils/dist.py:973: _DebuggingTips: Problem in editable installation.
!!
********************************************************************************
An error happened while installing `nunchaku` in editable mode.
The following steps are recommended to help debug this problem:
- Try to install the project normally, without using the editable mode.
Does the error still persist?
(If it does, try fixing the problem before attempting the editable mode).
- If you are using binary extensions, make sure you have all OS-level
dependencies installed (e.g. compilers, toolchains, binary libraries, ...).
- Try the latest version of setuptools (maybe the error was already fixed).
- If you (or your project dependencies) are using any setuptools extension
or customization, make sure they support the editable mode.
After following the steps above, if the problem still persists and
you think this is related to how setuptools handles editable installations,
please submit a reproducible example
(see https://stackoverflow.com/help/minimal-reproducible-example) to:
https://github.com/pypa/setuptools/issues
See https://setuptools.pypa.io/en/latest/userguide/development_mode.html for details.
********************************************************************************
!!
cmd_obj.run()
Traceback (most recent call last):
File "/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/torch/utils/cpp_extension.py", line 2104, in _run_ninja_build
subprocess.run(
File "/home/theduc/.pyenv/versions/3.12.8/lib/python3.12/subprocess.py", line 571, in run
raise CalledProcessError(retcode, process.args,
subprocess.CalledProcessError: Command '['ninja', '-v']' returned non-zero exit status 1.
The above exception was the direct cause of the following exception:
Traceback (most recent call last):
File "/home/theduc/.pyenv/versions/3.12.8/lib/python3.12/site-packages/pip/_vendor/pyproject_hooks/_in_process/_in_process.py", line 353, in <module>
main()
File "/home/theduc/.pyenv/versions/3.12.8/lib/python3.12/site-packages/pip/_vendor/pyproject_hooks/_in_process/_in_process.py", line 335, in main
json_out['return_val'] = hook(**hook_input['kwargs'])
^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/home/theduc/.pyenv/versions/3.12.8/lib/python3.12/site-packages/pip/_vendor/pyproject_hooks/_in_process/_in_process.py", line 273, in build_editable
return hook(wheel_directory, config_settings, metadata_directory)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/setuptools/build_meta.py", line 476, in build_editable
return self._build_with_temp_dir(
^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/setuptools/build_meta.py", line 407, in _build_with_temp_dir
self.run_setup()
File "/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/setuptools/build_meta.py", line 320, in run_setup
exec(code, locals())
File "<string>", line 115, in <module>
File "/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/setuptools/__init__.py", line 117, in setup
return distutils.core.setup(**attrs)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/setuptools/_distutils/core.py", line 183, in setup
return run_commands(dist)
^^^^^^^^^^^^^^^^^^
File "/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/setuptools/_distutils/core.py", line 199, in run_commands
dist.run_commands()
File "/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/setuptools/_distutils/dist.py", line 954, in run_commands
self.run_command(cmd)
File "/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/setuptools/dist.py", line 995, in run_command
super().run_command(command)
File "/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/setuptools/_distutils/dist.py", line 973, in run_command
cmd_obj.run()
File "/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/setuptools/command/editable_wheel.py", line 139, in run
self._create_wheel_file(bdist_wheel)
File "/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/setuptools/command/editable_wheel.py", line 340, in _create_wheel_file
files, mapping = self._run_build_commands(dist_name, unpacked, lib, tmp)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/setuptools/command/editable_wheel.py", line 263, in _run_build_commands
self._run_build_subcommands()
File "/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/setuptools/command/editable_wheel.py", line 290, in _run_build_subcommands
self.run_command(name)
File "/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/setuptools/_distutils/cmd.py", line 316, in run_command
self.distribution.run_command(command)
File "/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/setuptools/dist.py", line 995, in run_command
super().run_command(command)
File "/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/setuptools/_distutils/dist.py", line 973, in run_command
cmd_obj.run()
File "/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/setuptools/command/build_ext.py", line 99, in run
_build_ext.run(self)
File "/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/setuptools/_distutils/command/build_ext.py", line 359, in run
self.build_extensions()
File "<string>", line 18, in build_extensions
File "/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/torch/utils/cpp_extension.py", line 868, in build_extensions
build_ext.build_extensions(self)
File "/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/setuptools/_distutils/command/build_ext.py", line 476, in build_extensions
self._build_extensions_serial()
File "/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/setuptools/_distutils/command/build_ext.py", line 502, in _build_extensions_serial
self.build_extension(ext)
File "/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/setuptools/command/build_ext.py", line 264, in build_extension
_build_ext.build_extension(self, ext)
File "/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/setuptools/_distutils/command/build_ext.py", line 557, in build_extension
objects = self.compiler.compile(
^^^^^^^^^^^^^^^^^^^^^^
File "/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/torch/utils/cpp_extension.py", line 681, in unix_wrap_ninja_compile
_write_ninja_file_and_compile_objects(
File "/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/torch/utils/cpp_extension.py", line 1784, in _write_ninja_file_and_compile_objects
_run_ninja_build(
File "/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/torch/utils/cpp_extension.py", line 2120, in _run_ninja_build
raise RuntimeError(message) from e
RuntimeError: Error compiling objects for extension
[end of output]
note: This error originates from a subprocess, and is likely not a problem with pip.
ERROR: Failed building editable for nunchaku
Failed to build nunchaku
ERROR: ERROR: Failed to build installable wheels for some pyproject.toml based projects (nunchaku)`
The text was updated successfully, but these errors were encountered:
`here is the message, looking for using it with comfyui
Building wheels for collected packages: nunchaku
Building editable for nunchaku (pyproject.toml) ... error
error: subprocess-exited-with-error
× Building editable for nunchaku (pyproject.toml) did not run successfully.
│ exit code: 1
╰─> [325 lines of output]
/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/torch/subclasses/functional_tensor.py:295: UserWarning: Failed to initialize NumPy: No module named 'numpy' (Triggered internally at ../torch/csrc/utils/tensor_numpy.cpp:84.)
cpu = conversion_method_template(device=torch.device("cpu"))
/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/setuptools/dist.py:330: InformationOnly: Normalizing '0.0.2beta0' to '0.0.2b0'
self.metadata.version = self.normalize_version(self.metadata.version)
running editable_wheel
creating /tmp/pip-wheel-hu865_m8/.tmp-b16zse7t/nunchaku.egg-info
writing /tmp/pip-wheel-hu865_m8/.tmp-b16zse7t/nunchaku.egg-info/PKG-INFO
writing dependency_links to /tmp/pip-wheel-hu865_m8/.tmp-b16zse7t/nunchaku.egg-info/dependency_links.txt
writing requirements to /tmp/pip-wheel-hu865_m8/.tmp-b16zse7t/nunchaku.egg-info/requires.txt
writing top-level names to /tmp/pip-wheel-hu865_m8/.tmp-b16zse7t/nunchaku.egg-info/top_level.txt
writing manifest file '/tmp/pip-wheel-hu865_m8/.tmp-b16zse7t/nunchaku.egg-info/SOURCES.txt'
reading manifest file '/tmp/pip-wheel-hu865_m8/.tmp-b16zse7t/nunchaku.egg-info/SOURCES.txt'
reading manifest template 'MANIFEST.in'
warning: no files found matching '.hpp' under directory 'src'
warning: no files found matching '.ipp' under directory 'src'
warning: no files found matching '.hpp' under directory 'nunchaku/csrc'
warning: no files found matching '.ipp' under directory 'nunchaku/csrc'
warning: no files found matching '.cu' under directory 'nunchaku/csrc'
warning: no files found matching '.cuh' under directory 'nunchaku/csrc'
warning: no files found matching '.hpp' under directory 'third_party/Block-Sparse-Attention/csrc/block_sparse_attn'
warning: no files found matching '.ipp' under directory 'third_party/Block-Sparse-Attention/csrc/block_sparse_attn'
warning: no files found matching '.cpp' under directory 'third_party/cutlass/include'
warning: no files found matching '.ipp' under directory 'third_party/cutlass/include'
warning: no files found matching '.cu' under directory 'third_party/cutlass/include'
warning: no files found matching '.cuh' under directory 'third_party/cutlass/include'
warning: no files found matching '.cpp' under directory 'third_party/json/include'
warning: no files found matching '.h' under directory 'third_party/json/include'
warning: no files found matching '.ipp' under directory 'third_party/json/include'
warning: no files found matching '.cpp' under directory 'third_party/mio/include'
warning: no files found matching '.h' under directory 'third_party/mio/include'
warning: no files found matching '.cpp' under directory 'third_party/spdlog/include'
warning: no files found matching '.hpp' under directory 'third_party/spdlog/include'
warning: no files found matching '.ipp' under directory 'third_party/spdlog/include'
adding license file 'LICENCE.txt'
writing manifest file '/tmp/pip-wheel-hu865_m8/.tmp-b16zse7t/nunchaku.egg-info/SOURCES.txt'
creating '/tmp/pip-wheel-hu865_m8/.tmp-b16zse7t/nunchaku-0.0.2b0.dist-info'
creating /tmp/pip-wheel-hu865_m8/.tmp-b16zse7t/nunchaku-0.0.2b0.dist-info/WHEEL
running build_py
running build_ext
/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/torch/utils/cpp_extension.py:416: UserWarning: The detected CUDA version (12.0) has a minor version mismatch with the version that was used to compile PyTorch (12.4). Most likely this shouldn't be a problem.
warnings.warn(CUDA_MISMATCH_WARN.format(cuda_str_version, torch.version.cuda))
/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/torch/utils/cpp_extension.py:426: UserWarning: There are no g++ version bounds defined for CUDA version 12.0
warnings.warn(f'There are no {compiler_name} version bounds defined for CUDA version {cuda_str_version}')
building 'nunchaku.C' extension
creating /tmp/tmprtqst5pb.build-temp/nunchaku/csrc
creating /tmp/tmprtqst5pb.build-temp/src
creating /tmp/tmprtqst5pb.build-temp/src/interop
creating /tmp/tmprtqst5pb.build-temp/src/kernels
creating /tmp/tmprtqst5pb.build-temp/src/kernels/awq
creating /tmp/tmprtqst5pb.build-temp/third_party/Block-Sparse-Attention/csrc/block_sparse_attn
creating /tmp/tmprtqst5pb.build-temp/third_party/Block-Sparse-Attention/csrc/block_sparse_attn/src
Emitting ninja build file /tmp/tmprtqst5pb.build-temp/build.ninja...
Compiling objects...
Allowing ninja to set a default number of workers... (overridable by setting the environment variable MAX_JOBS=N)
[1/24] /usr/bin/nvcc --generate-dependencies-with-compile --dependency-output /tmp/tmprtqst5pb.build-temp/src/kernels/activation_kernels.o.d -I/home/theduc/nunchaku/src -I/home/theduc/nunchaku/third_party/cutlass/include -I/home/theduc/nunchaku/third_party/json/include -I/home/theduc/nunchaku/third_party/mio/include -I/home/theduc/nunchaku/third_party/spdlog/include -I/home/theduc/nunchaku/third_party/Block-Sparse-Attention/csrc/block_sparse_attn -I/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/torch/include -I/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/torch/include/TH -I/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/torch/include/THC -I/home/theduc/.pyenv/versions/3.12.8/include/python3.12 -c -c /home/theduc/nunchaku/src/kernels/activation_kernels.cu -o /tmp/tmprtqst5pb.build-temp/src/kernels/activation_kernels.o -D__CUDA_NO_HALF_OPERATORS -D__CUDA_NO_HALF_CONVERSIONS -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ --expt-relaxed-constexpr --compiler-options ''"'"'-fPIC'"'"'' -DENABLE_BF16=1 -DBUILD_NUNCHAKU=1 -gencode arch=compute_86,code=sm_86 -gencode arch=compute_89,code=sm_89 -g -std=c++20 -UNDEBUG -Xcudafe --diag_suppress=20208 -U__CUDA_NO_HALF_OPERATORS__ -U__CUDA_NO_HALF_CONVERSIONS__ -U__CUDA_NO_HALF2_OPERATORS__ -U__CUDA_NO_HALF2_CONVERSIONS__ -U__CUDA_NO_BFLOAT16_OPERATORS__ -U__CUDA_NO_BFLOAT16_CONVERSIONS__ -U__CUDA_NO_BFLOAT162_OPERATORS__ -U__CUDA_NO_BFLOAT162_CONVERSIONS__ --threads=2 --expt-relaxed-constexpr --expt-extended-lambda --generate-line-info --ptxas-options=--allow-expensive-optimizations=true -DTORCH_API_INCLUDE_EXTENSION_H '-DPYBIND11_COMPILER_TYPE="gcc"' '-DPYBIND11_STDLIB="libstdcpp"' '-DPYBIND11_BUILD_ABI="cxxabi1011"' -DTORCH_EXTENSION_NAME=C -D_GLIBCXX_USE_CXX11_ABI=0
FAILED: /tmp/tmprtqst5pb.build-temp/src/kernels/activation_kernels.o
/usr/bin/nvcc --generate-dependencies-with-compile --dependency-output /tmp/tmprtqst5pb.build-temp/src/kernels/activation_kernels.o.d -I/home/theduc/nunchaku/src -I/home/theduc/nunchaku/third_party/cutlass/include -I/home/theduc/nunchaku/third_party/json/include -I/home/theduc/nunchaku/third_party/mio/include -I/home/theduc/nunchaku/third_party/spdlog/include -I/home/theduc/nunchaku/third_party/Block-Sparse-Attention/csrc/block_sparse_attn -I/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/torch/include -I/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/torch/include/TH -I/tmp/pip-build-env-o540kg68/overlay/lib/python3.12/site-packages/torch/include/THC -I/home/theduc/.pyenv/versions/3.12.8/include/python3.12 -c -c /home/theduc/nunchaku/src/kernels/activation_kernels.cu -o /tmp/tmprtqst5pb.build-temp/src/kernels/activation_kernels.o -D__CUDA_NO_HALF_OPERATORS -D__CUDA_NO_HALF_CONVERSIONS -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ --expt-relaxed-constexpr --compiler-options ''"'"'-fPIC'"'"'' -DENABLE_BF16=1 -DBUILD_NUNCHAKU=1 -gencode arch=compute_86,code=sm_86 -gencode arch=compute_89,code=sm_89 -g -std=c++20 -UNDEBUG -Xcudafe --diag_suppress=20208 -U__CUDA_NO_HALF_OPERATORS__ -U__CUDA_NO_HALF_CONVERSIONS__ -U__CUDA_NO_HALF2_OPERATORS__ -U__CUDA_NO_HALF2_CONVERSIONS__ -U__CUDA_NO_BFLOAT16_OPERATORS__ -U__CUDA_NO_BFLOAT16_CONVERSIONS__ -U__CUDA_NO_BFLOAT162_OPERATORS__ -U__CUDA_NO_BFLOAT162_CONVERSIONS__ --threads=2 --expt-relaxed-constexpr --expt-extended-lambda --generate-line-info --ptxas-options=--allow-expensive-optimizations=true -DTORCH_API_INCLUDE_EXTENSION_H '-DPYBIND11_COMPILER_TYPE="_gcc"' '-DPYBIND11_STDLIB="_libstdcpp"' '-DPYBIND11_BUILD_ABI="_cxxabi1011"' -DTORCH_EXTENSION_NAME=_C -D_GLIBCXX_USE_CXX11_ABI=0
/home/theduc/nunchaku/third_party/spdlog/include/spdlog/fmt/bundled/format.h(4449): error: a literal operator template must have a template parameter list equivalent to "<char ...>"
note: This error originates from a subprocess, and is likely not a problem with pip.
ERROR: Failed building editable for nunchaku
Failed to build nunchaku
ERROR: ERROR: Failed to build installable wheels for some pyproject.toml based projects (nunchaku)`
The text was updated successfully, but these errors were encountered: