Issues: ggml-org/llama.cpp
Issues list
Misc. bug: add tool_calls id in response in server
bug-unconfirmed
#11992
opened Feb 21, 2025 by
henryclw
Feature Request: add Kernel level verbose option
enhancement
New feature or request
#11985
opened Feb 20, 2025 by
0400H
4 tasks done
Misc. bug: llama-cli '--log-disable' parameter omits response
bug-unconfirmed
#11983
opened Feb 20, 2025 by
nmandic78
Eval bug: CANNOT LINK EXECUTABLE "./llama-cli": library "libomp.so" not found: needed by main executable
bug-unconfirmed
#11979
opened Feb 20, 2025 by
Krallbe68
GGML to GGUF FAIL Quantized tensor bytes per row (5120) is not a multiple of Q2_K type size (84)
#11976
opened Feb 20, 2025 by
chokoon123
tensor 'blk.25.ffn_down.weight' has invalid ggml type 42 (NONE)
bug-unconfirmed
#11975
opened Feb 20, 2025 by
evaninf
Misc. bug: Sporadic MUL_MAT Failures in test-backend-ops for Nvidia backend
bug-unconfirmed
#11972
opened Feb 20, 2025 by
ShanoToni
Misc. bug: The KV cache is sometimes truncated incorrectly when making v1/chat/completions API calls
bug-unconfirmed
#11970
opened Feb 20, 2025 by
vnicolici
Eval bug: Ram boom after using llama-bench with cuda12.8 and deepseekr1q6
bug-unconfirmed
#11965
opened Feb 20, 2025 by
Xxianna
Misc. bug: Rpc-server does not use opencl backend on Android.
bug-unconfirmed
#11957
opened Feb 19, 2025 by
belog2867
Misc. bug: Segmentation fault when importing model to opencl buffer
bug-unconfirmed
#11953
opened Feb 19, 2025 by
zhouzengming
Eval bug: llama.cpp Incorrectly Parses and Reports sprintf Calls in C++ Code
bug-unconfirmed
#11951
opened Feb 19, 2025 by
perdubug
Misc. bug: hipGraph causes a crash in hipGraphDestroy
AMD GPU
Issues specific to AMD GPUs
#11949
opened Feb 18, 2025 by
IMbackK
Eval bug: Segmentation fault with Docker ROCm image "full-rocm"
bug-unconfirmed
#11947
opened Feb 18, 2025 by
JFingerle
Add option to build CUDA backend without Flash attention
enhancement
New feature or request
#11946
opened Feb 18, 2025 by
slaren
Feature Request: encoding_image_with_clip takes a long time when running minicpmv inference
enhancement
New feature or request
#11941
opened Feb 18, 2025 by
EnzhiZhou
4 tasks done
Enhancement: Improve ROCm performance on various quants (benchmarks included)
enhancement
New feature or request
#11931
opened Feb 17, 2025 by
cb88
4 tasks done