
Llama.cpp Vulkan support #126

Open
atirut-w opened this issue Jan 6, 2025 · 1 comment
Labels
enhancement New feature or request

Comments


atirut-w commented Jan 6, 2025

Using Llama.cpp with Vulkan support should provide GPU acceleration on a wide range of graphics cards. It is also easier to integrate than Ollama. For a good example, see LM Studio.
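For context, building llama.cpp with its Vulkan backend is roughly the following. This is a hedged sketch: the `-DGGML_VULKAN` CMake option matches recent llama.cpp releases (older ones used `-DLLAMA_VULKAN`), the Vulkan SDK must already be installed, and the model path is a placeholder.

```shell
# Build llama.cpp with the Vulkan backend (assumes the Vulkan SDK,
# including headers and glslc, is installed on the system).
git clone https://github.com/ggerganov/llama.cpp
cd llama.cpp
cmake -B build -DGGML_VULKAN=ON
cmake --build build --config Release

# At run time, -ngl offloads model layers to the GPU.
# models/model.gguf is a placeholder path, not a real file.
./build/bin/llama-cli -m models/model.gguf -ngl 99 -p "Hello"
```

Under Flatpak this is more involved, since the sandbox needs access to the host's Vulkan drivers.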

@FrancescoCaracciolo FrancescoCaracciolo added the enhancement New feature or request label Jan 6, 2025
FrancescoCaracciolo (Collaborator) commented:

LM Studio exposes an OpenAI-like API, so you can run it externally and use the OpenAI handler, if that solves your use case.
Maybe we can add a specialized handler that supports a few extra features, like the one for Ollama.
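To illustrate the suggested workaround: since LM Studio speaks the OpenAI chat-completions protocol, a generic OpenAI handler can talk to it by pointing the base URL at the local server. A minimal sketch, assuming LM Studio's default address `http://localhost:1234/v1` (an assumption; adjust to your setup) and a placeholder model name:

```python
# Sketch of calling an OpenAI-compatible local endpoint (e.g. LM Studio)
# using only the standard library. BASE_URL and the model name are
# assumptions, not values from this issue.
import json
import urllib.request

BASE_URL = "http://localhost:1234/v1"  # assumed LM Studio default


def build_chat_request(prompt, model="local-model"):
    """Build an OpenAI-style chat-completion payload."""
    return {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
        "temperature": 0.7,
    }


def send_chat_request(payload):
    """POST the payload to the local server (requires it to be running)."""
    req = urllib.request.Request(
        f"{BASE_URL}/chat/completions",
        data=json.dumps(payload).encode(),
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(req) as resp:
        return json.loads(resp.read())["choices"][0]["message"]["content"]


if __name__ == "__main__":
    payload = build_chat_request("Hello!")
    print(payload["messages"][0]["content"])
```

Because the protocol is the standard OpenAI one, the same code works against any compatible server, which is what makes the generic-handler approach attractive.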

For direct llama.cpp support, getting hardware acceleration to work under Flatpak always requires extra effort. I will check what I can do.
