Vulkan backend & ggml-vulkan-shaders.hpp #7323

mjgarton · 2024-05-16T10:29:32Z

mjgarton
May 16, 2024

I am just wondering how the contents of ggml-vulkan-shaders.hpp was generated? I would like to learn about how the vulkan backend works.

If I understand correctly, this is compiled spirv shader code. Why store the compiled code as bytes here, rather than storing source and generating the binary at build time? Would this improve maintainability & transparency?

PS. I am fairly new to llama.cpp, GPUs, vulkan, shaders and ML in general, so apologies if my questions are somewhat confused.

Answered by teleprint-me

May 23, 2024

The shaders are generated using the ggml_vk_generate_shaders.py script. You shouldn't need to use it unless the source is modified. The shaders are included to smooth out compilation and runtime.

You can see the rationale behind most of the design decisions in the initial PR #2059.

View full answer

teleprint-me · 2024-05-23T03:29:15Z

teleprint-me
May 23, 2024

The shaders are generated using the ggml_vk_generate_shaders.py script. You shouldn't need to use it unless the source is modified. The shaders are included to smooth out compilation and runtime.

You can see the rationale behind most of the design decisions in the initial PR #2059.

0 replies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Vulkan backend & ggml-vulkan-shaders.hpp #7323

{{title}}

{{editor}}'s edit

{{editor}}'s edit

Replies: 1 comment

{{title}}

Select a reply

Vulkan backend & ggml-vulkan-shaders.hpp #7323

mjgarton May 16, 2024

Replies: 1 comment

teleprint-me May 23, 2024

mjgarton
May 16, 2024

teleprint-me
May 23, 2024