koboldcpp

mirror of https://github.com/LostRuins/koboldcpp.git synced 2026-05-21 18:52:02 +00:00

History

Gaurav Garg fd6ae4ca1c Tensor-parallel: Fix delayed AllReduce on Gemma-4 MoE (#22129 ) * Fix delayed AllReduce on Gemma-4 MoE Skip forward past nodes that don't consume the current one, and allow a chain of MULs. * Check for all sources before skipping nodes * Address review comments		2026-04-20 18:25:39 +02:00
..
cmake	ggml: backend-agnostic tensor parallelism (experimental) (#19378 )	2026-04-09 16:42:19 +02:00
include	CUDA: manage NCCL communicators in context (#21891 )	2026-04-15 15:58:40 +02:00
src	Tensor-parallel: Fix delayed AllReduce on Gemma-4 MoE (#22129 )	2026-04-20 18:25:39 +02:00
.gitignore	vulkan : cmake integration (#8119 )	2024-07-13 18:12:39 +02:00
CMakeLists.txt	cmake: remove CMP0194 policy to restore MSVC builds (#21934 )	2026-04-19 10:25:05 +03:00