koboldcpp

mirror of https://github.com/LostRuins/koboldcpp.git synced 2026-05-19 16:31:59 +00:00

Chenguang Li aa4711d369 CANN: Improve ACL graph matching (#16166)	2025-10-09 15:50:25 +08:00
* CANN: improve ACL graph matching
  Record `ne` and `nb` information for src tensors and include them in the graph matching check. This enhances the robustness of ACL graph matching by preventing incorrect matches when src tensors share the same data address but differ in shape or stride.
* CANN: add op_params match
ggml-blas	sync : whisper.cpp (ggml/1359)	2025-09-29 17:43:58 +03:00
ggml-cann	CANN: Improve ACL graph matching (#16166)	2025-10-09 15:50:25 +08:00
ggml-cpu	kleidiai: kernel interface refactoring (#16460)	2025-10-09 10:29:17 +03:00
ggml-cuda	Disable CUDA host buffers on integrated GPUs (#16308)	2025-10-08 20:21:46 +02:00
ggml-hip	HIP: Disable ROCWMMA fattn on CDNA when compiled against ROCWMMA 2.0.0 (#16221)	2025-10-01 23:09:25 +02:00
ggml-metal	metal : mark FA blocks (#16372)	2025-10-08 10:57:53 +03:00
ggml-musa	musa: update compile flags (#16265)	2025-10-02 16:29:56 +03:00
ggml-opencl	opencl: support pad_ext (#15888)	2025-09-30 10:45:45 -07:00
ggml-rpc	rpc : check src buffer when copying tensor (#16421)	2025-10-04 16:22:45 +03:00
ggml-sycl	[SYCL] refactor soft_max, add soft_max_back (#16472)	2025-10-09 10:25:11 +03:00
ggml-vulkan	vulkan: use a more appropriate amount of threads when generating shaders (#16418)	2025-10-04 22:04:27 +02:00
ggml-webgpu	ggml webgpu: profiling, CI updates, reworking of command submission (#16452)	2025-10-07 13:48:56 -07:00
ggml-zdnn	zdnn: refactor codebase + add docs (#16178)	2025-09-23 14:53:05 +08:00
CMakeLists.txt	cmake : fix static linking for OpenMP on Unix-like systems (#16031)	2025-09-18 23:07:18 +02:00
ggml-alloc.c	ggml : fix graph reallocation with multiple chunks (#16396)	2025-10-03 13:49:08 +02:00
ggml-backend-impl.h	rpc : add support for multiple devices (#16276)	2025-10-04 12:49:16 +03:00
ggml-backend-reg.cpp	ggml-backend : add root cause in error message if loading backend library fails (#16172)	2025-09-29 13:17:09 +02:00
ggml-backend.cpp	llama: print memory breakdown on exit (#15860)	2025-09-24 16:53:48 +02:00
ggml-common.h	llama : add gpt-oss (#15091)	2025-08-05 22:10:36 +03:00
ggml-impl.h	model : Apertus model implementation (#15852)	2025-10-02 20:43:22 +03:00
ggml-opt.cpp	finetune: SGD optimizer, more CLI args (#13873)	2025-08-14 12:03:57 +02:00
ggml-quants.c	ggml : fix uninitialized is_on_grid in quantize_row_iq3_xxs_impl (#15928)	2025-09-23 10:25:20 +02:00
ggml-quants.h	llama : add gpt-oss (#15091)	2025-08-05 22:10:36 +03:00
ggml-threading.cpp	ggml : build backends as libraries (#10256)	2024-11-14 18:04:35 +01:00
ggml-threading.h	remove CMAKE_WINDOWS_EXPORT_ALL_SYMBOLS (#10797)	2024-12-12 19:02:49 +01:00
ggml.c	ggml webgpu: add support for soft_max, optimize rms_norm (#16357)	2025-10-02 11:00:31 -07:00
ggml.cpp	ggml : Print backtrace on uncaught C++ exceptions (ggml/1232)	2025-06-01 13:43:57 +03:00
gguf.cpp	gguf: gguf_writer refactor (#15691)	2025-09-05 11:34:28 +02:00