Commit graph

436 commits

Author SHA1 Message Date
kallewoof b7b3e0d2a7 add adapter tests for autoguess (#1654) 2025-07-25 22:14:18 +08:00
kallewoof ff8f156fa0 AutoGuess tests (#1650)
* whitespace

* AutoGuess remove dot suffix in names

* .gitignore update

* test: added autoguess test suite

* github workflow to run autoguess test when appropriate

* git clone unavailable tokenizer configs rather than committing to repo

* fix link to included tokenizer configs

* skip storing downloaded tokenizer configs

* typo

* minor fixes

* clean-up

* limit workflow to trigger from experimental branch

---------

Co-authored-by: Concedo <39025047+LostRuins@users.noreply.github.com>
2025-07-25 19:21:00 +08:00
Concedo 2a59adce0f stay on macos 14 2025-07-16 15:47:33 +08:00
Concedo aa3623dcce remove unwanted workflow 2025-07-13 23:43:56 +08:00
Concedo 8cebec5128 Merge branch 'upstream' into concedo_experimental
# Conflicts:
#	CMakePresets.json
#	README.md
#	common/CMakeLists.txt
#	ggml/src/ggml-cann/ggml-cann.cpp
#	ggml/src/ggml-opencl/CMakeLists.txt
#	ggml/src/ggml-opencl/ggml-opencl.cpp
#	ggml/src/ggml-sycl/ggml-sycl.cpp
#	scripts/sync-ggml.last
#	tests/test-backend-ops.cpp
#	tools/run/CMakeLists.txt
2025-07-13 23:39:41 +08:00
Aman Gupta 11ee0fea2a Docs: script to auto-generate ggml operations docs (#14598)
* Docs: script to auto-generate ggml operations docs

* Review: formatting changes + change github action

* Use built-in types instead of typing

* docs : add BLAS and Metal ops

---------

Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
2025-07-10 23:29:01 +08:00
Jeff Bolz 53903ae6fa vulkan: increase timeout for CI (#14574) 2025-07-08 09:38:31 +02:00
Georgi Gerganov d4cdd9c1c3 ggml : remove kompute backend (#14501)
ggml-ci
2025-07-03 07:48:32 +03:00
Rotem Dan f3ed38d793 Set RPATH to "@loader_path" / "$ORIGIN" to ensure executables and dynamic libraries search for dependencies in their origin directory. (#14309) 2025-07-02 18:37:16 +02:00
Georgi Gerganov de56944147 ci : disable fast-math for Metal GHA CI (#14478)
* ci : disable fast-math for Metal GHA CI

ggml-ci

* cont : remove -g flag

ggml-ci
2025-07-01 18:04:08 +03:00
Sigbjørn Skjæret 6609507a91 ci : fix windows build and release (#14431) 2025-06-28 09:57:07 +02:00
bandoti ce82bd0117 ci: add workflow for relocatable cmake package (#14346) 2025-06-23 15:30:51 -03:00
Jeff Bolz bf2a99e3cb vulkan: update windows SDK in release.yml (#14344) 2025-06-23 15:44:48 +02:00
Jeff Bolz 3a9457df96 vulkan: update windows SDK in CI (#14334) 2025-06-23 10:19:24 +02:00
Concedo abc1d8ac25 better way of checking for avx2 support 2025-06-22 22:56:50 +08:00
Concedo 52dcfe42d6 try auto selecting correct backend while checking intrinsics 2025-06-22 18:16:02 +08:00
Concedo ce58d1253f fixed build and workflow 2025-06-21 00:56:27 +08:00
Diego Devesa 6adc3c3ebc llama : add thread safety test (#14035)
* llama : add thread safety test

* llamafile : remove global state

* llama : better LLAMA_SPLIT_MODE_NONE logic

when main_gpu < 0 GPU devices are not used

---------

Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
2025-06-16 08:11:43 -07:00
bandoti 0dbcabde8c cmake: clean up external project logic for vulkan-shaders-gen (#14179)
* Remove install step for vulkan-shaders-gen

* Add install step to normalize msvc with make

* Regenerate modified shaders at build-time
2025-06-16 10:32:13 -03:00
Concedo 5cdb2d3fc6 cleanup 2025-06-11 01:35:40 +08:00
Jeff Bolz 652b70e667 vulkan: force device 0 in CI (#14106) 2025-06-10 10:53:47 -05:00
Concedo 8386546e08 Switched VS2019 for revert cu12.1 build, hopefully solves dll issues
try change order (+3 squashed commit)

Squashed commit:

[457f02507] try newer jimver

[64af28862] windows pyinstaller shim. the final loader will be moved into the packed directory later.

[0272ecf2d] try alternative way of getting cuda toolkit 12.4 since jimver wont work, also fix rocm
try again (+3 squashed commit)

Squashed commit:

[133e81633] try without pwsh

[4d99cefba] try without pwsh

[bdfa91e7d] try alternative way of getting cuda toolkit 12.4, also fix rocm
2025-06-10 23:08:02 +08:00
Diego Devesa 7f4fbe5183 llama : allow building all tests on windows when not using shared libs (#13980)
* llama : allow building all tests on windows when not using shared libraries

* add static windows build to ci

* tests : enable debug logs for test-chat

---------

Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
2025-06-09 20:03:09 +02:00
Concedo 28b35ca879 allow wmma flag for rocm 2025-06-10 01:23:48 +08:00
Concedo deece4be69 missed a build target 2025-06-09 17:05:56 +08:00
Concedo 6c5c8be48d try to make rocm work for the github ci, requires disabling rocwmma 2025-06-08 21:52:29 +08:00
Concedo 7132d6b15c test rocm rolling (+1 squashed commits)
Squashed commits:

[43c8f7fc6] test rocm rolling (+4 squashed commit)

Squashed commit:

[16a60aa77] test clobber 4

[a6c866450] test clobber 3

[9322f17f6] test clobber 2

[b7a420cbe] testing clobber
2025-06-08 15:33:05 +08:00
吴小白 5787b5da57 ci: add LoongArch cross-compile build (#13944) 2025-06-07 10:39:11 -03:00
Concedo abc272d89f breaking change: standardize ci binary names 2025-06-07 00:40:46 +08:00
Concedo 6effb65cfe change singleinstance order 2025-06-06 21:20:30 +08:00
Concedo 8b141d8647 stick to cu12.1 for linux for now 2025-06-06 17:38:28 +08:00
Concedo eec5a8ad16 breaking change: due to cuda12 upgrade, release filenames will change. standardize them to windows naming for the future. (+1 squashed commits)
Squashed commits:

[75842919a] cuda12.4 test
2025-06-06 14:02:34 +08:00
Concedo 50a27793d3 upgrade windows runners to windows 2022, cu11 still uses vs2019
this should finally work (+21 squashed commit)

Squashed commit:

[5edac5b59] Revert "quick dbg"

This reverts commit fd62a997cc6684bb89242d5e7b0ae2aed83fd27f.

[fd62a997c] quick dbg

[bcccae7e6] sanity check 2

[568e2eb08] sanity check

[2f30d573a] please work 2

[cf8765221] please work

[c535e60d9] try a small trick

[d4ba79b80] 2022 test

[3f146b000] t2

[4a3b9a9b4] revert and test

[4bdc9a149] reverted test2

[5081cb4a3] reverted test

[ea9a826f3] broken test

[3c11ae389] compare 2019

[8ecec4fec] not for cu12

[0be964f3a] added vs2019 for the other runners

[5d24641cb] debugging 4

[1dee79207] debugging 3

[ab172f133] more debugging 2

[b1a895e84] more debugging

[5d21d8bd0] vs2019 setup
2025-06-06 14:02:34 +08:00
Concedo a341188f84 add install for vs2019 2025-06-05 10:32:57 +08:00
Concedo a74d8669b3 try hardcoded path (+1 squashed commits)
Squashed commits:

[711b43d9d] let's see if VS2019 can work
2025-06-05 10:26:02 +08:00
Diego Devesa 2589ad3704 ci : remove cuda 11.7 releases, switch runner to windows 2022 (#13997) 2025-06-04 15:37:40 +02:00
Diego Devesa 482548716f releases : use dl backend for linux release, remove arm64 linux release (#13996) 2025-06-04 13:15:54 +02:00
Concedo f3bb947a13 cuda use wmma flash attention for turing (+1 squashed commits)
Squashed commits:

[3c5112398] 117 (+10 squashed commit)

Squashed commit:

[4f01bb2d4] 117 graphs 80v

[7549034ea] 117 graphs

[dabf9cb99] checking if cuda 11.5.2 works

[ba7ccdb7a] another try cu11.7 only

[752cf2ae5] increase aria2c download log rate

[dc4f198fd] test send turing to wmma flash attention

[496a22e83] temp build test cu11.7.0

[ca759c424] temp build test cu11.7

[c46ada17c] test build: enable virtual80 for oldcpu

[3ccfd939a] test build: with cuda graphs for all
2025-06-01 11:41:45 +08:00
bandoti d98f2a35fc ci: disable LLAMA_CURL for Linux cross-builds (#13871) 2025-05-28 15:46:47 -03:00
henk717 b8883e254a KoboldCpp.sh updates (#1562)
* YR makefile upstream

* Create make_portable_rocm_libs.sh

* update makefile, support llama portable, ditch all unnecessary changes

* Delete make_portable_rocm_libs.sh should not be needed

* koboldcpp.sh updates

* Small rocm fixes

* ROCm is now a cuda version not a command

* Don't commit temp file

* Don't commit temp file

* 1200 has errors, removing it for now

* Only rebuild rocm with rebuild

* Update kcpp-build-release-linux.yaml

* Fix rocm filename

* ROCm Linux CI

* We need more diskspace

* Workaround for lockfile getting stuck

Why do I have to do hacks like this....

* Update kcpp-build-release-linux-rocm.yaml

* Dont apt update rocm

You don't allow us to apt update? Better not break things github!

* Container maybe?

* Turns out we aren't root, so we use sudo

* Cleanup ROCm CI PR

* Build for Runpods GPU

* We also need rocblas

* More cleanup just in case

* Update kcpp-build-release-linux-rocm.yaml

---------

Co-authored-by: LostRuins Concedo <39025047+LostRuins@users.noreply.github.com>
2025-05-26 15:24:49 +08:00
Diego Devesa a2d02d5793 releases : bundle llvm omp library in windows release (#13763) 2025-05-25 00:55:16 +02:00
Diego Devesa 17fc817b58 releases : enable openmp in windows cpu backend build (#13756) 2025-05-24 22:27:03 +02:00
Concedo 0dca953d78 removed winget workflow 2025-05-24 16:40:39 +08:00
Concedo 55cc9acec5 Merge branch 'upstream' into concedo_experimental
# Conflicts:
#	.github/workflows/release.yml
#	README.md
#	ggml/src/ggml-cann/aclnn_ops.cpp
#	ggml/src/ggml-cann/ggml-cann.cpp
#	tools/mtmd/CMakeLists.txt
#	tools/mtmd/clip.cpp
#	tools/mtmd/clip.h
2025-05-24 12:10:36 +08:00
Diego Devesa b775345d78 ci : enable winget package updates (#13734) 2025-05-23 23:14:00 +03:00
Diego Devesa a70a8a69c2 ci : add winget package updater (#13732) 2025-05-23 22:09:38 +02:00
Diego Devesa 3079e9ac8e release : fix windows hip release (#13707)
* release : fix windows hip release

* make single hip release with multiple targets
2025-05-23 00:21:37 +02:00
Concedo fdca5ba71e declutter 2025-05-22 22:58:47 +08:00
Concedo 8bd6f9f9ae added a simple cross platform launch script for unpacked dirs 2025-05-22 22:09:46 +08:00
Diego Devesa d643bb2c79 releases : build CPU backend separately (windows) (#13642) 2025-05-21 22:09:57 +02:00