Commit graph

6350 commits

Author SHA1 Message Date
Concedo
ddfd82bca1 test 2024-12-09 23:54:33 +08:00
Concedo
0b30e45daa test 2024-12-09 23:48:30 +08:00
Concedo
c3bfece55e messed up 2024-12-09 23:44:11 +08:00
Concedo
2a4b37fe0d Revert "test"
This reverts commit 80a1fcaeaf.
2024-12-09 23:04:33 +08:00
Concedo
80a1fcaeaf test 2024-12-09 22:39:31 +08:00
Concedo
e2aa7d944d test 2024-12-09 22:18:44 +08:00
Concedo
264d802007 test 2024-12-09 22:00:54 +08:00
Concedo
f5b123173d undo 2024-12-09 17:32:10 +08:00
Concedo
1ffacc4844 test 2024-12-09 17:17:01 +08:00
Concedo
63c0be926a undo 2024-12-09 17:14:59 +08:00
Concedo
510e0377e6 ofast try fix 2024-12-09 17:12:32 +08:00
Concedo
4ac199b208 try fix sigill 2024-12-09 17:06:37 +08:00
Concedo
1bc987ba29 try fix illegal instruction 2024-12-09 16:56:28 +08:00
Concedo
7697252b17 edit 2024-12-07 17:48:31 +08:00
Concedo
f87087b286 check gcc ver 2024-12-07 17:18:30 +08:00
Concedo
e9dfe2cefd try using qemu to do the pyinstaller 2024-12-07 00:53:35 +08:00
Concedo
b411192db4 revert 2024-12-07 00:33:46 +08:00
Concedo
25b5301e5a try using qemu to do the pyinstaller 2024-12-07 00:24:35 +08:00
Concedo
58038cddc3 try using qemu to do the pyinstaller 2024-12-07 00:19:13 +08:00
Concedo
e9d2332dd8 improved tool calls and whisper 2024-12-06 14:34:31 +08:00
Concedo
836c06d91a minor edit 2024-12-06 00:37:38 +08:00
Concedo
746cb01843 remove test since it wont work on x64 2024-12-06 00:26:58 +08:00
Concedo
65a11451e3 fix missing bundled files 2024-12-06 00:21:08 +08:00
Concedo
fe72c8db9f CI for ARM should appear as ARM 2024-12-06 00:12:30 +08:00
Concedo
5cddd0a878 Merge branch 'concedo' into concedo_experimental 2024-12-05 23:58:31 +08:00
Concedo
ece96e19bf clean up makefile 2024-12-05 23:58:23 +08:00
Concedo
8d5bb06aeb test aarch64 ci workflow 2024-12-05 23:57:25 +08:00
Concedo
d0d1d922de handle and fix temp paths to chat completions adapter 2024-12-05 17:22:35 +08:00
Concedo
5106816eac drafted tokens debug prints 2024-12-05 17:05:20 +08:00
Concedo
2787fca6b4 refactored library selection, fixed ollama params 2024-12-05 16:47:52 +08:00
Concedo
52cc908f7f default trim_stop to true, which trims any tokens after a stop sequence and the stop sequence itself. This is potentially a breaking change. 2024-12-03 22:44:10 +08:00
Concedo
7d11d2946c only show warning if more than 1 moved tensor 2024-12-03 22:09:26 +08:00
Ikko Eltociear Ashimine
ed9e229372
docs: update README.md (#1244)
recomended -> recommended
2024-12-02 17:20:20 +08:00
Concedo
2ba5949054 updated sdcpp, also set euler as default sampler 2024-12-01 17:00:20 +08:00
Concedo
e93c2427b4 allow incompatible vocab in debugmode 2024-12-01 14:11:03 +08:00
Concedo
42228b9746 warning when selecting non gguf models 2024-12-01 13:35:51 +08:00
Concedo
d5e732f3ab updated lite 2024-12-01 01:49:09 +08:00
Concedo
b7cd210cd2 more linting with Ruff (+1 squashed commits)
Squashed commits:

[43802cfe2] Applied default Ruff linting
2024-12-01 01:23:13 +08:00
Concedo
409e393d10 fixed critical bug in image model loader 2024-11-30 23:28:24 +08:00
Concedo
153da19274 Merge branch 'upstream' into concedo_experimental
# Conflicts:
#	README.md
2024-11-30 16:59:25 +08:00
Concedo
0028e71993 special handling to resolve incomplete utf8 token sequences in qwen 2024-11-30 16:54:01 +08:00
Concedo
32ac3153e4 default speculative set to 8. added more adapter fields 2024-11-30 16:18:27 +08:00
Georgi Gerganov
3e0ba0e604
readme : remove old badge 2024-11-30 10:09:21 +02:00
Georgi Gerganov
abadba05be
readme : refresh (#10587)
* readme : refresh

* readme : move section [no ci]

* readme : clarify [no ci]

* readme : fixes [no ci]

* readme : more fixes [no ci]

* readme : simplify [no ci]

* readme : clarify GGUF
2024-11-30 09:47:07 +02:00
Eve
0533e7fb38
vulkan: Dynamic subgroup size support for Q6_K mat_vec (#10536)
* subgroup 64 version with subgroup add. 15% faster

scalable version

tested for subgroup sizes 16-128

* check for subgroup multiple of 16 and greater than 16

* subgroup sizes are always a power of 2 (https://github.com/KhronosGroup/GLSL/issues/45)

* force 16 sequential threads per block

* make 16 subgroup size a constant
2024-11-30 08:00:02 +01:00
Concedo
5353bfa983 updated lite 2024-11-30 12:26:20 +08:00
Concedo
557bcaf86e Merge branch 'upstream' into concedo_experimental
# Conflicts:
#	.clang-tidy
#	.github/workflows/build.yml
#	Makefile
#	Package.swift
#	common/CMakeLists.txt
#	examples/batched-bench/CMakeLists.txt
#	examples/batched/CMakeLists.txt
#	examples/convert-llama2c-to-ggml/CMakeLists.txt
#	examples/cvector-generator/CMakeLists.txt
#	examples/embedding/CMakeLists.txt
#	examples/eval-callback/CMakeLists.txt
#	examples/export-lora/CMakeLists.txt
#	examples/gbnf-validator/CMakeLists.txt
#	examples/gguf-split/CMakeLists.txt
#	examples/gguf/CMakeLists.txt
#	examples/gritlm/CMakeLists.txt
#	examples/imatrix/CMakeLists.txt
#	examples/infill/CMakeLists.txt
#	examples/llama-bench/CMakeLists.txt
#	examples/llava/CMakeLists.txt
#	examples/lookahead/CMakeLists.txt
#	examples/lookup/CMakeLists.txt
#	examples/main-cmake-pkg/CMakeLists.txt
#	examples/main/CMakeLists.txt
#	examples/parallel/CMakeLists.txt
#	examples/passkey/CMakeLists.txt
#	examples/perplexity/CMakeLists.txt
#	examples/quantize-stats/CMakeLists.txt
#	examples/quantize/CMakeLists.txt
#	examples/retrieval/CMakeLists.txt
#	examples/run/CMakeLists.txt
#	examples/save-load-state/CMakeLists.txt
#	examples/server/CMakeLists.txt
#	examples/simple-chat/CMakeLists.txt
#	examples/simple/CMakeLists.txt
#	examples/speculative-simple/CMakeLists.txt
#	examples/speculative/CMakeLists.txt
#	examples/tokenize/CMakeLists.txt
#	ggml/CMakeLists.txt
#	ggml/src/CMakeLists.txt
#	ggml/src/ggml-backend.cpp
#	ggml/src/ggml-cpu/CMakeLists.txt
#	ggml/src/ggml-vulkan/vulkan-shaders/CMakeLists.txt
#	pocs/vdot/CMakeLists.txt
#	src/CMakeLists.txt
#	src/unicode.cpp
#	tests/test-sampling.cpp
2024-11-30 12:24:51 +08:00
Concedo
697ca70115 temp checkpoint 2024-11-30 12:13:20 +08:00
Concedo
ec95241e38 temp checkpoint 2024-11-30 11:59:27 +08:00
Concedo
0c8939be19 temp checkpoint 2024-11-30 11:57:28 +08:00