Concedo
a530afa1e4
Merge commit ' 280345968d
' into concedo_experimental
...
# Conflicts:
# .devops/full-cuda.Dockerfile
# .devops/llama-cpp-cuda.srpm.spec
# .devops/main-cuda.Dockerfile
# .devops/nix/package.nix
# .devops/server-cuda.Dockerfile
# .github/workflows/build.yml
# CMakeLists.txt
# Makefile
# README.md
# ci/run.sh
# docs/token_generation_performance_tips.md
# flake.lock
# llama.cpp
# scripts/LlamaConfig.cmake.in
# scripts/compare-commits.sh
# scripts/server-llm.sh
# tests/test-quantize-fns.cpp
2024-04-07 20:27:17 +08:00
Concedo
7eccc5ffa6
change listen count, fix null
2024-02-16 16:01:24 +08:00
Concedo
5639c1a520
units (+2 squashed commit)
...
Squashed commit:
[166979d9] units coversion
[038dd5d4] get rid of all warnings (+1 squashed commits)
Squashed commits:
[6efd1e1b] get rid of all warnings
2024-01-20 23:53:21 +08:00
Concedo
db14de5c32
fossilize ggml library ver 3, to support ggjtv3
2024-01-20 10:49:25 +08:00
Concedo
76a3ba42eb
Merge branch 'master' into concedo_experimental
...
# Conflicts:
# ggml.c
# ggml.h
# requirements.txt
# tests/test-quantize-perf.cpp
2023-12-16 22:58:53 +08:00
Concedo
a6eb9b8010
Fix GPT2 not loading due to graph too small
2023-11-26 23:06:42 +08:00
Concedo
dc4078c039
fixed segfault with all non-gguf models
2023-11-20 22:31:56 +08:00
Concedo
5925c23d51
fix for RWKV
2023-09-01 17:02:11 +08:00
Concedo
b8372d4466
Merge branch 'master' into concedo_experimental
...
# Conflicts:
# .gitignore
# README.md
# tests/CMakeLists.txt
2023-08-24 15:21:24 +08:00
Concedo
523fc3be52
fixed rwkv, standardized new ctx usage
2023-07-10 20:05:53 +08:00
Concedo
2827920044
fix compile errors, rwkv not working
2023-07-10 18:23:25 +08:00
Concedo
ca9a11697c
possibly slower, but cannot use larger batches without modifying ggml library.
2023-07-04 00:35:02 +08:00
Concedo
ef3b8dc0d9
GPU accel for rwkv is slow, disable it
2023-07-02 00:41:46 +08:00
Concedo
e1a7042943
try out the new rwkv but it seems worse, may revert
2023-07-02 00:10:56 +08:00
Concedo
3ed3e7b7e2
reverted sequence mode for rwkv due to multiple issues with speed loss with bigger quantized models
2023-06-14 20:03:14 +08:00
Concedo
860fb026df
rwkv compile fix (+1 squashed commits)
...
Squashed commits:
[8b0ebb1] upgraded rwkv + added memory overheads + added state_out bufs
2023-06-12 23:04:40 +08:00
Concedo
97b39f875c
fixed fstat64 build error on mac
2023-05-29 15:50:07 +08:00
Concedo
55e0fbf024
wip integrating new rwkv
2023-05-27 22:45:28 +08:00