kvcache-ai-ktransformers/ktransformers
2024-09-05 05:29:23 +00:00
..
configs [feature] release 0.1.3 2024-08-28 16:11:43 +00:00
ktransformers_ext [feature] release 0.1.3 2024-08-28 16:11:43 +00:00
models [feature] release 0.1.3 2024-08-28 16:11:43 +00:00
operators fix qlen > 1000 mask is none error 2024-09-02 02:58:10 +00:00
optimize [fix] bugs about Qwen57B, install requirement, Dockerfile 2024-08-30 09:51:32 +00:00
server [feature] release 0.1.3 2024-08-28 16:11:43 +00:00
tests [fix] format classes and files name 2024-08-15 10:44:59 +08:00
util Fix: the tokens return by prefill_and_generate 2024-09-05 05:29:23 +00:00
website Initial commit 2024-07-27 16:06:58 +08:00
__init__.py update yaml example; update version idx; update docker file 2024-08-29 22:39:20 +08:00
local_chat.py Fix cannot offload whole layer in cpu 2024-08-29 19:10:14 +08:00