kvcache-ai-ktransformers/ktransformers
2025-02-25 16:53:21 +08:00
..
configs update rope calculation; update modeling.py; update gate for moe 2025-02-01 07:32:21 +00:00
ktransformers_ext Merge remote-tracking branch 'upstream/develop-0.2.2' into support-fp8 2025-02-24 11:58:10 +00:00
models optimize gguf dequant, save mem, support Q2_K 2025-02-22 06:13:01 +00:00
operators Merge pull request #657 from kvcache-ai/feat-absorb-for-long-prefill 2025-02-25 16:53:21 +08:00
optimize Merge pull request #657 from kvcache-ai/feat-absorb-for-long-prefill 2025-02-25 16:53:21 +08:00
server Merge pull request #657 from kvcache-ai/feat-absorb-for-long-prefill 2025-02-25 16:53:21 +08:00
tests Merge remote-tracking branch 'upstream/develop-0.2.2' into support-fp8 2025-02-24 11:58:10 +00:00
util Merge pull request #657 from kvcache-ai/feat-absorb-for-long-prefill 2025-02-25 16:53:21 +08:00
website : refactor local_chat and fix message slice bug in server 2024-11-04 14:02:19 +08:00
__init__.py 🔖 release v0.2.1.post1 2025-02-18 20:45:48 +08:00
local_chat.py Merge pull request #657 from kvcache-ai/feat-absorb-for-long-prefill 2025-02-25 16:53:21 +08:00