kvcache-ai-ktransformers/ktransformers/models
2025-02-14 19:40:36 +08:00
..
__init__.py Initial commit 2024-07-27 16:06:58 +08:00
configuration_deepseek.py Initial commit 2024-07-27 16:06:58 +08:00
configuration_deepseek_v3.py update rope calculation; update modeling.py; update gate for moe 2025-02-01 07:32:21 +00:00
configuration_llama.py [feature] release 0.1.3 2024-08-28 16:11:43 +00:00
custom_cache.py linux support triton MLA kernel 2025-02-14 11:38:55 +00:00
modeling_deepseek.py [ADD] support multi-gpu qlen>1 q5_k 2024-08-12 11:41:26 +00:00
modeling_deepseek_v3.py update FAQ 2025-02-12 08:50:58 +00:00
modeling_llama.py [feature] release 0.1.3 2024-08-28 16:11:43 +00:00
modeling_mixtral.py [ADD] support multi-gpu qlen>1 q5_k 2024-08-12 11:41:26 +00:00
modeling_qwen2_moe.py Initial commit 2024-07-27 16:06:58 +08:00