kvcache-ai-ktransformers

mirror of https://github.com/kvcache-ai/ktransformers.git synced 2025-09-05 12:09:48 +00:00

History

qiyuxinlin 3a044e6b14 change test		2025-04-22 12:50:39 +00:00
..
AIME_2024	⚡ update compile option for avx512vpopcntdq	2025-03-06 12:18:04 +08:00
humaneval	fix flashinfer precision	2025-03-07 14:07:00 +00:00
.gitignore	⚡ release v0.2.3	2025-03-05 20:21:04 +08:00
dequant_gpu.py	[fix] format classes and files name	2024-08-15 10:44:59 +08:00
dequant_gpu_t.py	[fix] format classes and files name	2024-08-15 10:44:59 +08:00
function_call_test.py	roll back ktransformers backend, add max_tokens, max_completion_tokens param	2025-04-21 12:55:37 +00:00
mmlu_pro_test.py	⚡ add humaneval support	2025-03-04 20:54:49 +08:00
mmlu_test.py	change test	2025-04-22 12:50:39 +00:00
mmlu_test_multi.py	change test	2025-04-22 12:50:39 +00:00
score.py	fix params	2025-03-20 18:48:51 +08:00
test_client.py	change test	2025-04-22 12:50:39 +00:00
test_pytorch_q8.py	merge main; Add torch q8 linear	2025-03-14 05:52:07 -04:00
test_speed.py	change test	2025-04-22 12:50:39 +00:00
triton_fp8gemm_test.py	Add data loader to read special weights for fp8; Add special weight process script	2025-02-24 11:34:17 +00:00